Deleted member 75160
Reviewer
there are multiple packages and implementation will take not more than 10 mins....however, problem is cleaning the data (80% of the task) for ex. some monger use "rajmehel" while some ""rajmahel" also, getting program to understand hindi words is also a task........it's just a very minor example, when you work you will come across many abnormalities, that said world is not a perfect place, have to work with whtever we have and here it's more than sufficient data to scrape for analysis.......If memory serves me right, python had a library called Beautiful Soup (or something like that) that can be used for parsing HTML and scrapping websites. Lets see if that can be put to use on MassagePlanet.net. That would make it easy to scrape the entire string of conversations.
Automation for reading new FR is not a problem, as the algos i am using is used for twitter feeds which are updated realtime and algo automatically takes it into account.......if someone wants to work on basu1970 questions lemme know i can forward the string for extracted data for first 1500 pages of this forum.