The Custom Harvester for Scrapebox allows you to add new custom engines to scrapebox to harvest urls from.
It comes preloaded with several engines, but you get many options when adding new engines. You can teach it multiple elements, add custom header data, user agents etc...
It allows you to also auto harvest and test proxies and refresh them on the fly. You can specify the number of proxies needed to start a harvest run, then how often scrapebox should get new proxies. All of this will happen automatically after you hit "Go" in the new harvester.
Of course you can also use your own proxy sources and private/share proxies or a combination of all of the above. The new harvester brings many new elements into play and adds a wide array of scraping functionality.
At the time of this recording the engines that you can scrape with scrapebox are:
Google AOL Yahoo Yandex Lycos USA Sky.com Ask.com IXQuick TalkTalk.com Dogpile Bing |
No comments:
Post a Comment