Category Archives: WebHarvy Feature

WebHarvy 5.2 | UI revamp + Oracle db support

Changes in 5.2 are mainly related to user interface and experience. The most visible change is the introduction of the ribbon menu system for providing easy access to most software features. In addition to the main interface, other windows like Scheduler … Continue reading

Posted in Release update, WebHarvy, WebHarvy Feature | Tagged , , , , , | Leave a comment

WebHarvy 4.1.5.141 released

The main changes in this release are :- Pagination via JavaScript – see https://www.webharvy.com/tour3.html#JS This powerful feature is the main highlight of this release. When all other methods of pagination fails, this method, where you can directly provide a JavaScript … Continue reading

Posted in Release update, WebHarvy, WebHarvy Feature | Tagged , , , , , , , , | Leave a comment

WebHarvy : 2 new methods of handling pagination

The latest version of WebHarvy Web Scraper supports 2 new types of pagination styles for scraping data from multiple pages of websites. Pages where pagination links are shown in sets In these types of pages the pagination links are provided in sets. … Continue reading

Posted in WebHarvy Feature | Tagged , , , , , | Leave a comment

WebHarvy version 3.3 released !

3.3 version of WebHarvy was released on June 16, 2014. The major changes are : Fixed issues related to URL encoding in Category Scraping Added option to disable automatic pattern (data field repetition) detection in start page (more details) Option … Continue reading

Posted in Release update, Uncategorized, WebHarvy, WebHarvy Feature | Tagged , , , | Leave a comment

Use ‘Capture Following Text’ option to scrape data from details pages

While extracting data from details pages (page reached by navigating a link from the start page), it is recommended that the ‘Capture Following Text‘ option be used whenever possible to correctly and consistently scrape data. This is because the layout … Continue reading

Posted in WebHarvy, WebHarvy Feature | Tagged , , , | Leave a comment

Scrape HTML

WebHarvy allows you  to scrape HTML of page contents in addition to plain text. In the Capture window, click ‘More Options’ button and select the ‘Capture HTML’ option to scrape the HTML of the selected content. To capture only a … Continue reading

Posted in WebHarvy, WebHarvy Feature | Tagged , , | Leave a comment

Scraping hidden (click to display) fields using WebHarvy

Certain web pages require that you to click on a link or button for the data to be displayed. There are many websites where email addresses or phone numbers are partially displayed, they will be fully displayed only if you … Continue reading

Posted in WebHarvy, WebHarvy Feature | Tagged , , | Leave a comment