International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 135 - Number 11
Year of Publication: 2016
Authors: Neeraj Raheja, Vijay Kumar Katiyar
DOI: 10.5120/ijca2016908537
Neeraj Raheja, Vijay Kumar Katiyar. Performance Comparison of Web Data Extraction Techniques. International Journal of Computer Applications. 135, 11 (February 2016), 6-13. DOI=10.5120/ijca2016908537
Websites today contain large amounts of data to meet the requirements of their users, and web data extraction systems help users extract the required data from such websites. The basic techniques used for web data extraction are manual extraction and web wrappers. Web wrapper techniques are further divided into wrapper induction and automatic approaches, and many methods are available that use one or the other. This research work provides a performance comparison of the manual, wrapper induction, and automatic approaches, represented by the following methods: manual extraction (by manual effort), nX1 (web wrapper induction), and DEPTA and MDR (automatic). The results are compared on the basis of parameters such as precision, recall, F-measure, and data extraction time.
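The quality parameters named above can be illustrated with a minimal sketch. It assumes the standard information-retrieval definitions (precision as the fraction of extracted records that are correct, recall as the fraction of relevant records that were extracted, and F-measure as their harmonic mean); the abstract does not state the exact formulas used in the paper. The function name `extraction_metrics` and the record identifiers are hypothetical and serve only as illustration.

```python
def extraction_metrics(extracted, relevant):
    """Return (precision, recall, f_measure) for one extraction run.

    Assumption: standard IR definitions, not necessarily the paper's own.
    `extracted` holds the records a method returned; `relevant` holds the
    records that should have been returned (the ground truth).
    """
    extracted = set(extracted)
    relevant = set(relevant)
    correct = len(extracted & relevant)  # records extracted correctly

    # Guard against empty sets to avoid division by zero.
    precision = correct / len(extracted) if extracted else 0.0
    recall = correct / len(relevant) if relevant else 0.0

    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Hypothetical example: 4 records extracted, 5 relevant, 3 in common.
extracted_records = {"r1", "r2", "r3", "r4"}
relevant_records = {"r1", "r2", "r3", "r5", "r6"}
precision, recall, f_measure = extraction_metrics(extracted_records, relevant_records)
print(f"precision={precision:.2f} recall={recall:.2f} F={f_measure:.2f}")
```

Data extraction time, the fourth parameter, would be measured separately, for example by timing each method on the same set of pages.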