Web Scraping: State-of-the-Art and Areas of Application
2019pp. 6040–6042
Citations Over TimeTop 10% of 2019 papers
Rabiyatou Diouf, Edouard Ngor Sarr, Ousmane Sall, Babiga Birregah, Mamadou Bousso, Seny Ndiaye Mbaye
Abstract
Main objective of Web Scraping is to extract information from one or many websites and process it into simple structures such as spreadsheets, database or CSV file. However, in addition to be a very complicated task, Web Scraping is resource and time consuming, mainly when it is carried out manually. Previous studies have developed several automated solutions. The purpose of this article is to revisit the different existing Web Scraping approaches, categories, and tools, but also its areas of application.
Related Papers
- → Web Engineering Revisited(2008)12 cited
- Analyzing modern web applications to recognize features-based web engineering methods(2015)
- → Engineering Semantic Web Applications by Using Object-Oriented Paradigm(2010)
- Fuzzification of Web Objects: A Semantic Web Mining Approach(2012)