RCrawler: An R package for parallel web crawling and scraping
RCrawler is a contributed R package for domain-based web crawling and content scraping. As the first implementation of a parallel web crawler in the R environment, RCrawler can crawl, parse, store pages, extract contents, and produce data that can be directly employed for web content mining applicat...
Main Authors: | Salim Khalil, Mohamed Fakir |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2017-01-01
|
Series: | SoftwareX |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352711017300110 |
Similar Items
-
Evaluation of web scraping methods : Different automation approaches regarding web scraping using desktop tools
by: Oucif, Kadday
Published: (2016) -
Web Scraping using Machine Learning
by: Carle, Victor
Published: (2020) -
Less Detectable Web Scraping Techniques
by: Färholt, Fredric
Published: (2021) -
Towards End-User Web Scraping for Customization
by: Katongo, Kapaya, et al.
Published: (2022) -
Automated Data Collection with R - A Practical Guide to Web Scraping and Text Mining
by: Stefano M. Iacus
Published: (2015-11-01)