Posts

Showing posts from July, 2022

Scrapy Framework

Image
( Useful : 1] Click  2] Click  3] Click  ) Web Crawling W eb scraping is about extracting the data  from one or more websites. While  crawling is about finding or discovering URLs  or links on the web.  Usually, in web data extraction projects, you need to combine crawling and scraping . So you first crawl - or discover - the URLs, download the HTML files, and then scrape the data from those files. In web scraping, it's  all about the data . The data fields you want to extract from specific websites. And it's a big difference because with scraping you usually know the target websites. With crawling, you probably don't know the specific URLs and you probably don't know the domains either. And this is the reason you crawl: you want to find the URLs. So that you can do something with them later.  With web crawling the output is a lot more simple because it's just a list of URLs , you can have other fields as well but t...