Scrape Dynamic Sites with Splash and Python Scrapy – From Docker Installation to Scrapy Project

This tutorial shows how to scrape dynamic sites with Splash and Scrapy. It covers every step, from installing Docker to writing the code for the Scrapy project.

WHAT IS SPLASH:
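Splash is a lightweight, scriptable headless browser with an HTTP API, aimed specifically at web scraping. It renders JavaScript, so content that a page loads dynamically becomes available to the scraper.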
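
To make the HTTP API concrete, here is a minimal sketch that builds a request URL for Splash's render.html endpoint, which returns the page HTML after JavaScript has run. It assumes Splash is listening on its default port 8050 on localhost. The helper name splash_render_url and the example target URL are made up for illustration.

```python
from urllib.parse import urlencode


def splash_render_url(target_url, wait=0.5, splash_base="http://localhost:8050"):
    """Build a URL for Splash's render.html endpoint.

    target_url  -- the page Splash should load and render
    wait        -- seconds Splash waits after page load, giving scripts time to run
    splash_base -- where Splash is reachable (assumed: local Docker container)
    """
    query = urlencode({"url": target_url, "wait": wait})
    return f"{splash_base}/render.html?{query}"


# Hypothetical target page, for illustration only.
print(splash_render_url("https://example.com/page"))
```

Opening the resulting URL in a browser, or fetching it with any HTTP client, is a quick way to check that Splash is running before wiring it into Scrapy.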

WHAT IS SCRAPY:
Scrapy is a Python framework for fast, efficient web scraping and crawling. If you are a beginner, start here: https://youtu.be/y8l14bys7Nw

SEE ALSO:
Using Selenium: https://youtu.be/yzCLL5c_4PA
Scraping Dynamic Sites without Splash/Selenium: https://youtu.be/Pu3gmdWsLYc

LINKS IN THIS VIDEO:
Source: https://github.com/eupendra/scrapy_splash_demo
scrapy-splash Docs: https://github.com/scrapy-plugins/scrapy-splash

CHAPTERS:
00:00 Introduction
00:30 Install Docker
01:30 Downloading Splash Image
02:10 Setting up Splash
04:25 Install scrapy-splash
04:55 Create Scrapy Project
09:12 Changing the Project for Splash
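
As a rough sketch of the last chapter, these are the settings.py entries that the scrapy-splash README asks for. It assumes Splash is already running locally in Docker (for example via docker run -p 8050:8050 scrapinghub/splash) and that scrapy-splash has been installed with pip. The exact settings used in the video may differ, so check the linked source repository.

```python
# settings.py additions for scrapy-splash (values follow the scrapy-splash README).

# Address of the running Splash instance (assumed: local Docker container).
SPLASH_URL = "http://localhost:8050"

# Route requests through Splash and handle cookies for rendered pages.
DOWNLOADER_MIDDLEWARES = {
    "scrapy_splash.SplashCookiesMiddleware": 723,
    "scrapy_splash.SplashMiddleware": 725,
    "scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware": 810,
}

# Avoid storing duplicate Splash arguments with every queued request.
SPIDER_MIDDLEWARES = {
    "scrapy_splash.SplashDeduplicateArgsMiddleware": 100,
}

# Make duplicate filtering and HTTP caching aware of Splash arguments.
DUPEFILTER_CLASS = "scrapy_splash.SplashAwareDupeFilter"
HTTPCACHE_STORAGE = "scrapy_splash.SplashAwareFSCacheStorage"
```

With these in place, a spider typically yields SplashRequest(url, callback=self.parse, args={"wait": 0.5}) from scrapy_splash instead of scrapy.Request, so the response passed to the callback contains the JavaScript-rendered HTML.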

#scrapy #splash

