Web scraping is a term used to describe the process of downloading and extracting structured data from the web using a program or algorithm. It's a useful skill to have when you need to extract data from a website that does not have a public API.

The tutorials and articles on TestDriven.io teach how to leverage parallelism and concurrency in order to speed up web scrapers that scrape large amounts of data.

Latest Posts (2)

Concurrent Web Scraping with Selenium Grid and Docker Swarm

Posted by Michael Herman
Last updated on March 31st, 2022

Run a Python and Selenium-based web scraper in parallel with Selenium Grid and Docker Swarm.

DevOps Docker Web Scraping

Building a Concurrent Web Scraper with Python and Selenium

Posted by Caleb Pollman
Last updated on December 22nd, 2021

Speed up a Python web scraping and crawling script with multithreading via the concurrent.futures module.

Web Scraping

Featured Course

Developing a Real-Time Taxi App with Django Channels and React

Learn how to create a ride-sharing app with Django Channels, React, and Docker. Along the way, you'll learn to manage client/server communication with Django Channels and WebSockets, develop a front-end with React, build a RESTful API with Django REST Framework, and test your application using the Cypress testing framework.

Buy Now $45 View Course

Featured Course

Developing a Real-Time Taxi App with Django Channels and React

Buy Now $45 View Course

Description

Latest Posts (2)

Developing a Real-Time Taxi App with Django Channels and React

Developing a Real-Time Taxi App with Django Channels and React