Web Scraping with Python

About this Course

In this 2-hour long project-based course, you will learn how to analyze complex HTML structures and identify the relevant data to be extracted using Scrapy and XPath. You will apply the concepts of web scraping, including setting up a Scrapy project, generating spiders, and using XPath queries to extract data from websites that do not provide an API. Additionally, you will evaluate the effectiveness and efficiency of your scraping code, considering factors such as changing webpage structures, scalability, and coding defensively to ensure robustness. The course includes hands-on labs where you will create a spider and parse complex HTML, allowing you to practice and reinforce the concepts learned.

Created by: Duke University


Related Online Courses

The course begins with a discussion about data: how to improve data quality and perform exploratory data analysis. We describe Vertex AI AutoML and how to build, train, and deploy an ML model... more
Foundational Skills and Knowledge Required of High Performing Salespeople. This course takes the mystery out of sales call preparation, execution, and follow up. Learners will gain a clear... more
In this course, you will learn the fundamentals of database concepts in a cloud environment, get basic skilling in cloud data services, and build your foundational knowledge of cloud data services... more
PowerPoint 365 offers many advanced options that will really make your presentations shine and stand out above the rest. In this project, you will learn how to use animations and transitions in... more
This course introduces you to the nonprofit sector, nonprofit organizations, and the concepts of leadership and governance. While this course has been developed with North American culture in mind,... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL