Web Scraping with Python

About this Course

In this 2-hour, project-based course, you will learn how to analyze complex HTML structures and identify the data to extract using Scrapy and XPath. You will apply core web scraping concepts, including setting up a Scrapy project, generating spiders, and writing XPath queries to extract data from websites that do not provide an API. You will also evaluate how effective and efficient your scraping code is, taking into account changing webpage structures, scalability, and defensive coding for robustness. The course includes hands-on labs in which you create a spider and parse complex HTML to practice and reinforce these concepts.
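As a rough illustration of what defensive XPath extraction can look like, here is a minimal sketch. It is not taken from the course materials. To stay runnable without installing anything, it uses the Python standard library's ElementTree, which supports only a limited subset of XPath, rather than Scrapy's own selectors, which offer a comparable xpath() method on the response. The sample markup, the class names, and the extract_products helper are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup standing in for a fetched page.
# Real pages are often not valid XML; Scrapy's selectors tolerate that,
# ElementTree does not.
SAMPLE_HTML = """
<html>
  <body>
    <div class="product">
      <h2>Widget</h2>
      <span class="price">9.99</span>
    </div>
    <div class="product">
      <h2>Gadget</h2>
    </div>
  </body>
</html>
"""


def extract_products(markup):
    """Return one dict per product, tolerating missing or malformed fields."""
    root = ET.fromstring(markup)
    products = []
    # Select every <div class="product"> anywhere below the root.
    for node in root.findall(".//div[@class='product']"):
        name = node.findtext("h2")  # None if the element is absent
        price_text = node.findtext("span[@class='price']")
        try:
            price = float(price_text) if price_text is not None else None
        except ValueError:
            # The page structure or formatting changed; do not crash.
            price = None
        products.append({"name": name.strip() if name else None, "price": price})
    return products


if __name__ == "__main__":
    for item in extract_products(SAMPLE_HTML):
        print(item)
```

The second product has no price element, so its price is recorded as None instead of raising an error. That is the kind of behavior the course description refers to when it mentions coding defensively against changing page structures.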

Created by: Duke University


