Web Scraping with Python

About this Course

In this 2-hour, project-based course, you will learn to analyze complex HTML structures and identify the relevant data to extract using Scrapy and XPath. You will apply core web scraping concepts, including setting up a Scrapy project, generating spiders, and writing XPath queries to extract data from websites that do not provide an API. You will also evaluate the effectiveness and efficiency of your scraping code, considering changing webpage structures, scalability, and defensive coding for robustness. Hands-on labs, in which you create a spider and parse complex HTML, let you practice and reinforce these concepts.
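
To give a feel for defensive XPath extraction, here is a minimal sketch. It is not taken from the course materials. It uses Python's standard-library `xml.etree.ElementTree`, which supports only a limited XPath subset and requires well-formed markup, as a stand-in for Scrapy's selectors. The sample markup, the `extract_products` function, and the field names are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample markup; real pages are usually messier
# and would need an HTML-tolerant parser such as the one Scrapy provides.
SAMPLE_HTML = """
<html>
  <body>
    <div class="product">
      <h2>Widget</h2>
      <span class="price">9.99</span>
    </div>
    <div class="product">
      <h2>Gadget</h2>
    </div>
  </body>
</html>
"""


def extract_products(markup):
    """Return one dict per product block, tolerating missing fields."""
    root = ET.fromstring(markup)
    products = []
    # Select every <div class="product"> anywhere below the root.
    for node in root.findall(".//div[@class='product']"):
        name = node.findtext("h2")  # None if the heading is absent
        price_text = node.findtext("span[@class='price']")
        try:
            price = float(price_text) if price_text is not None else None
        except ValueError:
            # The price element exists but does not hold a number.
            price = None
        products.append({"name": name.strip() if name else None, "price": price})
    return products


if __name__ == "__main__":
    for item in extract_products(SAMPLE_HTML):
        print(item)
```

The second product block has no price element, so the function records `None` for it instead of raising an error. Handling absent or malformed fields this way is one approach to keeping a scraper working when a page's structure changes.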

Created by: Duke University

