Data Science: Wrangling
About this Course
In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point. Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form is called data wrangling. This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to make critical insights that would otherwise be hidden.Created by: Harvard University
Level: Introductory

Related Online Courses
If you have specific questions about this course, please contact us at sds-mm@mit.edu. Data science requires multi-disciplinary skills ranging from mathematics, statistics, machine learning,... more
You may have heard of actuarial science, or you might even know an actuary, but do you know what an actuary does? During the course you’ll hear from a wide variety of actuaries about their c... more
The R programming language is purpose-built for data analysis. R is the key that opens the door between the problems you want to solve with data and the answers you need to meet your objectives.... more
Do you want to build systems that learn from experience? Or exploit data to create simple predictive models of the world? In this course, part of the Data Science MicroMasters program, you will... more
In the information age, data is all around us. Within this data are answers to compelling questions across many societal domains (politics, business, science, etc.). But if you had access to a... more