Data Science: Wrangling

About this Course

In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point. Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form is called data wrangling. This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to make critical insights that would otherwise be hidden.

Created by: Harvard University

Level: Introductory


Related Online Courses

A majority of the world's data resides in databases. SQL (or Structured Query Language) is a powerful language for communicating with and extracting data from databases. A working knowledge of... more
We will explain how to perform the standard processing and normalization steps, starting with raw data, to get to the point where one can investigate relevant biological questions. Throughout the... more
Este curso te permitirá desarrollar habilidades como un tomador de decisiones con base a las siguientes competencias: análisis de elementos estadístico de la información conceptos y fun... more
In data science, data is called "big" if it cannot fit into the memory of a single standard laptop or workstation. The analysis of big datasets requires using a cluster of tens, hundreds or... more
El análisis exploratorio de datos (EDA, por sus siglas en inglés, Exploratory Data Analysis) es el proceso o tratamiento estadístico al cual se someten los datos de una muestra con la que se bu... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL