Data Science: Productivity Tools

About this Course

A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging. Part of our Professional Certificate Program in Data Science, this course explains how to use Unix/Linux as a tool for managing files and directories on your computer and how to keep the file system organized. You will be introduced to the version control systems git, a powerful tool for keeping track of changes in your scripts and reports. We also introduce you to GitHub and demonstrate how you can use this service to keep your work in a repository that facilitates collaborations. Finally, you will learn to write reports in R markdown which permits you to incorporate text and code into a document. We'll put it all together using the powerful integrated desktop environment RStudio.

Created by: Harvard University

Level: Introductory


Related Online Courses

Las decisiones hoy día se realizan considerando múltiples variables en forma simultánea, para ello debemos analizar conjuntos de datos multivariantes medidos simultáneamente para cada individuo u o... more
The R language plays a critical role in data analysis and a common programming language when working in the field of data science & analytics. This course will introduce you to R language... more
This course discusses properties and applications of random variables. When you’re done, you’ll have enough firepower to undertake a wide variety of modeling and analysis problems; and you’ll be we... more
The job of a data scientist is to glean knowledge from complex and noisy datasets. Reasoning about uncertainty is inherent in the analysis of noisy data. Probability and Statistics provide the... more
This course covers two important methodologies in statistics – confidence intervals and hypothesis testing. Confidence intervals are encountered in everyday life, and allow us to make p... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL