Data Science: Productivity Tools
About this Course
A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging. Part of our Professional Certificate Program in Data Science, this course explains how to use Unix/Linux as a tool for managing files and directories on your computer and how to keep the file system organized. You will be introduced to the version control systems git, a powerful tool for keeping track of changes in your scripts and reports. We also introduce you to GitHub and demonstrate how you can use this service to keep your work in a repository that facilitates collaborations. Finally, you will learn to write reports in R markdown which permits you to incorporate text and code into a document. We'll put it all together using the powerful integrated desktop environment RStudio.Created by: Harvard University
Level: Introductory

Related Online Courses
This course provides an introduction to basic probability concepts. Our emphasis is on applications in science and engineering, with the goal of enhancing modeling and analysis skills for a variety... more
Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills you’ve acquired in thi... more
This is an introductory-level course in supervised learning, with a focus on regression and classification methods. The syllabus includes: linear and polynomial regression, logistic regression and... more
This course, presented by the IMF's Statistics Department, teaches you how to compile timely, high quality national accounts statistics based on the system of national accounts (SNA) framework. The... more
A majority of the world's data resides in databases. SQL (or Structured Query Language) is a powerful language for communicating with and extracting data from databases. A working knowledge of... more