02 Learning Goals For The Course
In this course you will be introduced to a Data Scientist's workbench or toolkit that consists of a variety
of tools, languages, libraries, APIs, data sets, models, etc. used by Data Scientists. Try not to be
overwhelmed by the sheer number of components and tools that exist in the Data Science ecosystem.
The main goal of the course is for you to become knowledgeable about the kinds of tools Data Scientists use,
see examples of each, and get some hands-on time with a few key tools.
As such, you are not required to recall the name of every single tool covered in the course. However, you should be
familiar with the categories or types of tools and one or two examples of each type. Modules 4 and 5 of this
course will cover some of the most important ones for a beginner Data Scientist in greater depth and
enable you to get hands-on experience with them. As you take additional Data Science courses, you will
become more acquainted with some of the other tools and libraries. Some may be required to perform
more specialized or advanced Data Science or Machine Learning tasks. So don't try to remember all of
the names just now. Pay special attention, though, to Video and Lesson summaries.
To successfully complete the course, you must complete the first 6 of the 7 modules. The 7th module is
optional.
In this module, you will learn about the different types and categories of tools that data scientists use
and popular examples of each. You will also become familiar with Open Source, Cloud-based, and
Commercial options for data science tools.
Learning Objectives
Describe the components of a Data Scientist's toolkit and list various tool categories
List examples of Open Source, Commercial, and Cloud-based tools in various categories
This module introduces the criteria for deciding which language you should learn. You
will learn the benefits of Python, R, SQL, and other common languages such as Java, Scala, C++,
JavaScript, and Julia. You will explore how you can use these languages in Data Science. You will also
look at some sites for more information about the languages.
Learning Objectives
Identify the criteria and roles for determining the language to learn.
This module will give you in-depth knowledge of different libraries, APIs, dataset sources, and models
used by data scientists.
Learning Objectives
List examples of the various libraries: scientific, visualization, machine learning, and deep
learning.
List the tasks that a data scientist needs to perform to build a model.
This module introduces the Jupyter Notebook and JupyterLab. You will learn how to work with different
kernels and the basic Jupyter architecture. In addition, you will identify the tools in an Anaconda Jupyter
environment. Finally, the module gives an overview of cloud-based Jupyter environments and their data science
features.
Learning Objectives
This module starts with an introduction to R and RStudio and ends with using GitHub. You will
learn about the different R visualization packages and how to create visual charts using the plot
function.
Further in the module, you will develop the essential conceptual and hands-on skills to work with Git
and GitHub. You will start with an overview of Git and GitHub, creating a GitHub account and a project
repository, adding files, and committing your changes using the web interface. Next, you will become
familiar with Git workflows involving branches, pull requests (PRs), and merges. You will also complete a
project at the end to apply and demonstrate your newly acquired skills.
Learning Objectives
Explain version control and describe the Git and GitHub environment.
Describe the purpose of source repositories and explain how GitHub satisfies the needs of a
source repository.
In this module, you will work on a final project to demonstrate some of the skills learned in the course.
You will also be tested on your knowledge of various components and tools in a Data Scientist's toolkit
learned in the previous modules.
Learning Objectives
This is an optional module if you are interested in learning about and working with data science tools
from IBM such as Watson Studio.
Learning Objectives
Find common resources in Watson Studio and IBM Cloud Pak for Data.
Use different types of Jupyter Notebook templates and kernels on IBM Watson Studio.
Describe how to connect a Watson Studio account and publish a notebook in GitHub.