Data Science in R With Rubén Sánchez Sancho

This document provides an overview and summary of the book "Data Science in R". The book aims to teach the most important tools in data science using R. It is organized based on the typical steps in a data science project: programming in R, importing and tidying data, exploring data through transformation, visualization, and modeling, and communicating results. The book covers topics such as R syntax, data structures, packages, importing data, transforming data, statistical analysis, and visualization. It provides readers with the necessary skills and resources to work with a variety of data analysis tasks using R.

DATA SCIENCE IN R

Data Science is an exciting discipline that makes it possible to analyze data in its
different forms in order to extract and unify knowledge. The main objective of
"Data Science with R" is to help you learn the most important tools in Data
Science. After following this book, you will have the resources you need to deal
with a wide variety of data analysis tasks, using the best parts of R. It is the
complete and exhaustive Spanish-language book on data analysis with R that you are
looking for. Well organized and methodical, it presents every topic in a simple,
gradual and 100% practical way. It also uses the same material developed for the
Data Science with R course on the Udemy platform.

What will you learn with this book?

The objective of this book is to give you a solid foundation in the vast majority
of the tools used in a typical Data Science project. Our model of those tools is
shown in the following figure:

https://i.imgur.com/DO2BKK7.png

How Is This Book Organized?

The book is organized according to the necessary tools in a typical Data Science
project, in the order in which we will use them in our data analysis.

Programming in R

In the first part of the book, we will learn the R programming language:

1. The Syntax of R.

2. Data Structures in R.

3. Control Structures in R.

4. Functions in R.

5. Packages in R.

Import Data in R

In the second part of the book, we will deal with the two tasks of the data import
phase, which we describe next:

First, we will see how to import our data into R. By this we mean that our data
will be stored in files, in databases, or behind a web API, and the objective of
this task is to load that data into a data frame.
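
As a minimal sketch of the import task for the file case, the following base R
example loads a CSV file into a data frame. The file contents and the column names
(city, year, population) are invented here for illustration and do not come from
the book.

```r
# Write a small, made-up CSV file to a temporary location
# so that the example is self-contained.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("city,year,population",
             "Madrid,2020,3300000",
             "Sevilla,2020,690000"),
           csv_path)

# Import the file into a data frame.
people <- read.csv(csv_path, stringsAsFactors = FALSE)

# Inspect the column names and types of the imported data.
str(people)
```

Databases and web APIs need additional packages, so they are not sketched here.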

Once we have imported our data, the next task will be to tidy it. The objective
of this task is to store our data in a consistent format in which the semantics of
the data set match the way it is stored. In short, our data will be in tidy format
when each variable is in its own column and each observation is in its own row.
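
To make the tidy format concrete, here is a small base R sketch. The data frame
and its column names (city, pop_2019, pop_2020) are hypothetical. In the wide
table the variable "year" is hidden in the column names; in the tidy table it has
its own column, and each row is one observation of a city in a year.

```r
# A wide table: the year is encoded in the column names, not in a column.
wide <- data.frame(
  city     = c("Madrid", "Sevilla"),
  pop_2019 = c(10, 20),
  pop_2020 = c(11, 21)
)

# Rebuild the table by hand so that each variable (city, year, population)
# has its own column and each city-year observation has its own row.
tidy <- data.frame(
  city       = rep(wide$city, times = 2),
  year       = rep(c(2019, 2020), each = nrow(wide)),
  population = c(wide$pop_2019, wide$pop_2020)
)

print(tidy)
```

The reshaping is written out by hand only to show what changes; dedicated
reshaping functions are the usual choice in practice.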

Data Exploration

In the third part of the book, we will deal with the tasks of the exploration phase
of our data, known as data wrangling.

First, a common task is to transform our data. Transforming our data includes
filtering the observations of interest (such as all the people in a city, or all
the data from the last year), creating new variables as functions of existing
variables (for example, calculating speed from distance and time), and calculating
a set of summary statistics (such as means or standard deviations).
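
The three kinds of transformation can be sketched in base R as follows. The trips
data frame and its columns (city, distance_km, time_h) are invented for this
example, and the book itself may use different functions for the same steps.

```r
# Hypothetical data: one row per trip.
trips <- data.frame(
  city        = c("Madrid", "Madrid", "Sevilla", "Sevilla"),
  distance_km = c(10, 30, 12, 20),
  time_h      = c(0.5, 1.0, 0.4, 0.5)
)

# 1. Filter the observations of interest: only the trips in Madrid.
madrid <- trips[trips$city == "Madrid", ]

# 2. Create a new variable from existing ones: speed = distance / time.
trips$speed_kmh <- trips$distance_km / trips$time_h

# 3. Summarize: mean and standard deviation of the new variable.
speed_summary <- c(mean = mean(trips$speed_kmh),
                   sd   = sd(trips$speed_kmh))

print(madrid)
print(speed_summary)
```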

Afterwards, we will visualize and model our data. In this course we will cover only
visualization.

Visualization is fundamental in human activity. A well-made visualization shows
us things we did not expect, or leads us to formulate new questions about our data.
It can also tell us that we are not asking the right questions, or that we need to
collect new data.

Communicate the Results

The last step in data science is communication, an absolutely critical part of a
data analysis project. It does not matter how well we have modeled or visualized
our data if we are then unable to communicate our results to others.

Rubén Sánchez Sancho

Technical Engineer in Management Informatics (Ingeniero Técnico en Informática de Gestión)

Master's in Data Science
