R Proook Pages 1
R Proook Pages 1
Dr. G. Sudhamathy
If you are looking for a complete step-by-step instructions for learning R Programming
for Statistical Data Analysis, Graphical Visualization and Data Mining, authors Dr.
Sudhamathy & Dr. Jothi venkateswaran’s “R Programming - An Approach to Data
Analytics” is a hands-on book packed with examples and references that would help
you get started coding in R for variety of data science problems.
Hopefully you can take the instructions provided in this book to get started in R
programming for your next data analysis project, do some exciting data visualization
and data mining on your own.
It’s my immense happiness in penning this foreword for a book that is quite impressive
for any techie who is interested in R-programming. It’s also equally joyous to have a
book written by experts, Dr. G. Sudhamathy and Dr. C. Jothi Venkateswaran. When
a book can teach you and guide you as you work hands on the tool, you are in the
right direction in your learning path.
One can be definitively sure that book will be of great help and guidance for the
learner to carry out their works on Analytics using R, either in the research, practice
or just to learn the tool.
Best wishes for a bestselling of this book in the Academia, Research and Practice.
Dr. S. Justus
Associate Professor & Chair - Software Engineering Research Group
VIT University, Chennai
Libros de Estadística-Ciencia de Datos|Statistics-Data Science Books (PDF)
Libros de Estadística-Ciencia de Datos|Statistics-Data Science Books (PDF)
PREFACE
Huge volumes of data are being generated by many sources like commercial
enterprises, scientific domains and general public daily. According to a recent
research, data production will be 44 times greater in 2020 than it was in 2010.
Data being a vital resource for business organizations and other domains like
education, health, manufacturing etc., its management and analysis is becoming
increasingly important. This data, due to its volume, variety and velocity, often
referred to as Big Data, also includes highly unstructured data in the form of textual
documents, web pages, graphical information and social media comments. Since
Big Data is characterised by massive sample sizes, high dimensionality and intrinsic
heterogeneity, traditional approaches to data management, visualisation and
analytics are no longer satisfactorily applicable. There is therefore an urgent need
for newer tools, better frameworks and workable methodologies for such data to
be appropriately categorised, logically segmented, efficiently analysed and securely
managed. This requirement has resulted in an emerging new discipline of Data
Science that is now gaining much attention with researchers and practitioners in
the field of Data Analytics.
R programming language and make it easy to approach by any one. The chapters
are designed in such a fashion that it targets the beginners with the first 4 chapters
and targets the advanced concept learners in the next 3 chapters. The book also
helps the reader with the list of all packages and functions used in this book along
with the page numbers to know the usage of those. Every concept discussed in the
various sections in this book has proper example dealt with a set of code and its
results (as text or as graphs).
The book is organized into 7 chapters and the concept discussed in each chapter
is as detailed below.
Chapter 2 discusses on the basic data types in R, the primitive data types such
as vectors, matrices and arrays, lists and factors. It also deals with the complex data
types such as data frames, strings, dates and times. The chapter not only discusses
on the data creation, but also basic operations on the data of different data types.
Chapter 3 deals with data preparation in which it details on how and where to
fetch the datasets from, how to import and export data from various sources which
are of different types like CSV files, XML files, etc. It also discusses on the ways of
accessing various databases. The data cleaning and transformation techniques such
as data reshaping, grouping functions are also outlined in this chapter.
Chapter 4 is about using the graphical features in R for exploratory data analysis.
It gives examples of pie charts, scatter plots, line plots, histograms, box plots and
bar plots using the various graphical packages such as base, lattice and ggplot2.
Chapter 5 deals with statistical analysis concepts using R such as the basic
statistical measures like mean, median, mode, standard deviation, variance and
ranges. It discusses on the distribution of data as normal distribution and binomial
distribution and how it can be viewed and analyzed using R. Then, the chapter
explores on the complex statistical techniques such as correlation analysis, regression
analysis, ANOVA and hypothesis testing which can be implemented using R.
Chapter 7 is mainly to explore the various essential case studies such as text
analytics, credit risk analysis, social network analysis and few exploratory data
analysis. The main purpose of this chapter is to use the basic and advanced concepts
presented in the other previous chapters of this book.
The author would like to mention her special regards and thanks to Dr. G.
P. Jeyanthi, Research and Consultancy Director, Dr. A. Parvathi, Dean, Faculty of
Science and Dr. V. Radha, Head, Department of Computer Science, Avinashilingam
Universty, Coimbatore, for their constant encouragement and support to turn this
work into a useful product.
The author wishes to thank all the faculty members of the Department of
Computer Science, Avinashilingam University, Coimbatore, for their continuous
support and suggestions for this book.
We are grateful to the students and teacher community who kept us on our
toes with their constant bombardment of queries which prompted us to learn more,
simplify our learning and findings and place them neatly in a book.
Our Special regards for the experts Mr. Sajeev Madhavan, Director of
Architecture, Oracle, USA and Dr. S. Justus, Associate Professor, VIT, Chennai who
gave their expert opinion in shaping this book into a more appealing format.
Most importantly we would like to thank our family members without whose
support this book would not have been a reality.
Last, but not the least, this work is a dedication to God, the Almighty whose
grace has showered upon us in making our dream come true.
G. Sudhamathy
C. Jothi Venkateswaran