Introduction To R Language For Data Science
Introduction To R Language For Data Science
INTRODUCTION
Introduction to R language for Data Science
R was invented by two R authors "Robert Gentleman and Ross Ihaka" of the University of Auckland in
1993.R was an implementation of S language, which was invented by John Chambers at Bell
Laboratories. Since the late 2000s, R's reputation has raised and has spread itself contributing banking,
marketing, pharmaceutical, politics, genomics, and many other areas. This has got the top 6th position in
the programming languages. Users frequently shift from low-level, compiled languages such as C++,
other statistical packages such as SAS or SPSS, and from the 800-pound gorilla, Excel. Adding on
updated packages- libraries within the code the extend R's functionality in this time period with a rapid
flow.
R Language supports high-level Language that originally intends to execute interactively whenever the
user runs a command, gets output and then runs another command. It has the capacity to emerge into
systems and handles complex problems. R can produce amazing graphics and reports comfortably by
transforming and analyzing. It is quietly used as full stack for data analysis, extracting and transforming
fitting models, drawing inferences and making predictions, plotting and reporting results.
Applicational usage of R
Current R practice in Companies
Software Companies like facebook, twitter, Google, Ford, BCG, Mckinsey&Company, Microsoft, Uber
are highly making use of R Figure1. As it satisfies most predictive and statistical models. O’Reilly has
worked on data science through R programming and identified that it is first to be ranked for SQL.
Analytics software by KDnuggets surveyed R programming as a leading path for Analytics.
This book will help you to work on different analytical models and give an opportunity for learners to
work on predictive as well as statistical models.Study on R programming has interesting facts that
enhance technological growth.
Important features of R
1. More than 10,000 free packages are supported by R. they are used for analysis in data science.
Following graph describes the rise of different packages are high. This shows that people are
more attracted towards R programming packages especially CRAN packages.
Fig.1.2 R: R Programming CRAN Packages Download
2. R grabs the attention of Data Analysts because it performs maximum analysis for different
data for free. Suppose if u want to work on SAS with this programming language, SAS software
can also perform same by achieving similar stuff. Below are the similar kind of package lists that
are mandatory for data analysis –
For Data Visualization, a package like ggplot2, patchwork so on are used, likewise an
Equivalent SAS Product uses Visual Analytics packages
For Ensemble Learning and Machine Learning, SAS Product uses SAS Enterprise Miner
packages
For Text and Social Media Mining, SAS Product uses SAS Text Miner.
For Optimization and Forecasting, SAS Product uses SAS ETS, PROC OPTMODEL
For RStudio IDE, SAS Product uses SAS Enterprise Guide.
3. With the survey of O’Reilly and study on LinkedIn, R has reached to a top place with high IT
skills by influencing advanced analytics software.
4. Maintains efficient combination of suitable software such as Tableau, SQL Server etc.
Microsoft solutions have achieved revolutionary analytics by merging R enterprise with visual
studio, SQL server and so on.
5. Implementation of different New statistical and machine learning techniques in R is very
much efficient when compared to the statistical. This has lead researchers to work easily by
considering R as their first option.
Following are the Large organizations with top brands with subsequent usage of R:
1. Facebook usages R for behavioral analysis relevant to status updates and profile pictures.
2. Google for advertising and forecasting economic status.
3. Twitter for data visualization and semantic clustering.
4. Microsoft for a variety of purposes in Revolution R company.
5. Uber for different statistical analysis
6. Airbnb for scale data science
7. IBM for joined R consortium group
8. ANZ for credit risk modeling
9. New York times for data visualization
10. HP
1. A.T. Kearney
2. AbsolutData
3. AC Nielsen
4. Accenture
5. Bain & Company
6. Booz Allen Hamilton
7. Capgemini
8. Convergytics
9. Deloitte Consulting
10. Evalueserve
11. EXL
12. EY
13. Fractal Analytics
14. Gartner
15. Genpact
16. IBM
17. KPMG
18. Latent view
19. Manthan Systems
20. McKinsey & Company
21. Mu Sigma
22. PricewaterhouseCoopers
23. SIBIA Analytics
24. Simplify360
25. SmartCube
26. Target
27. The Boston Consulting Group
28. Tiger Analytics
29. Tower Watson
30. WNS
31. ZS Associate
Financial Institutions using R
Following are the companies from US banks, European Banks and Insurance companies using
R:
1. American Express
2. ANZ
3. Bank of America
4. Barclays Bank
5 Bazajallianz Insurance
6. Bharti Axa insurance
7. Blackrock
8. Citibank
9. Dun & Bradstreet
10. Fidelity
11. HSBC
12. JP Morgan
13. KeyBank
14. Lloyds Banking
15. RBS
16. Standard Chartered
17. UBS
18. Wells Fargo
19. Goldman Sachs
20. Morgan Stanley
21. PNC Bank
22. Citizens Bank
23. Fifth Third Bank