Using R and Tableau Software - 1 PDF
Using R and Tableau Software - 1 PDF
Table of Contents
What is R?.......................................................................................................................3
What is R?
R is a popular statistical language used to perform sophisticated statistical
analysis and predictive analytics, such as linear and nonlinear modeling, statistical
tests, time-series analysis, classification, clustering, etc.
The R-console primarily utilizes a command-line interface, but there are many
GUI tools available for download to make it easier to write R programs (most
R is a free software environment
s
for statistical computing and of them are free). With R, users can create variables, formulas, functions, and
graphics. It compiles and runs on
graphs to visualize their analysis and predictions.
a wide variety of UNIX platforms,
Windows and MacOS.
As a free, open-source language, there is a community of contributors who
continually create new packages (extensions) for R that define advanced
statistical functions that were not originally built in to R. These packages can be
downloaded into R to expand R’s capabilities. Most of these packages are also
developed and made available for free. It is for this reason that R has become so
popular and why it continues to gain functionality over time.
6
Sepal length
4
Species
I. setosa
2 I. versicolor
I. virginica
0
3
Sepal width
0
0 2 4 6 0.0 0.5 1.0 1.5 2.0 2.5
Petal length Petal width
² RServe is a free R server and can be installed locally on a desktop or laptop computer.
See below for more information on RServe.
5
Users who are already proficient with R will find the integration beneficial
for several reasons:
• They will be able to access any R package or function that has been installed
in an R server which they can access.
• Users must have proficiency with the R language to write the appropriate
scripts and functional calls they require.
² RServe is a free R server and can be installed locally on a desktop or laptop computer.
See below for more information on RServe.
6
There are four new built-in functions that are used to call specific R models and
functions. The functions are:
• SCRIPT_REAL
• SCRIPT_STR
• SCRIPT_INT
• SCRIPT_BOOL
These functions are distinct only in the type of result they return: a real number, a
string, an integer, or a Boolean.
The arguments you pass into each of these functions include R-language scripts
and function calls. You can pass 1 or more arguments to R, which are then passed
dynamically via Tableau. For instance, you can pass the sales of each customer on a viz.
When using Desktop, you will need to have an R server (via Rserve³) that Tableau can
access to successfully call the new functions. Additionally, you can import datasets
from R directly into Tableau without using functions or calculated fields.
Tableau Server:
Tableau Server must be able to communicate with your R server in order for workbooks
with R function calls to work. When this is done, you can share workbooks and views
with others while still incorporating the results from the R calculations. If there is no R
server running that Tableau Server can access, then these workbooks will not be able
to show the correct output.
At the current time, Tableau Online and Tableau Public are not supporting R, so the R
statistical capabilities will not be available through these services.
³ What is Rserve? Rserve allows other programs to use facilities of R (see http://www.r-project.org)
without the need to initialize R or link against an R library. Rserve supports remote connection,
authentication and file transfer. Typical use is to integrate R backend for computation of statistical
models, plots etc. in other applications.
7
• You cannot export data from Tableau into R directly to run a new model outside
of using the SCRIPT_* functions.
1. Download and Install R. Click here to find the file and instructions
on downloading R.
2. Download and Install Rserve. You will need to install an Rserve for Tableau
to connect to in order to utilize the new script functions. In the R console,
enter the following commands:
install.packages(“Rserve”)
library(Rserve)
Rserve()
c. Click on the “Test Connection” button to make sure everything runs smoothly.
You should see a successful message. Click OK to close.
4. Start using the R scripts in Tableau. Now you will be able to create new
calculated fields in Tableau Desktop that utilize the SCRIPT_* functions to
make R functional calls.
9
About Tableau
Tableau Software helps people see and understand data. Tableau helps anyone quickly analyze,
visualize and share information. More than 15,000 customer accounts get rapid results with Tableau
in the office and on-the-go. And tens of thousands of people use Tableau Public to share data in
their blogs and websites. See how Tableau can help you by downloading the free trial at
www.tableausoftware.com/trial.
Learning R
The R Project for Statistical Computing
R Training Tutorials
Coursera: Data and Statistics Courses, and Computing for Data Analysis (Jan 2014)
R Scripts Repository
Data Plus Science: Geocoding addresses directly in Tableau 8.1 using integration with R
Tableau and Tableau Software are trademarks of Tableau Software, Inc. All other company and product
names may be trademarks of the respective companies with which they are associated.