Data Science in Spark with sparklyr : : CHEAT SHEET

Intro

sparklyr is an R interface for Apache Spark™. It provides a complete dplyr backend and the option to query directly using Spark SQL statements. With sparklyr, you can orchestrate distributed machine learning using either Spark's MLlib or H2O Sparkling Water.

Starting with version 1.044, RStudio Desktop, Server and Pro include integrated support for the sparklyr package. You can create and manage connections to Spark clusters and local Spark instances from inside the IDE.

RStudio Integrates with sparklyr
• Open the connection log
• Disconnect
• Open the Spark UI
• Preview the first 1K rows
• Browse Spark & Hive tables

Data Science Toolchain with Spark + sparklyr
• Import: export an R DataFrame, read a file, or read an existing Hive table
• Tidy: dplyr verbs, direct Spark SQL (DBI), SDF functions (Scala API)
• Understand: Transform (transformer functions), Visualize (collect data into R for plotting), Model (Spark MLlib, H2O extension)
• Communicate: collect data into R; share plots, documents, and apps
(R for Data Science, Grolemund & Wickham)
Using sparklyr

A brief example of a data analysis using Apache Spark, R and sparklyr in local mode:

library(sparklyr); library(dplyr); library(ggplot2);
library(tidyr);
set.seed(100)

spark_install("2.0.1")                              # Install Spark locally

sc <- spark_connect(master = "local")               # Connect to local version

import_iris <- copy_to(sc, iris, "spark_iris",      # Copy data to Spark memory
  overwrite = TRUE)

partition_iris <- sdf_partition(                    # Partition data
  import_iris, training = 0.5, testing = 0.5)

sdf_register(partition_iris,                        # Create a Hive metadata entry for each partition
  c("spark_iris_training", "spark_iris_test"))

tidy_iris <- tbl(sc, "spark_iris_training") %>%
  select(Species, Petal_Length, Petal_Width)

model_iris <- tidy_iris %>%                         # Spark ML decision tree model
  ml_decision_tree(response = "Species",
    features = c("Petal_Length", "Petal_Width"))

test_iris <- tbl(sc, "spark_iris_test")             # Create reference to Spark table

pred_iris <- sdf_predict(                           # Bring data back into R memory for plotting
  model_iris, test_iris) %>%
  collect

pred_iris %>%
  inner_join(data.frame(prediction = 0:2,
    lab = model_iris$model.parameters$labels)) %>%
  ggplot(aes(Petal_Length, Petal_Width, col = lab)) +
  geom_point()

spark_disconnect(sc)                                # Disconnect

Getting Started

LOCAL MODE (No cluster required)
1. Install a local version of Spark: spark_install("2.0.1")
2. Open a connection: sc <- spark_connect(master = "local")

ON A YARN MANAGED CLUSTER
1. Install RStudio Server or RStudio Pro on one of the existing nodes, preferably an edge node.
2. Locate the path to the cluster's Spark Home directory; it is normally "/usr/lib/spark".
3. Open a connection:
   spark_connect(master = "yarn-client", version = "1.6.2",
     spark_home = [Cluster's Spark path])

ON A MESOS MANAGED CLUSTER
1. Install RStudio Server or RStudio Pro on one of the existing nodes.
2. Locate the path to the cluster's Spark directory.
3. Open a connection:
   spark_connect(master = "[mesos URL]", version = "1.6.2",
     spark_home = [Cluster's Spark path])

ON A SPARK STANDALONE CLUSTER
1. Install RStudio Server or RStudio Pro on one of the existing nodes or a server in the same LAN.
2. Install a local version of Spark: spark_install(version = "2.0.1")
3. Open a connection:
   spark_connect(master = "spark://host:port", version = "2.0.1",
     spark_home = spark_home_dir())

USING LIVY (Experimental)
1. The Livy REST application should be running on the cluster.
2. Connect to the cluster:
   sc <- spark_connect(method = "livy", master = "http://host:port")
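Whichever mode is used, a quick way to confirm the connection is working is to check the Spark version and list the tables the connection can see. A minimal sketch, assuming sc is an open connection from one of the modes above:

spark_version(sc)   # Spark version this connection is running against
src_tbls(sc)        # tables currently registered in the connection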
Cluster Deployment

MANAGED CLUSTER
Driver node plus worker nodes, coordinated by a cluster manager (YARN or Mesos).

STAND ALONE CLUSTER
Driver node plus worker nodes, using Spark's standalone cluster manager.

Tuning Spark

EXAMPLE CONFIGURATION
config <- spark_config()
config$spark.executor.cores <- 2
config$spark.executor.memory <- "4G"
sc <- spark_connect(master = "yarn-client",
  config = config, version = "2.0.1")

IMPORTANT TUNING PARAMETERS (with defaults)
• spark.yarn.am.cores
• spark.yarn.am.memory (512m)
• spark.network.timeout (120s)
• spark.executor.memory (1g)
• spark.executor.cores (1)
• spark.executor.instances
• spark.executor.extraJavaOptions
• spark.executor.heartbeatInterval (10s)
• sparklyr.shell.executor-memory
• sparklyr.shell.driver-memory
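The spark.* entries above are set as Spark configuration properties, while the sparklyr.shell.* entries are passed to spark-submit when the connection starts; both kinds can be set on the same spark_config() object. A sketch with illustrative values only (not recommendations):

config <- spark_config()
config$spark.executor.instances <- 4              # illustrative value
config[["sparklyr.shell.driver-memory"]] <- "2G"  # hyphenated names need [[ ]] or backticks
sc <- spark_connect(master = "yarn-client", config = config)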

Import

COPY A DATA FRAME INTO SPARK
sdf_copy_to(sc, iris, "spark_iris")
DBI::dbWriteTable(sc, "spark_iris", iris)

sdf_copy_to(sc, x, name, memory, repartition, overwrite)
DBI::dbWriteTable(conn, name, value)

IMPORT INTO SPARK FROM A FILE
Arguments that apply to all functions:
sc, name, path, options = list(), repartition = 0, memory = TRUE, overwrite = TRUE

CSV      spark_read_csv(header = TRUE, columns = NULL, infer_schema = TRUE,
           delimiter = ",", quote = "\"", escape = "\\",
           charset = "UTF-8", null_value = NULL)
JSON     spark_read_json()
PARQUET  spark_read_parquet()

FROM A TABLE IN HIVE
my_var <- tbl_cache(sc, name = "hive_iris")
tbl_cache(sc, name, force = TRUE)
Loads the table into memory.

my_var <- dplyr::tbl(sc, name = "hive_iris")
dplyr::tbl(sc, ...)
Creates a reference to the table without loading it into memory.
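For example, to read a CSV file into Spark and keep it cached in Spark memory; the table name and path below are placeholders:

flights_tbl <- spark_read_csv(sc, name = "flights_spark",
  path = "path/to/flights.csv",      # placeholder path
  header = TRUE, infer_schema = TRUE,
  memory = TRUE)                     # cache the table in Spark memory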
Wrangle

SPARK SQL VIA DPLYR VERBS
Translates into Spark SQL statements:
my_table <- my_var %>%
  filter(Species == "setosa") %>%
  sample_n(10)

DIRECT SPARK SQL COMMANDS
my_table <- DBI::dbGetQuery(sc, "SELECT * FROM iris LIMIT 10")
DBI::dbGetQuery(conn, statement)

SCALA API VIA SDF FUNCTIONS
sdf_mutate(.data)
Works like the dplyr mutate function.
sdf_partition(x, ..., weights = NULL, seed = sample(.Machine$integer.max, 1))
e.g. sdf_partition(x, training = 0.5, test = 0.5)
sdf_register(x, name = NULL)
Gives a Spark DataFrame a table name.
sdf_sample(x, fraction = 1, replacement = TRUE, seed = NULL)
sdf_sort(x, columns)
Sorts by one or more columns in ascending order.
sdf_with_unique_id(x, id = "id")
sdf_predict(object, newdata)
Spark DataFrame with predicted values.
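Because dplyr verbs are translated to Spark SQL, aggregations also run inside Spark and only the result needs to be collected. A small sketch, assuming the "spark_iris" table registered in the Import examples:

iris_avg <- tbl(sc, "spark_iris") %>%
  group_by(Species) %>%
  summarise(avg_petal_length = mean(Petal_Length)) %>%  # runs as Spark SQL AVG()
  collect()                                             # bring only the summary into R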
ML TRANSFORMERS
ft_binarizer(my_table, input.col = "Petal_Length",
  output.col = "petal_large", threshold = 1.2)

Arguments that apply to all functions:
x, input.col = NULL, output.col = NULL

ft_binarizer(threshold = 0.5)
Assigned values based on threshold.
ft_bucketizer(splits)
Numeric column to discretized column.
ft_discrete_cosine_transform(inverse = FALSE)
Time domain to frequency domain.
ft_elementwise_product(scaling.col)
Element-wise product between 2 columns.
ft_index_to_string()
Index labels back to labels as strings.
ft_one_hot_encoder()
Continuous to binary vectors.
ft_quantile_discretizer(n.buckets = 5L)
Continuous to binned categorical values.
ft_sql_transformer(sql)
ft_string_indexer(params = NULL)
Column of labels into a column of label indices.
ft_vector_assembler()
Combine vectors into single row-vector.
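For instance, a bucketing of petal length into three ranges could look like the sketch below; the output column name and split points are illustrative:

tbl(sc, "spark_iris") %>%
  ft_bucketizer(input.col = "Petal_Length",
    output.col = "petal_bin",
    splits = c(0, 2, 5, 10))   # three buckets: [0,2), [2,5), [5,10]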
Visualize & Communicate

DOWNLOAD DATA TO R MEMORY
r_table <- collect(my_table)
plot(Petal_Width ~ Petal_Length, data = r_table)

dplyr::collect(x)
Download a Spark DataFrame to an R DataFrame.
sdf_read_column(x, column)
Returns contents of a single column to R.

SAVE FROM SPARK TO FILE SYSTEM
Arguments that apply to all functions: x, path
CSV      spark_write_csv(header = TRUE, delimiter = ",", quote = "\"",
           escape = "\\", charset = "UTF-8", null_value = NULL)
JSON     spark_write_json(mode = NULL)
PARQUET  spark_write_parquet(mode = NULL)

Reading & Writing from Apache Spark
• R to Spark: sdf_copy_to, dplyr::copy_to, DBI::dbWriteTable
• Reference or cache tables in Spark: dplyr::tbl, tbl_cache
• File system to Spark: spark_read_<fmt>
• Spark to R: sdf_collect, dplyr::collect, sdf_read_column
• Spark to file system: spark_write_<fmt>
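To persist results without pulling them through R, write directly from Spark to the file system; here my_table is the reference created in the Wrangle examples and the output path is a placeholder:

spark_write_parquet(my_table, path = "path/to/iris_parquet")   # placeholder output path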
Extensions

Create an R package that calls the full Spark API and provides interfaces to Spark packages.

CORE TYPES
spark_connection()   Connection between R and the Spark shell process
spark_jobj()         Instance of a remote Spark object
spark_dataframe()    Instance of a remote Spark DataFrame object

CALL SPARK FROM R
invoke()         Call a method on a Java object
invoke_new()     Create a new object by invoking a constructor
invoke_static()  Call a static method on an object
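As a small sketch of the invoke interface, the row count of a Spark DataFrame can be obtained by calling its count method on the underlying Java object (assuming the "spark_iris" table from earlier):

tbl(sc, "spark_iris") %>%
  spark_dataframe() %>%   # get the remote Spark DataFrame object
  invoke("count")         # call its count() method; returns a number to R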
Model (MLlib)

ml_decision_tree(my_table, response = "Species",
  features = c("Petal_Length", "Petal_Width"))

ml_als_factorization(x, user.column = "user", rating.column = "rating",
  item.column = "item", rank = 10L, regularization.parameter = 0.1,
  iter.max = 10L, ml.options = ml_options())
ml_decision_tree(x, response, features, max.bins = 32L, max.depth = 5L,
  type = c("auto", "regression", "classification"), ml.options = ml_options())
  Same options for: ml_gradient_boosted_trees
ml_generalized_linear_regression(x, response, features, intercept = TRUE,
  family = gaussian(link = "identity"), iter.max = 100L, ml.options = ml_options())
ml_kmeans(x, centers, iter.max = 100, features = dplyr::tbl_vars(x),
  compute.cost = TRUE, tolerance = 1e-04, ml.options = ml_options())
ml_lda(x, features = dplyr::tbl_vars(x), k = length(features),
  alpha = (50/k) + 1, beta = 0.1 + 1, ml.options = ml_options())
ml_linear_regression(x, response, features, intercept = TRUE, alpha = 0,
  lambda = 0, iter.max = 100L, ml.options = ml_options())
  Same options for: ml_logistic_regression
ml_multilayer_perceptron(x, response, features, layers, iter.max = 100,
  seed = sample(.Machine$integer.max, 1), ml.options = ml_options())
ml_naive_bayes(x, response, features, lambda = 0, ml.options = ml_options())
ml_one_vs_rest(x, classifier, response, features, ml.options = ml_options())
ml_pca(x, features = dplyr::tbl_vars(x), ml.options = ml_options())
ml_random_forest(x, response, features, max.bins = 32L, max.depth = 5L,
  num.trees = 20L, type = c("auto", "regression", "classification"),
  ml.options = ml_options())
ml_survival_regression(x, response, features, intercept = TRUE,
  censor = "censor", iter.max = 100L, ml.options = ml_options())
ml_binary_classification_eval(predicted_tbl_spark, label, score,
  metric = "areaUnderROC")
ml_classification_eval(predicted_tbl_spark, label, predicted_lbl, metric = "f1")
ml_tree_feature_importance(sc, model)

MACHINE LEARNING EXTENSIONS
ml_create_dummy_variables()
ml_model()
ml_options()
ml_prepare_dataframe()
ml_prepare_response_features_intercept()
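As one usage sketch, a k-means model can be fit directly on a dplyr reference to a Spark table; the choice of three centers and the "spark_iris" table are carried over from the earlier examples:

kmeans_iris <- tbl(sc, "spark_iris") %>%
  select(Petal_Length, Petal_Width) %>%   # use the two petal columns as features
  ml_kmeans(centers = 3)                  # fit k-means with 3 clusters in Spark
kmeans_iris                               # print the fitted cluster centers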
RStudio® is a trademark of RStudio, Inc. • CC BY SA RStudio • [email protected] • 844-448-1212 • rstudio.com • Learn more at spark.rstudio.com • sparklyr 0.5 • Updated: 2016-12
