Learning Spark

This document provides an introduction to the book Learning Spark which teaches how to use the Spark framework for big data analysis. It includes the table of contents and information about the authors who created Spark and contributed to its development.

Uploaded by

eshwar152

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

27% found this document useful (11 votes)

2K views3 pages

Learning Spark

Uploaded by

eshwar152

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Learning

Spark

Table of Contents
1. introduction

Learning Spark

Learning Spark: Lightning-Fast Big Data Analysis

Chinese translation
Translation the book of Learning Spark: Lightning-Fast Big Data Analysis is only for spark developer educational purposes.
If I violated your copyright, please let me know.
Learning Spark: Lightning-Fast Big Data AnalysisSpark

About the Author

Holden Karau is a software development engineer at Databricks and is active in open source. She is the author of an
earlier Spark book. Prior to Databricks she worked on a variety of search and classification problems at Google,
Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelors of Mathematics in Computer
Science. Outside of software she enjoys playing with fire, welding, and hula hooping.
Most recently, Andy Konwinski co-founded Databricks. Before that he was a PhD student and then postdoc in the AMPLab
at UC Berkeley, focused on large scale distributed computing and cluster scheduling. He co-created and is a committer on
the Apache Mesos project. He also worked with systems engineers and researchers at Google on the design of Omega,
their next generation cluster scheduling system. More recently, he developed and led the AMP Camp Big Data Bootcamps
and first Spark Summit, and has been contributing to the Spark project.
Patrick Wendell is an engineer at Databricks as well as a Spark Committer and PMC member. In the Spark project, Patrick
has acted as release manager for several Spark releases, including Spark 1.0. Patrick also maintains several subsystems
of Spark's core engine. Before helping start Databricks, Patrick obtained an M.S. in Computer Science at UC Berkeley. His
research focused on low latency scheduling for large scale analytics workloads. He holds a B.S.E in Computer Science
from Princeton University
Matei Zaharia is the creator of Apache Spark and CTO at Databricks. He holds a PhD from UC Berkeley, where he started
Spark as a research project. He now serves as its Vice President at Apache. Apart from Spark, he has made research and
open source contributions to other projects in the cluster computing area, including Apache Hadoop (where he is a
committer) and Apache Mesos (which he also helped start at Berkeley).

Examples for Learning Spark

codes https://github.com/gaoxuesong/learning-spark/ forked from https://github.com/databricks/learning-spark

introduction

Azure Machine Learning Guide
100% (1)
Azure Machine Learning Guide
1,748 pages
Apache Cassandra Essentials
From Everand
Apache Cassandra Essentials
Padalia Nitin
4/5 (1)
Spark: Prepared by Dulari Bhatt
No ratings yet
Spark: Prepared by Dulari Bhatt
19 pages
Hadoop With Python
100% (6)
Hadoop With Python
71 pages
(Smtebooks - Com) Big Data Processing With Hadoop 1st Edition
100% (1)
(Smtebooks - Com) Big Data Processing With Hadoop 1st Edition
255 pages
Data Engineering Cookbook
100% (1)
Data Engineering Cookbook
125 pages
Real-Time Streaming with Apache Kafka, Spark, and Storm: Create Platforms That Can Quickly Crunch Data and Deliver Real-Time Analytics to Users
From Everand
Real-Time Streaming with Apache Kafka, Spark, and Storm: Create Platforms That Can Quickly Crunch Data and Deliver Real-Time Analytics to Users
Brindha Priyadarshini Jeyaraman
No ratings yet
ML Use Cases Ebook
100% (2)
ML Use Cases Ebook
53 pages
Learning Spark
100% (1)
Learning Spark
4 pages
Minder Chen, Ph.D. Mchen@gmu - Edu: Member Is A Member of
100% (1)
Minder Chen, Ph.D. Mchen@gmu - Edu: Member Is A Member of
150 pages
Rebuilding Reliable Data Pipelines Through Modern Tools PDF
100% (1)
Rebuilding Reliable Data Pipelines Through Modern Tools PDF
99 pages
Learning Spark Preview Ed
No ratings yet
Learning Spark Preview Ed
18 pages
Apache Spark Python Slides
No ratings yet
Apache Spark Python Slides
186 pages
Day1 Main
No ratings yet
Day1 Main
188 pages
Azure-Databricks-Virtual-Workshop-21-Apr - FINAL PDF
No ratings yet
Azure-Databricks-Virtual-Workshop-21-Apr - FINAL PDF
43 pages
Data Engineering Cookbook
100% (1)
Data Engineering Cookbook
124 pages
Py Spark
No ratings yet
Py Spark
427 pages
7 Steps For A Developer To Learn Apache Spark
No ratings yet
7 Steps For A Developer To Learn Apache Spark
30 pages
Azure Synapse With Power BI Dataflows
100% (1)
Azure Synapse With Power BI Dataflows
19 pages
Fast Data Processing with Spark 2 - Third Edition
From Everand
Fast Data Processing with Spark 2 - Third Edition
Krishna Sankar
No ratings yet
Packt - Hands On - Big.data - Analytics.with - Pyspark.2019
100% (1)
Packt - Hands On - Big.data - Analytics.with - Pyspark.2019
253 pages
Databricks Guide
No ratings yet
Databricks Guide
27 pages
Practitioner’s Guide to Data Science: Streamlining Data Science Solutions using Python, Scikit-Learn, and Azure ML Service Platform
From Everand
Practitioner’s Guide to Data Science: Streamlining Data Science Solutions using Python, Scikit-Learn, and Azure ML Service Platform
Nasir Ali Mirza
No ratings yet
Pyspark Tutorial
100% (2)
Pyspark Tutorial
27 pages
Mlops: 5 Steps To Operationalize Machine Learning Models
No ratings yet
Mlops: 5 Steps To Operationalize Machine Learning Models
17 pages
Tamr EB Getting DataOps Right Full 05-23-19
100% (1)
Tamr EB Getting DataOps Right Full 05-23-19
66 pages
Spark For Python Developers - Sample Chapter
100% (6)
Spark For Python Developers - Sample Chapter
32 pages
Pentaho Data Integration Cookbook - Second Edition
From Everand
Pentaho Data Integration Cookbook - Second Edition
María Carina Roldán
No ratings yet
Loan Risk Analysis With Databricks and XGBoost - A Databricks Guide, Including Code Samples and Notebooks (2019)
No ratings yet
Loan Risk Analysis With Databricks and XGBoost - A Databricks Guide, Including Code Samples and Notebooks (2019)
11 pages
Learn How Databricks Streamlines The Data Management Lifecycle
No ratings yet
Learn How Databricks Streamlines The Data Management Lifecycle
20 pages
Spark Summit East 2015 - Adv Dev Ops - Student Slides
No ratings yet
Spark Summit East 2015 - Adv Dev Ops - Student Slides
219 pages
Data Engineering With Databricks Da
100% (2)
Data Engineering With Databricks Da
232 pages
Eschatology - Kingdom of God
50% (2)
Eschatology - Kingdom of God
13 pages
Making Big Data Simple With Databricks
No ratings yet
Making Big Data Simple With Databricks
25 pages
Exploring Hadoop Ecosystem (Volume 1): Batch Processing
From Everand
Exploring Hadoop Ecosystem (Volume 1): Batch Processing
Wei Liu
No ratings yet
Architecting Data Lakes Zaloni PDF
No ratings yet
Architecting Data Lakes Zaloni PDF
63 pages
50 Safety Director Interview Questions and Answers 1734275478
No ratings yet
50 Safety Director Interview Questions and Answers 1734275478
5 pages
Data Engineering Nanodegree Program Syllabus
No ratings yet
Data Engineering Nanodegree Program Syllabus
16 pages
Verbal Section Passage Test: Prediction Module Iup Medicine Ugm 2019/2020
No ratings yet
Verbal Section Passage Test: Prediction Module Iup Medicine Ugm 2019/2020
13 pages
O Reilly Data Lake Bootcamp Day 11694182865124
No ratings yet
O Reilly Data Lake Bootcamp Day 11694182865124
46 pages
COT-MATH - Identifying Parallel, Inter-Secting and Perpendicular Lines
70% (10)
COT-MATH - Identifying Parallel, Inter-Secting and Perpendicular Lines
3 pages
Spark Interview
No ratings yet
Spark Interview
17 pages
Apache Spark 2.x Cookbook
From Everand
Apache Spark 2.x Cookbook
Rishi Yadav
No ratings yet
Intro To Data Engineering Databricks Webinar 13may
No ratings yet
Intro To Data Engineering Databricks Webinar 13may
59 pages
Azure Data Lake and U-SQL
No ratings yet
Azure Data Lake and U-SQL
51 pages
8888888888888888888
100% (1)
8888888888888888888
131 pages
Preliminary PET Speaking Examiner Pack
100% (1)
Preliminary PET Speaking Examiner Pack
24 pages
4 - Action and RDD Transformations
No ratings yet
4 - Action and RDD Transformations
25 pages
Apache Spark Interview Questions Book
100% (1)
Apache Spark Interview Questions Book
15 pages
Apache Spark Graph Processing
From Everand
Apache Spark Graph Processing
Ramamonjison Rindra
No ratings yet
Data Warehousing by Example
No ratings yet
Data Warehousing by Example
178 pages
Azure Data Explorer From Synapse Analytics Workspace
No ratings yet
Azure Data Explorer From Synapse Analytics Workspace
22 pages
Introduction To Spark For Data Engineers / Data Scientists
100% (3)
Introduction To Spark For Data Engineers / Data Scientists
100 pages
Airflow Introduction
No ratings yet
Airflow Introduction
9 pages
India's Consumer Durables Market
No ratings yet
India's Consumer Durables Market
5 pages
Spark Interview Questions
100% (1)
Spark Interview Questions
8 pages
SSIS Succinctly
No ratings yet
SSIS Succinctly
116 pages
Databricks - Spark Streaming
No ratings yet
Databricks - Spark Streaming
55 pages
1,3 Butadiene
No ratings yet
1,3 Butadiene
7 pages
HDInsight Essentials - Second Edition
From Everand
HDInsight Essentials - Second Edition
Rajesh Nadipalli
No ratings yet
Religions in Your Lips
No ratings yet
Religions in Your Lips
98 pages
Simplifying Data Engineering Databricks
100% (1)
Simplifying Data Engineering Databricks
20 pages
Databricks Essentials: A Guide to Unified Data Analytics
From Everand
Databricks Essentials: A Guide to Unified Data Analytics
Robert Johnson
No ratings yet
Apache Spark Interview Questions
No ratings yet
Apache Spark Interview Questions
12 pages
Databricks Cloud How To Log Analysis Example
No ratings yet
Databricks Cloud How To Log Analysis Example
9 pages
GBT 1591 2018 en
No ratings yet
GBT 1591 2018 en
33 pages
Exam AZ-900 Azure Fundamentals Last Edited 23-December-2020
No ratings yet
Exam AZ-900 Azure Fundamentals Last Edited 23-December-2020
4 pages
Spark Tutorial
No ratings yet
Spark Tutorial
8 pages
Answers To End-Of-Chapter Questions For Chapter 4, Chemical Calculations
0% (1)
Answers To End-Of-Chapter Questions For Chapter 4, Chemical Calculations
2 pages
For More Details, Please Consult Your Hyundai Dealer. Hyundai Motor India LTD 5th-6th Floor, Corporate One - Baani Building, Plot No.-5, Commercial Centre, Jasola, New Delhi-110076
No ratings yet
For More Details, Please Consult Your Hyundai Dealer. Hyundai Motor India LTD 5th-6th Floor, Corporate One - Baani Building, Plot No.-5, Commercial Centre, Jasola, New Delhi-110076
8 pages
Delta Lake Cheat Sheet-1
100% (1)
Delta Lake Cheat Sheet-1
2 pages
Unit 12 Lexis: Commentary
No ratings yet
Unit 12 Lexis: Commentary
5 pages
Ai 2023
No ratings yet
Ai 2023
29 pages
Fixed Displacement Vane Pumps Datasheet
No ratings yet
Fixed Displacement Vane Pumps Datasheet
6 pages
Console Log ZC026
No ratings yet
Console Log ZC026
7 pages
De Cuong On Thi Tieng Anh Hoc Ky II Lop 11 Nang Cao
No ratings yet
De Cuong On Thi Tieng Anh Hoc Ky II Lop 11 Nang Cao
13 pages
Datasheet LT1171HV
No ratings yet
Datasheet LT1171HV
20 pages
Digital Marketing Be Etc (Insem.) (2019 Pattern) (Semester Viii) (Elective Vi) March 24
No ratings yet
Digital Marketing Be Etc (Insem.) (2019 Pattern) (Semester Viii) (Elective Vi) March 24
1 page
(2018) Fittingness - Christopher Howard
No ratings yet
(2018) Fittingness - Christopher Howard
14 pages
Patent and Intellectual Property Rights Issues With Technology Transfer in Bhutan
No ratings yet
Patent and Intellectual Property Rights Issues With Technology Transfer in Bhutan
16 pages
Presentation 1 Adjectives-1
No ratings yet
Presentation 1 Adjectives-1
13 pages
Evergreen State - Music Cultures of The World (1993-1994) Sean Williams
No ratings yet
Evergreen State - Music Cultures of The World (1993-1994) Sean Williams
4 pages
Fox Pueblo Baseball A New Use For Old Witchcraft 1961
No ratings yet
Fox Pueblo Baseball A New Use For Old Witchcraft 1961
9 pages
Document
No ratings yet
Document
5 pages
Tifac Core at Nit Hamirpur
No ratings yet
Tifac Core at Nit Hamirpur
6 pages
Practice Chapter 3 Conformations of Alkanes and Cycloalkanes
No ratings yet
Practice Chapter 3 Conformations of Alkanes and Cycloalkanes
22 pages
IIE Bachelor of Commerce in Law Factsheet 2020 (New) V1 PDF
No ratings yet
IIE Bachelor of Commerce in Law Factsheet 2020 (New) V1 PDF
2 pages
744845889-Murvin-Krak-1 2
No ratings yet
744845889-Murvin-Krak-1 2
1 page

Learning Spark

Uploaded by

Learning Spark

Uploaded by

Learning

Learning Spark: Lightning-Fast Big Data Analysis

About the Author

Examples for Learning Spark

You might also like