
Atelier 4-4: SPARK STREAMING

This workshop explores a file of New York City taxi trips, containing information such as the number of passengers and the make of each car: a Python script streams the rows over a TCP socket, and a Spark Streaming application counts passengers by vehicle make. The files used are:

https://itabacademy.com/bigdata/hadoop/Spark/taxistreams.py
https://itabacademy.com/bigdata/hadoop/Spark/ss-test.scala
https://itabacademy.com/bigdata/hadoop/Spark/nyctaxi100.csv

Write the Python script:

[cloudera@quickstart ~]$ cat > taxistreams.py

# coding: utf-8

import socket
import time

# Open a TCP server socket on localhost:7777
s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
s.bind(("localhost", 7777))
s.listen(1)
print("Started...")

while True:
    # Wait for a client (the Spark Streaming receiver) to connect
    c, address = s.accept()
    # Send the CSV file line by line, one row every half second
    for row in open("nyctaxi100.csv"):
        print(row)
        c.send(row.encode())
        time.sleep(0.5)
    c.close()

Ctrl+D … to save and close the file.
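
Note that the script sends one row every half second (time.sleep(0.5)); combined with the one-second batch interval used below, each Spark micro-batch should receive roughly two rows. The TCP socket is simply a way to simulate a live stream from a static CSV file.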

Run the Python script:


[cloudera@quickstart ~]$ python taxistreams.py
Started...

Open another console, launch spark-shell, then run the processing:

import org.apache.log4j.Logger
import org.apache.log4j.Level
Logger.getLogger("org").setLevel(Level.OFF)
Logger.getLogger("akka").setLevel(Level.OFF)
import org.apache.spark._
import org.apache.spark.streaming._
import org.apache.spark.streaming.StreamingContext._
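// sc is the SparkContext provided by spark-shell; Seconds(1) sets the micro-batch interval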
val ssc = new StreamingContext(sc, Seconds(1))

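// Connect to the Python feeder listening on localhost:7777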
val lines = ssc.socketTextStream("localhost", 7777)

val pass = lines.map(_.split(",")).
  // field 15 is assumed to hold the vehicle make and field 7 the passenger count
  map(pass => (pass(15), pass(7).toInt)).
  reduceByKey(_ + _)

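// Print the first elements of each batch's result to the console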
pass.print()

ssc.start()
ssc.awaitTermination()
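
To make the mapping concrete, here is a minimal sketch of what happens to a single CSV row, runnable in any Scala REPL. The row below is invented purely for illustration; the layout of nyctaxi100.csv is assumed, with index 7 holding the passenger count and index 15 the make:

// Hypothetical row; only indices 7 (passenger count) and 15 (make) matter here
val row = "f0,f1,f2,f3,f4,f5,f6,2,f8,f9,f10,f11,f12,f13,f14,TOYOTA"
val fields = row.split(",")
val pair = (fields(15), fields(7).toInt)   // ("TOYOTA", 2)
// reduceByKey(_ + _) then sums the Int values per make within each batch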

---------------------------------------------------------------------------
[cloudera@quickstart ~]$ spark-shell
Setting default log level to "WARN".
To adjust logging level use sc.setLogLevel(newLevel).
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/lib/flume-ng/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/lib/parquet/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/lib/avro/avro-tools-1.7.6-cdh5.12.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Welcome to
____ __
/ __/__ ___ _____/ /__
_\ \/ _ \/ _ `/ __/ '_/
/___/ .__/\_,_/_/ /_/\_\ version 1.6.0
/_/

Using Scala version 2.10.5 (Java HotSpot(TM) 64-Bit Server VM, Java 1.7.0_67)
Type in expressions to have them evaluated.
Type :help for more information.
22/11/07 10:22:22 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
Spark context available as sc (master = local[*], app id = local-1667845345596).
22/11/07 10:22:29 WARN shortcircuit.DomainSocketFactory: The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.
SQL context available as sqlContext.

scala> import org.apache.log4j.Logger
import org.apache.log4j.Logger

scala> import org.apache.log4j.Level
import org.apache.log4j.Level

scala> Logger.getLogger("org").setLevel(Level.OFF)

scala> Logger.getLogger("akka").setLevel(Level.OFF)

scala> import org.apache.spark._
import org.apache.spark._

scala> import org.apache.spark.streaming._
import org.apache.spark.streaming._

scala> import org.apache.spark.streaming.StreamingContext._
import org.apache.spark.streaming.StreamingContext._

scala> val ssc = new StreamingContext(sc, Seconds(1))
ssc: org.apache.spark.streaming.StreamingContext = org.apache.spark.streaming.StreamingContext@11467ced

scala> val lines = ssc.socketTextStream("localhost", 7777)
lines: org.apache.spark.streaming.dstream.ReceiverInputDStream[String] = org.apache.spark.streaming.dstream.SocketInputDStream@75e011d9

scala> val pass = lines.map(_.split(",")).
     | map(pass=>(pass(15), pass(7).toInt)).
     | reduceByKey(_+_)
pass: org.apache.spark.streaming.dstream.DStream[(String, Int)] = org.apache.spark.streaming.dstream.ShuffledDStream@48b75e7f

scala> pass.print()

scala> ssc.start()

scala> ssc.awaitTermination()

Processing result:
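
The original screenshot of the result is not reproduced here. Each one-second batch printed by pass.print() follows Spark's standard DStream output format; the values below are placeholders, since the actual pairs depend on the contents of nyctaxi100.csv:

-------------------------------------------
Time: <batch timestamp> ms
-------------------------------------------
(<make>,<total passengers>)
...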
