Streams Lab Introduction
InfoSphere Streams Introductory Hands-On Lab, Lab Version 1.0 For Use with Streams QuickStart Version 3.2.0.0
Robert Uleman
https://www.ibmdw.net/streamsdev/docs/streams-lab-introduction/
1/26/2014
IBM InfoSphere Streams (Streams) enables continuous and fast analysis of massive volumes of moving data to help improve the speed of business insight and decision-making. InfoSphere Streams provides an execution platform and services for user-developed applications that ingest, filter, analyze, and correlate the information in data streams.

InfoSphere Streams includes the following major components:

Streams Runtime Engine
A collection of distributed processes that work together to facilitate the running of stream processing applications on a given set of host computers in a cluster. A single instantiation of these is referred to as a Streams Instance.

Streams Processing Language (SPL)
Declarative language and framework for writing stream-processing applications. (This lab does not cover direct SPL programming.)

Streams Studio (Studio)
Eclipse-based development environment for writing, compiling, running, visualizing, and debugging Streams applications. Development tooling supports graphical (drag-and-drop) design and editing, as well as quick data visualization in a running application. The Instance Graph feature provides different views into the applications running within a Streams Instance, including the display of live data flow metrics and numerous color highlighting schemes for quick understanding and diagnostics of data flows.

InfoSphere Streams Console
The Streams Console is a web-based graphical user interface that is provided by the Streams Web Service (SWS). You can use the Streams Console to monitor and manage your Streams instances and applications from any computer that has HTTPS connectivity to the server that is running SWS. The Streams Console also supports data visualization in charts and tables.

Streamtool
Command-line interface to the Streams Runtime Engine. (This lab does not use the command-line interface.)
Overview
This hands-on lab provides a broad introduction to the components and tools of InfoSphere Streams.
Figure 1. InfoSphere Streams lab overview

The lab is based on a trivial example from a Smarter Cities automotive scenario: handling vehicle locations and speeds (and other variables). This is used to illustrate a number of techniques and principles, including:

- Graphical design of an application graph and using a Properties view to configure program details
- Runtime visualization of Streams applications and data flow metrics
- Application integration using importing and exporting of data streams based on stream properties

Figure 1 provides a graphical overview of the completed lab.

The lab environment includes five Eclipse workspaces (directories) numbered 1 through 5. Labs build on one another, but each workspace already contains a project with what would be the results of the previous lab. (Workspace 1 is empty; workspace 5 has the final result, but there are no instructions for modifying it.) This way, attendees can experiment and get themselves into trouble any way they like in any lab and still go on to the next lab with everything in place, simply by switching workspaces.

The lab is broken into four parts:

Lab 1
A simple Streams app. Start a Streams Instance. Open Streams Studio; explore its views. Create an SPL application project; create an application graph with three operators; configure the operators. Run the application and verify the results.

Lab 2
Enhance the app: add the ability to read multiple files from a given directory and slow down the flow so you can watch things happen. Learn about jobs and PEs. Use the Instance Graph to monitor the stream flows and show data.

Lab 3
Enhance the app: add an operator to compute the average speed every five observations, separately for two cars. Use the Streams Console to visualize results.

Lab 4
Enhance the app: add an operator to check the vehicle ID format and separate records with an unexpected ID structure onto an error stream. Use exported application streams to create a modular application. If possible (internet access required), bring in live data.

If you want more information about the labs, the video that accompanies this page gives a good overview.
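
The Lab 3 computation (an average speed every five observations, kept separately per car) can be pictured outside of Streams with a short Python sketch. This is only an illustration of the idea, not the lab's SPL operator or its configuration: it assumes that "every five observations" means non-overlapping groups of five, and the function name, the vehicle IDs C101 and C133, and the speed values are all hypothetical.

```python
from collections import defaultdict


def average_speed_per_vehicle(observations, window_size=5):
    """Yield (vehicle_id, average_speed) each time a vehicle has
    accumulated window_size observations, then start a fresh group.

    Assumption: groups do not overlap (a count-based tumbling window).
    """
    windows = defaultdict(list)  # one buffer of speeds per vehicle ID
    for vehicle_id, speed in observations:
        window = windows[vehicle_id]
        window.append(speed)
        if len(window) == window_size:
            yield vehicle_id, sum(window) / window_size
            window.clear()  # begin the next group of observations


# Hypothetical sample data: two cars reporting speeds alternately.
observations = []
for i in range(10):
    observations.append(("C101", 30.0 + i))
    observations.append(("C133", 50.0))

results = list(average_speed_per_vehicle(observations))
for vehicle_id, average in results:
    print(vehicle_id, average)
```

Keeping one buffer per vehicle ID is what makes the averages independent: observations from one car never count toward the other car's group of five.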
Start Lab 1.
Need Help? Ask the forum!