This is the code repository for Fundamentals of Apache Flink [Video], published by Packt. It contains all the supporting project files necessary to work through the video course from start to finish.
Have you heard of Apache Flink, but don't know how to use it to get on top of big data? Have you used Flink, but want to learn how to set it up and use it properly? Either way, this course is for you. This course first introduces Flink concepts and terminology, and then moves on to building a Flink instance, collecting data, and using that data to generate output that can be used as processed data input into other systems. You will also use the Flink APIs to process data in batch and streaming modes. By the end of the course, you will be capable of using the Apache Flink ecosystem to achieve complex tasks such as event processing and machine learning.
- Build your own Flink development environment on a Linux server
- Monitor your stream processing in real-time using the Flink UI
- Organize your data comprehensively using data processing pipelines
- Build end-to-end, real-time analytics projects
- Design a distributed Flink environment to efficiently process, transform, and aggregate your data
This course targets big data developers keen to process batch and real-time data on distributed systems. A basic knowledge of Hadoop and big data is assumed, and a reasonable knowledge of Java or Scala is expected.
This course has the following software requirements:
Minimum Hardware Requirements
For successful completion of this course, students will require the computer systems with at least the following, deployed in Amazon Web Services (AWS):
OS: Ubuntu AMI (https://aws.amazon.com/marketplace/pp/B01JBL2M0O)
Processor: c4.large (2 vCPU)
Memory: 4GB RAM
Storage: 50GB additional storage for Flink installation
Recommended Hardware Requirements
For an optimal experience with hands-on labs and other practical activities, we recommend the following configuration, deployed in Amazon Web Services (AWS):
OS: Ubuntu AMI (https://aws.amazon.com/marketplace/pp/B01JBL2M0O)
Processor: c4.xlarge (4 vCPU)
Memory: 8GB RAM
Storage: 50GB additional storage for Flink installation
Software Requirements Operating system: Any Windows, Linux, or MacOS system with a web browser and SSH client Browser: Any recent version of Google Chrome, Mozilla FireFox, or Microsoft Edge Flink: Lab examples are built on Flink 1.7.2. Latest version is available for download here: https://flink.apache.org/downloads.html