Xenstack Fullstack 03 PDF

This document provides an overview of topics related to big data including introductory courses on setting up Elasticsearch, Hadoop, MongoDB, Cassandra, and the ELK stack. It also discusses using Cloudera for Hadoop clusters and HBase, and hosting large tables with billions of rows and millions of columns using HBase. The document promotes the source as an expert on big data.

xenstack

xenstack.club

FullStack - 03
How big is BigData

01.
What is BigData

- Intro to BigData world
- Beyond Relational Databases
- RDBMS vs NoSQL
- MySQL 101
- PostgreSQL 101
- Set up Hadoop
- Set up Elasticsearch
- BigData use cases

02.
Elasticsearch 6
- Set up 5-node ES cluster
- Automate installation - Ansible
- Automate installation - Puppet
- Set up ES cluster in AWS
- Fully automate ES cluster setup in AWS using Terraform
- Launch Elasticsearch service - AWS

03.
Hadoop

- What is Hadoop
- Why HDFS
- Install Hadoop cluster - Hortonworks
- Learn MapReduce, YARN
- Programming Hadoop with Pig
- Programming Hadoop with Spark

04.
Cloudera Ecosystem

- Deploy a Cloudera Hadoop cluster
- Deploy Cloudera HBase cluster
- Host very large tables (billions of rows, millions of columns) - HBase
- Use Java API for client HBase access
- Vagrant Provision
- Get familiar with Sqoop, Pig & Hive

05.
MongoDB
- Intro to MongoDB
- Creating, Updating, and Deleting Documents
- Querying
- Indexes, Special Index and Collection Types
- Introduction to the Aggregation Framework
- Replication
- Administration

06.
Cassandra

- Introducing Cassandra
- Installing Cassandra
- The Cassandra Query Language
- Data Modeling
- The Cassandra Architecture
- Configuring Cassandra
- Clients
- Reading and Writing Data
- Administration

07.
ELK Stack

- ELK Architecture
- Set up ELK cluster
- Ingest logs from apps via Filebeat
- Create Kibana dashboards
- Logstash in-depth
- ELK in real-world use cases

I'M THE BIG DATA GURU!
