Big Data Analytics Overview
Big Data Analytics Overview
Big Data refers to extremely large datasets that are too complex and vast for
traditional data processing software to manage effectively. It involves the
collection, storage, and analysis of data from various sources to extract
meaningful insights.
1. Data
Big Data analytics involves using tools and techniques to extract insights:
Cloud solutions provide scalable infrastructure for Big Data storage and
processing:
HDFS Architecture:
1.2.2 MapReduce
2.1 Hive
2.2 Pig
2.3 HBase
2.4 Sqoop
5. Conclusion
3. Introduction to MySQL
Each column in a table has a specific data type to define the kind of values
it can store.
CRUD stands for Create, Read, Update, and Delete, which are the core
operations used for managing database records. In addition to these, MySQL
provides commands to modify the structure of a table and enhance
database management.
1. Create (INSERT) – Adding New Data
INSERT INTO students (id, name, age) VALUES (1, 'John Doe', 22);
Renaming a Column
Removing a Column
Conclusion
INSERT INTO students (id, name, age) VALUES (2, 'Alice', 21);
Types of Joins
LEFT JOIN – Returns all records from the left table and matching
records from the right table.
RIGHT JOIN – Returns all records from the right table and matching
records from the left table.
A Super Key is a set of one or more attributes (columns) that can uniquely
identify a row in a table.
A Primary Key (PK) is the main key chosen from the Candidate Keys to
uniquely identify each row in a table.
Example
Conclusion