0% found this document useful (1 vote)
53 views

Big Data Analytic Lab Syllabus

This document contains a syllabus for a Big Data Analytics lab course divided into 3 units totaling 15 hours. The syllabus lists 9 experiments mapping to course outcomes involving configuring Hadoop clusters, performing MapReduce jobs, interacting with HDFS, using Apache Spark, connecting to MongoDB databases, modeling social network data in NoSQL, performing data migration to NoSQL, performing predictive analytics with MLlib, performing machine learning with scikit-learn, and designing a prescriptive analytics solution for a case study.

Uploaded by

sujaniankratos68
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (1 vote)
53 views

Big Data Analytic Lab Syllabus

This document contains a syllabus for a Big Data Analytics lab course divided into 3 units totaling 15 hours. The syllabus lists 9 experiments mapping to course outcomes involving configuring Hadoop clusters, performing MapReduce jobs, interacting with HDFS, using Apache Spark, connecting to MongoDB databases, modeling social network data in NoSQL, performing data migration to NoSQL, performing predictive analytics with MLlib, performing machine learning with scikit-learn, and designing a prescriptive analytics solution for a case study.

Uploaded by

sujaniankratos68
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

BIG DATA ANALYTICS LAB SYLLABUS (20CSP/ITP-471)

S. No. Name of Experiments Hours /Cos


Mapped
Unit-1 15 Hours
1.1 Write a program to configure a small Hadoop cluster with at least one master CO1
and two worker nodes.

1.2 Write a program for Map Reduce to analyses a dataset and understand the CO2
MapReduce workflow with execution on the Hadoop cluster.

1.3 Write a program to interact with HDFS using the Hadoop File System API to CO2
create a new file, write some content to it, and then read and display the
content from the file.

1.4 Write a program using Apache Spark to process a large dataset. CO2

Unit -2
2.1 Write a program to connect to a MongoDB instance to create a new database and a CO3
collection, insert multiple documents into the collection, and then query and display
the documents.
2.2 Write a program in to model a sample dataset for a social networking application in a CO3
NoSQL database.
2.3 Write a program using a suitable NoSQL driver (e.g., pymongo for MongoDB) to CO3
perform data migration from a CSV file to a NoSQL database.
Unit-3
3.1 Write a program using MLlib to perform predictive analytics on a large dataset. CO5
Choose a suitable machine-learning algorithm to predict a target variable.
3.2 Write a Python program using a machine learning library (e.g., scikit-learn) to perform CO5
feature engineering and evaluate a predictive model.
3.3 Write a program to design a prescriptive analytics solution for one of the real-world CO4
case studies (Walmart, Uber, Netflix, or eBay).

You might also like