The document outlines 10 programming assignments involving data analysis and machine learning techniques like clustering, splitting datasets, handling missing values, and classification. Students are tasked with reading data from CSV files, preprocessing data, performing clustering and plotting results, and classification.
Assignment 3 DM Practical
1. Write a Python program to create your own data set of students, load it from the CSV file into a DataFrame, and perform clustering.
2. Write a Python program to create your own data set, load it from the CSV file into a DataFrame, perform clustering, and display the clusters in a scatter plot.
3. Write a Python program to load the Mall_customer.csv data set from Kaggle into a DataFrame and perform clustering.
4. Write a Python program to load the Mall_customer.csv data set from Kaggle into a DataFrame, perform clustering, and display the clusters in a scatter plot.
5. Write a Python program to load the Mall_customer.csv data set from Kaggle into a DataFrame, perform clustering, and display the clusters and their centroids in a scatter plot.
6. Write a Python program to drop the Id column from a given DataFrame and print the modified DataFrame. Use iris.csv to create the DataFrame.
7. Write a Python program to access the first four cells of a given DataFrame using the index and column labels. Use iris.csv to create the DataFrame.
8. Write a Python program using Scikit-learn to split the data set into 70% training data and 30% test data. Out of 150 records in total, the training set will contain 105 records and the test set 45 records. Print both data sets.
9. Write a Python program to display the total number of missing values and the result of dropna().
10. Write a Python program to accept a data set, perform if-based classification, and display the result.
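As a rough starting point for tasks 1 to 5, the sketch below clusters a small table with scikit-learn's KMeans and reads back the centroids. It is only one possible approach: the assignment does not name a clustering algorithm, so k-means is an assumption, as are the column names `marks` and `study_hours`, the choice of two clusters, and the synthetic data that stands in for the CSV file.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Synthetic stand-in for the students CSV (assumption). With a real file this
# would be something like: df = pd.read_csv("students.csv")
rng = np.random.default_rng(0)
low = rng.normal(loc=[40, 2], scale=[3, 0.5], size=(20, 2))
high = rng.normal(loc=[85, 8], scale=[3, 0.5], size=(20, 2))
df = pd.DataFrame(np.vstack([low, high]), columns=["marks", "study_hours"])

# Fit k-means with two clusters (assumed) and attach a label to every row.
model = KMeans(n_clusters=2, n_init=10, random_state=0)
df["cluster"] = model.fit_predict(df[["marks", "study_hours"]])

# One centroid per cluster, in the same column order as the features.
centroids = model.cluster_centers_

print(df.head())
print(centroids)
```

For the scatter-plot variants, the labelled rows could be passed to matplotlib, for example `plt.scatter(df["marks"], df["study_hours"], c=df["cluster"])`, with a second `plt.scatter` call over the two centroid columns to mark the centroids.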
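Tasks 6, 8 and 9 can be sketched together on one DataFrame. The example below uses a synthetic 150-row frame in place of iris.csv, so the column name `value` and the two injected missing cells are assumptions made for illustration; only the `Id` column and the 150-record size come from the assignment. A 70/30 split of 150 records gives 105 training rows and 45 test rows.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Synthetic stand-in for iris.csv (assumption): 150 rows with an Id column.
df = pd.DataFrame({
    "Id": np.arange(1, 151),
    "value": np.linspace(0.0, 1.0, 150),
})
# Inject two missing cells so the missing-value count is non-zero.
df.loc[[3, 10], "value"] = np.nan

# Task 6: drop the Id column and keep the rest.
features = df.drop(columns=["Id"])

# Task 9: total number of missing cells, then the frame without those rows.
missing_total = int(df.isna().sum().sum())
cleaned = df.dropna()

# Task 8: 70% train / 30% test split of all 150 records.
train, test = train_test_split(df, test_size=0.3, random_state=42)

print(features.head())
print("missing cells:", missing_total, "rows after dropna:", len(cleaned))
print("train rows:", len(train), "test rows:", len(test))
```

The fixed `random_state` only makes the shuffle repeatable; any integer, or none at all, would still give the same row counts.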