IIT Project
PROJECT
NAME: NITHISHWARAN. G
SUBMISSION DATE: 20/12/24
2.INTRODUCTION
Objective:
ANALYSIS
KEY FINDINGS:
1. Temperature (°C):
Maximum: 45.0°C
Minimum: 12.0°C
Average: 30.79°C
Median: 32.0°C
2. Humidity (%):
Maximum: 100%
Minimum: 6%
Average: 36.34%
Median: 34.0%
3. Pressure (hPa):
Maximum: 1026 hPa
Minimum: 994 hPa
Average: 1007.74 hPa
Median: 1008.0 hPa
4. Visibility (km):
Maximum: 55.0 km
Minimum: 0.2 km
Average: 2.42 km
Median: 2.0 km
5. Rain Presence:
Rainy Days: 18
Dry Days: 712
6. Most Frequent Values:
Weather Condition: Haze
Wind Direction: WNW (West-Northwest)
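The maximum, minimum, average, median, and most frequent value reported above can be computed with Python's standard library. The sketch below is only an illustration: the sample readings and the helper name `summarize` are assumptions, not the project's dataset or code.

```python
import statistics

# Hypothetical sample readings (illustrative only, not the project data).
temperatures = [12.0, 28.5, 32.0, 35.5, 45.0]          # °C
conditions = ["Haze", "Smoke", "Haze", "Clear", "Haze"]  # weather condition labels


def summarize(values):
    """Return the maximum, minimum, average, and median of a list of numbers."""
    return {
        "maximum": max(values),
        "minimum": min(values),
        "average": round(statistics.mean(values), 2),
        "median": statistics.median(values),
    }


summary = summarize(temperatures)
# The mode is the most frequent value, as used for weather condition and wind direction.
most_frequent = statistics.mode(conditions)

print(summary)
print(most_frequent)
```

The same `summarize` call could be repeated for the humidity, pressure, and visibility columns.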
3.EXPLORATORY DATA ANALYSIS
GRAPHICAL REPRESENTATION
4.METHODOLOGY
K-Nearest Neighbors (K-NN) Classification
Steps:
1. Distance Calculation:
Calculate the distance between the new data point and each of the training data points.
Common distance metrics include Euclidean distance, Manhattan distance, and Minkowski distance.
2. Selecting Nearest Neighbors:
Sort the training data points based on their distance from the new data point.
Select the k closest neighbors.
3. Predicting Outcomes:
Determine the most frequent class label among the k nearest neighbors.
Assign this class label to the new data point.
K-Means Clustering
Steps:
1. Initialization:
Randomly select k data points as initial cluster centroids.
2. Assignment:
Calculate the distance between each data point and each cluster centroid.
Assign each data point to the cluster with the closest centroid.
3. Recomputation:
Calculate the new cluster centroids by taking the mean of all data points assigned to each cluster.
Repeat steps 2 and 3 until the cluster assignments no longer change or a maximum number of iterations is reached.
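The three K-NN steps listed above can be sketched in a few lines of Python. This is a minimal illustration, not the project's implementation: the function name `knn_predict`, the choice of Euclidean distance, and the (temperature, humidity) sample readings are all assumptions made for the example.

```python
import math
from collections import Counter


def knn_predict(training_points, training_labels, new_point, k=3):
    """Predict a label for new_point by majority vote among its k nearest neighbors."""
    # Step 1: Euclidean distance from the new point to every training point.
    distances = [
        (math.dist(point, new_point), label)
        for point, label in zip(training_points, training_labels)
    ]
    # Step 2: sort by distance and keep the k closest neighbors.
    distances.sort(key=lambda pair: pair[0])
    nearest_labels = [label for _, label in distances[:k]]
    # Step 3: the most frequent label among the neighbors is the prediction.
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical (temperature °C, humidity %) readings, illustrative only.
training_points = [(30.0, 85.0), (28.0, 90.0), (35.0, 30.0), (40.0, 20.0), (38.0, 25.0)]
training_labels = ["Rain", "Rain", "No Rain", "No Rain", "No Rain"]

humid_prediction = knn_predict(training_points, training_labels, (29.0, 80.0), k=3)
dry_prediction = knn_predict(training_points, training_labels, (37.0, 28.0), k=3)
print(humid_prediction, dry_prediction)
```

In practice the features would usually be scaled first, since humidity and temperature are on different ranges and the larger range dominates the distance.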
VISUAL REPRESENTATION
5.RESULT
Explanation (K-NN):
Data Point A: The nearest neighbors are B, C, and D. Two of these neighbors have the outcome "Rain," so the predicted outcome for A is "Rain."
Data Point B: The nearest neighbors are E, F, and G. All three of these neighbors have the outcome "No Rain," so the predicted outcome for B is "No Rain."
Data Point C: The nearest neighbors are A, D, and H. Two of these neighbors have the outcome "Rain," so the predicted outcome for C is "Rain."
Explanation (K-Means):
Cluster 1: Data points A, B, and C are assigned to this cluster. The centroid of this cluster is calculated as the average of the coordinates of these points.
Cluster 2: Data points D, E, and F are assigned to this cluster. The centroid of this cluster is calculated as the average of the coordinates of these points.
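The cluster explanation, where each centroid is the average of the coordinates of its assigned points, can be made concrete with a small K-Means sketch. Everything here is an assumption for illustration: the function name `kmeans`, the fixed random seed, the squared Euclidean distance, and the six two-dimensional points standing in for A to F.

```python
import random


def kmeans(points, k, max_iterations=100, seed=0):
    """Cluster points into k groups; returns (centroids, assignments)."""
    rng = random.Random(seed)
    # Initialization: pick k distinct data points as the starting centroids.
    centroids = rng.sample(points, k)
    assignments = None
    for _ in range(max_iterations):
        # Assignment: index of the closest centroid (squared Euclidean distance).
        new_assignments = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Stop when no point changes cluster.
        if new_assignments == assignments:
            break
        assignments = new_assignments
        # Recomputation: each centroid becomes the mean of its assigned points.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(coord) / len(members) for coord in zip(*members))
    return centroids, assignments


# Hypothetical coordinates for points A to F, illustrative only.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, assignments = kmeans(points, k=2)
print(centroids)
print(assignments)
```

With these sample points, the first three end up in one cluster and the last three in the other, and each centroid is the coordinate-wise mean of its three members.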
UNIQUE INSIGHTS:
This project aimed to uncover the relationship between hours studied and
exam scores. We found a strong positive correlation: more study time was
associated with higher scores. This highlights the importance of consistent
effort in academic success. Data science and AI can be powerful tools for
extracting meaningful insights from large datasets, enabling data-driven
decision-making in various fields.
REFERENCES
CHATGPT: I USED IT TO GENERATE SOME TEXT THAT I CAN'T TYPE IN PROPER ENGLISH.
GEMINI: IT GENERATED SOME TABLES FOR ME, WHICH I USED IN THIS PROJECT.
OTHER WEBSITES: CONVERITO, GOOGLE SHEETS, EXCEL, POWERPOINT, CANVA