Data Science Presentation SSJ.01
Data Science Presentation SSJ.01
Presented by:
Under Guidance of
Prof. Dr. Jayant. V. Kulkarni
MAHINDRA & MAHINDRA (M&M) OVERVIEW
2) Agriculture
4) Renewable Energy
KEY POINTS :-
1. Data Cleaning
2. Data Science Pipeline
3. Descriptive Statistics and Distributions
4. Hypothesis Testing
5. Regression Analysis
6. Evaluation Metrics
7. Visualization Tools
Data Science Pipeline
• ‘The process and tools used to gather raw data from multiple sources, analyze it, and present the results in an
understandable format.’
Key Features:
Common Uses:
Key Features:
• Provides multi-dimensional arrays (ndarray) for efficient numerical operations.
• Supports vectorized computations for high performance.
• Includes mathematical functions like linear algebra, statistics, and Fourier transforms.
Common Uses:
• Handling large datasets efficiently.
• Performing mathematical and statistical operations.
• Serving as a foundation for libraries like Pandas, SciPy, and TensorFlow.
DATA OVERVIEW :- SALE’S AND SHARE PRICE
SALE’S DATA:-
1) Timeframe:- Mentioned the years covered (2015-2023)
2) Frequency :- Quarterly data.
STEPS TAKEN :-
a. Handling missing values :-
Checked and filled missing values.
b. Standardizing Formats :-
Converted columns like "Date" to proper datetime format.
c. Derived Metrics :-
Added a new column for Average Stock Price using,
CHALLENGES :-
a. Any inconsistencies in data (e.g., unexpected missing quarters or outliers).
b. Limited timeframe of data.
EXPLORATORY DATA ANALYSIS
A graph also displays the co-relation between sales and share price.
1. Correlation:
1. If the blue points are close to the red line, it indicates a strong linear
relationship between car sales and share price.
2. Trend:
1. The positive slope of the red line suggests that as car sales increase, the share
price also tends to increase.
REGRESSION ANALYSIS
Regression Equation: Share Price=0.0881×Car Sales−1320.74
Example:
•MSE: 123456.78 (adjust with your value)
•R²: 0.85 (adjust with your value)
Coefficient: 0.0881
Key Takeaways:
Intercept: -1320.74
•The regression model shows a positive relationship between car sales and share price.
•For every additional unit sold, the share price increases by 0.0881.
•The model explains approximately 85% of the variance in share price, suggesting a strong fit.
For example, if car sales increase by 10,000 units, the share price is expected to
increase by: ΔShare Price=0.0881×10000=881
INSIGHTS
Key Takeaways:
Next Steps:
• The analysis identified a moderate positive correlation between Sales and Close prices.
• The regression model effectively predicts share price trends with reasonable accuracy.
Next Steps: