Statistical Modelling For Machine Learning SML DSC 3 1 4 2 10 5 3 30 70 100 40 25 10 50@ 20 25 10 200
Statistical Modelling For Machine Learning SML DSC 3 1 4 2 10 5 3 30 70 100 40 25 10 50@ 20 25 10 200
I. RATIONALE
Machine Learning refers to the automated identification of patterns in data. This course is included in
curriculum to establish foundation for Artificial Intelligence and Machine Learning. Statistic, Probability,
Interpolation and sampling methods are the core components of AI/ML. This course will enable students to
implement mathematical concepts using R-Programming which will enhance the knowledge and skills to use the
methodology for solving AI/ML based problems of various domains.
CO1 - Solve the given problem based on Statistic Techniques using R-Programming.
CO2 - Implement Statistic methods using R-Programming.
CO3 - Use Principles of Probability to solve given Problem.
CO4 - Implement appropriate method based on the Interpolation.
CO5 - Apply Sampling Methods to solve given problem using R-Programming.
1. FA-TH represents average of two class tests of 30 marks each conducted during the semester.
2. If candidate is not securing minimum passing marks in FA-PR of any course then the candidate shall be
declared as "Detained" in that semester.
3. If candidate is not securing minimum passing marks in SLA of any course then the candidate shall be
declared as fail and will have to repeat and resubmit SLA work.
4. Notional Learning hours for the semester are (CL+LL+TL+SL)hrs.* 15 Weeks
5. 1 credit is equivalent to 30 Notional hrs.
6. * Self learning hours shall not be reflected in the Time Table.
7. * Self learning includes micro project / assignment / other activities.
VII. SUGGESTED MICRO PROJECT / ASSIGNMENT/ ACTIVITIES FOR SPECIFIC LEARNING / SKILLS
DEVELOPMENT (SELF LEARNING)
Assignment
Collect data of at least 05 real world examples and test the Hypothesis of sampling distribution.
Collect data of at least 05 real world examples and calculate Measures of skewness and kurtosis and prepare t
document.
Collect data of at least 05 real world examples and draw/fit straight line and second-degree polynomial.
Collect data of at least 05 real world examples and calculate probability using Bayes’ theorem.
Collect data of at least 03 city like cost of living and temperature data etc. and interpolate the missing index
number for it and prepare the document.
Micro project
Analyze Uber Data: Analyze different parameters like the number of trips made in a day, the number of trips
during a particular month, average passenger that uber can have in a day, the peak hours where more customer
are available, maximum number of trips found on day of the month, etc.
Implement each least squares regression technique using a programming language such as Python or R. Utiliz
libraries like scikit-learn or stats models for implementation, ensuring proper parameter tuning and
regularization settings for each technique.
Collect temperature data from different locations at various times of the day. Use interpolation techniques suc
as linear interpolation or spline interpolation to estimate the temperature at specific times and locations where
data is not available.
MSBTE Approval Dt. 02/07/2024 Semester - 3, K Scheme
Page 5/8
17-05-2025 09:20:46 P
Note :
Above is just a suggestive list of microprojects and assignments; faculty must prepare their own bank of
microprojects, assignments, and activities in a similar way.
The faculty must allocate judicial mix of tasks, considering the weaknesses and / strengths of the student in
acquiring the desired skills.
If a microproject is assigned, it is expected to be completed as a group activity.
SLA marks shall be awarded as per the continuous assessment record.
For courses with no SLA component the list of suggestive microprojects / assignments/ activities are
optional, faculty may encourage students to perform these tasks for enhanced learning experiences.
If the course does not have associated SLA component, above suggestive listings is applicable to Tutorials
and maybe considered for FA-PR evaluations.
IX. SUGGESTED WEIGHTAGE TO LEARNING EFFORTS & ASSESSMENT PURPOSE (Specification Table)
Sr.No Unit Unit Title Aligned COs Learning Hours R-Level U-Level A-Level Total Marks
1 I Statistical Techniques CO1 10 2 6 12 20
2 II Statistical Methods CO2 10 2 4 8 14
3 III Probability of Random Variable CO3 7 2 2 4 8
4 IV Interpolation CO4 10 2 4 8 14
5 V Sampling Methods CO5 8 2 4 8 14
Grand Total 45 10 20 40 70
X. ASSESSMENT METHODOLOGIES/TOOLS
Formative assessment (Assessment for Learning)
Laboratory Performance, Unit Tests , Midterm Exam, Self-learning, Term Work, Seminar/Presentations.
Continuous assessment based on process and product related performance indicators.
Each practical will be assessed considering 60% weightage to process and 40% weightage to product.
Page 8/8