Assignment
1. (Marks: 10)
(Feature masking as regularization) Consider a linear regression model learned by minimizing the squared loss function $\sum_{n=1}^{N} (y_n - w^\top x_n)^2$. Suppose we decide to mask out or "drop" each feature $x_{nd}$ of each input $x_n \in \mathbb{R}^D$ independently with probability $(1-p)$ (equivalently, retaining the feature with probability $p$). Masking or dropping out means that we set the feature $x_{nd}$ to 0 with probability $(1-p)$. This is equivalent to replacing each input $x_n$ by $\tilde{x}_n = x_n \circ m_n$, where $\circ$ denotes the element-wise product and $m_n$ denotes a $D \times 1$ binary mask vector with $m_{nd} \sim \mathrm{Bernoulli}(p)$ ($m_{nd} = 1$ means the feature $x_{nd}$ was retained; $m_{nd} = 0$ means it was masked/zeroed).
Let us now define a new loss function using these masked inputs: $\sum_{n=1}^{N} (y_n - w^\top \tilde{x}_n)^2$. Show that minimizing the expected value of this new loss function (the expectation is needed since the mask vectors $m_n$ are random) is equivalent to minimizing a regularized loss function. Clearly write down the expression of this regularized loss function.
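To make the setup concrete, here is a minimal NumPy sketch of the masking mechanism; the sizes N and D, the retain probability p, and the synthetic data are illustrative assumptions, not part of the question:

import numpy as np

rng = np.random.default_rng(0)
N, D, p = 100, 5, 0.8                 # assumed sizes and retain probability

X = rng.normal(size=(N, D))           # rows are the inputs x_n
w = rng.normal(size=D)                # weight vector
y = X @ w + 0.1 * rng.normal(size=N)  # synthetic targets

# Draw one Bernoulli(p) mask m_n per input and form x_tilde_n = x_n ∘ m_n.
M = rng.binomial(1, p, size=(N, D))
X_masked = X * M

# The new (random) squared loss whose expectation over the masks the
# question asks you to analyze.
masked_loss = np.sum((y - X_masked @ w) ** 2)
print(masked_loss)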
3. Explain the issues with current AI models and the problem of the unity of perception. (Marks: 4)
4. What is the information gain of a2 relative to these training examples? Provide the equation for calculating information gain.
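For reference, information gain is conventionally defined in terms of entropy; in the generic notation below, $S$ is the set of training examples, $a$ is an attribute, $S_v$ is the subset of $S$ on which $a = v$, and $p_c$ is the fraction of examples in a set belonging to class $c$:

$\mathrm{IG}(S, a) = H(S) - \sum_{v \in \mathrm{Values}(a)} \frac{|S_v|}{|S|}\, H(S_v), \qquad H(S) = -\sum_{c} p_c \log_2 p_c$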
5. How do you handle collinearity in a linear regression model? (Hint: read about the assumptions of linear regression)
(Marks: 2)
6. Given a dataset for utility fraud detection, you built a classifier model that achieved a performance score of
98.5%. Is this a good model? If yes, justify your answer. If not, what can you do to improve it? (Marks: 2)
7. Why would you prune a decision tree? (Marks: 2)
6. Add a new column called YearsInCompany that shows the number of years each employee has been in the company.
7. Filter and display the data of employees who are in the 'Finance' department.
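A minimal pandas sketch of these two operations, assuming a hypothetical employee DataFrame df; the column names Name, Department, and JoiningYear and the reference year 2024 are assumptions made only for this example (YearsInCompany and 'Finance' come from the questions):

import pandas as pd

# Hypothetical employee data (column names other than YearsInCompany are assumed).
df = pd.DataFrame({
    "Name": ["Asha", "Ravi", "Meera"],
    "Department": ["Finance", "HR", "Finance"],
    "JoiningYear": [2015, 2019, 2021],
})

# 6. Add the YearsInCompany column (assumes 2024 as the current year).
df["YearsInCompany"] = 2024 - df["JoiningYear"]

# 7. Filter and display the employees in the 'Finance' department.
finance_df = df[df["Department"] == "Finance"]
print(finance_df)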
9. (Marks: 9)
Derive the corresponding equations and solutions for the primal and dual problems of the binary SVM for the cases below: