0% found this document useful (0 votes)
3 views2 pages

Minor - Data Science

The document contains a comprehensive set of questions related to various machine learning concepts, including linear regression, logistic regression, clustering, PCA, and SVM. It covers definitions, equations, objectives, and applications of these techniques, as well as model selection and regularization methods. The document serves as a guide for understanding fundamental machine learning principles and their practical implementations.

Uploaded by

kumawatharsh191
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views2 pages

Minor - Data Science

The document contains a comprehensive set of questions related to various machine learning concepts, including linear regression, logistic regression, clustering, PCA, and SVM. It covers definitions, equations, objectives, and applications of these techniques, as well as model selection and regularization methods. The document serves as a guide for understanding fundamental machine learning principles and their practical implementations.

Uploaded by

kumawatharsh191
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Important Question

1. What is linear regression?


2. Write the equation of a simple linear regression model.
3. What is the objective of linear regression?
4. Explain the concept of the cost function in linear regression.
5. What is the significance of the slope and intercept in linear regression?
6. Define logistic regression.
7. What type of problems are solved using logistic regression?
8. Explain the sigmoid function and its importance.
9. What is the decision boundary in logistic regression?
10. How do you interpret the output of a logistic regression model?
11. What is clustering in machine learning?
12. Differentiate between K-means and hierarchical clustering.
13. Explain the purpose of the elbow method in K-means clustering.
14. What is PCA (Principal Component Analysis)?
15. How does PCA help in dimensionality reduction?
16. What is a Support Vector Machine (SVM)?
17. Explain the concept of a hyperplane in SVM.
18. What is the kernel trick in SVM?
19. How does SVM classify non-linear data?
20. What is the role of support vectors in SVM?
21. What is model selection in machine learning?
22. Define Lasso regression.
23. How does Ridge regression differ from Lasso regression?
24. What is the main purpose of using regularization in regression?
25. Explain the concept of overfitting and how it can be prevented using Lasso or
Ridge regression.

26. What is linear regression, and where is it applied?


27. Write the equation for multiple linear regression.
28. What is the purpose of the mean squared error (MSE) in linear regression?
29. Explain the concept of R-squared in linear regression.
30. How do you check the accuracy of a linear regression model?
31. What is logistic regression and how does it differ from linear regression?
32. Define the sigmoid function and its mathematical form.
33. What is the range of output in logistic regression?
34. Explain the concept of the threshold value in logistic regression.
35. How is logistic regression used in binary classification?
36. What is the primary objective of clustering?
37. Differentiate between hard clustering and soft clustering.
38. Explain the working of the K-means algorithm.
39. What is the role of the linkage criterion in hierarchical clustering?
40. How does PCA reduce the dimensionality of a dataset?
41. Define Support Vector Machine (SVM).
42. What is a hyperplane in SVM, and how is it determined?
43. Explain the concept of the margin in SVM.
44. What are support vectors in SVM?
45. How does SVM handle linearly inseparable data using kernels?

46. What is model selection, and why is it important?


47. Define Lasso regression and its primary use.
48. How does Ridge regression prevent overfitting?
49. Explain the difference between Lasso and Ridge regression.
50. What is cross-validation, and how is it used for model selection?

51. Derive the mathematical formulation of linear regression using the least squares
method. Explain how gradient descent optimizes the regression model.
52. Explain the working of logistic regression using the sigmoid function. Derive the
cost function for logistic regression and explain how it is minimized using
gradient descent.
53. Explain the K-means clustering algorithm in detail, including the initialization
problem and the role of the elbow method in determining the optimal number of
clusters.
54. Apply Principal Component Analysis (PCA) on a high-dimensional dataset.
Visualize the reduced dimensions and explain how PCA helps in feature
extraction and reducing computational complexity.
55. A biometric security system needs to recognize handwritten signatures for
authentication. Explain how SVM can be applied for signature verification and
how feature extraction techniques impact accuracy.
56. Compare Lasso and Ridge regression. Explain how L1 and L2 regularization
techniques impact feature selection and model performance.
57. Financial analyst wants to predict stock prices using multiple economic
indicators. Explain how model selection techniques like cross-validation, feature
selection, and regularization (Lasso, Ridge) can help in building a robust
predictive model.
58. A company wants to predict its sales based on advertising expenditures across
TV, radio, and newspapers. Develop a linear regression model to forecast sales
and interpret the impact of each marketing channel on revenue.
59. Evaluate a dataset of your choice and identify if linear regression is a suitable
model. Justify your answer.
60. Develop a simple linear regression model using Python to predict housing prices.
Explain each step of the implementation.

You might also like