Minor - Data Science
Minor - Data Science
51. Derive the mathematical formulation of linear regression using the least squares
method. Explain how gradient descent optimizes the regression model.
52. Explain the working of logistic regression using the sigmoid function. Derive the
cost function for logistic regression and explain how it is minimized using
gradient descent.
53. Explain the K-means clustering algorithm in detail, including the initialization
problem and the role of the elbow method in determining the optimal number of
clusters.
54. Apply Principal Component Analysis (PCA) on a high-dimensional dataset.
Visualize the reduced dimensions and explain how PCA helps in feature
extraction and reducing computational complexity.
55. A biometric security system needs to recognize handwritten signatures for
authentication. Explain how SVM can be applied for signature verification and
how feature extraction techniques impact accuracy.
56. Compare Lasso and Ridge regression. Explain how L1 and L2 regularization
techniques impact feature selection and model performance.
57. Financial analyst wants to predict stock prices using multiple economic
indicators. Explain how model selection techniques like cross-validation, feature
selection, and regularization (Lasso, Ridge) can help in building a robust
predictive model.
58. A company wants to predict its sales based on advertising expenditures across
TV, radio, and newspapers. Develop a linear regression model to forecast sales
and interpret the impact of each marketing channel on revenue.
59. Evaluate a dataset of your choice and identify if linear regression is a suitable
model. Justify your answer.
60. Develop a simple linear regression model using Python to predict housing prices.
Explain each step of the implementation.