Assignment 1
Assignment 1
1. Explain the concept of least square estimation in the context of linear regression. How
would you use this technique to build a predictive model for house prices? Describe
how you would evaluate the model's accuracy and what steps you would take to
improve the model’s performance if necessary.
2. Develop a business case for the need for machine learning in this recommendation
system. Explain the difference between supervised and unsupervised learning and
identify which type of machine learning algorithm would be suitable for this scenario.
Outline the steps involved in building and evaluating the machine learning model.
3. Explain the importance of handling missing values in data modeling. Propose different
imputation techniques that can be used to fill in the missing data and justify which
method would be most appropriate for this scenario. Discuss how missing data can
impact the accuracy of the predictive model.
4. Explain the concept of data quality. Discuss how issues such as noise, outliers, missing
values, and duplicate data can impact data analytics. Provide methods to address each
of these issues.
5. What is data architecture in data analytics? Discuss the components involved in
designing a data architecture and the role of data management in ensuring efficient data
processing and analysis