Introduction to Data Science - University Lecture Notes
1. What is Data Science?
- Interdisciplinary field focused on extracting knowledge from data
- Key components: Statistics, Programming, Domain Knowledge
2. Data Science Workflow
- Problem Definition
- Data Collection
- Data Cleaning and Preparation
- Exploratory Data Analysis (EDA)
- Model Building
- Evaluation and Communication
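A minimal sketch of the cleaning and EDA steps above, assuming Pandas is installed. The table, its column names (area_sqft, bedrooms, price), and its values are hypothetical and stand in for whatever the data collection step produces.

```python
import pandas as pd

# Hypothetical toy data standing in for the output of "Data Collection".
raw = pd.DataFrame({
    "area_sqft": [850, 1200, None, 1500, 1200],
    "bedrooms": [2, 3, 3, 4, 3],
    "price": [150000, 210000, 205000, 280000, 210000],
})

# Data Cleaning and Preparation: drop exact duplicate rows, then fill the
# missing area with the median of the remaining values.
clean = raw.drop_duplicates().copy()
clean["area_sqft"] = clean["area_sqft"].fillna(clean["area_sqft"].median())

# Exploratory Data Analysis: summary statistics and the correlation of
# each column with the target column.
summary = clean.describe()
correlations = clean.corr()["price"]

print(summary)
print(correlations)
```

Filling with the median is only one possible choice; dropping incomplete rows or using a domain-specific rule may suit a real dataset better.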
3. Tools and Languages
- Python (Pandas, NumPy, Scikit-learn)
- R
- Jupyter Notebooks
4. Case Study Example: Predicting Housing Prices
- Data source: Kaggle dataset
- Used Linear Regression for prediction
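A rough sketch of how such a model could be fit with Scikit-learn. The notes do not say which Kaggle dataset or which features were used, so the data below is synthetic: the features (area, bedrooms), the coefficients, and the noise level are all assumptions made for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a housing dataset (assumption, not the Kaggle data).
rng = np.random.default_rng(0)
n = 200
area = rng.uniform(500, 3000, size=n)       # hypothetical feature: floor area
bedrooms = rng.integers(1, 6, size=n)       # hypothetical feature: 1 to 5 bedrooms
noise = rng.normal(0, 10000, size=n)
price = 50000 + 100 * area + 15000 * bedrooms + noise

# Hold out 25% of the rows so the model is evaluated on unseen data.
X = np.column_stack([area, bedrooms])
X_train, X_test, y_train, y_test = train_test_split(
    X, price, test_size=0.25, random_state=0
)

# Model Building: ordinary least squares linear regression.
model = LinearRegression()
model.fit(X_train, y_train)

# Evaluation: mean absolute error and R^2 on the held-out rows.
pred = model.predict(X_test)
mae = mean_absolute_error(y_test, pred)
r2 = model.score(X_test, y_test)

print("coefficients:", model.coef_)
print("intercept:", model.intercept_)
print("MAE:", mae)
print("R^2:", r2)
```

With real data, the same steps apply after loading the file into a DataFrame and selecting the feature and target columns.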
5. Further Reading
- 'Python for Data Analysis' by Wes McKinney
- 'Hands-On Machine Learning with Scikit-Learn and TensorFlow' by Aurélien Géron