**Part One: Exploratory Concepts and Application (50 Marks)**
**Name:** Abdimajiid Mohamed Daahir
**Class:** BCS15_B

In this report, we delve into key data science concepts by exploring essential Python libraries and analyzing

---

**Section 1: Exploring Data Science Concepts (25 Marks)**

**1. Python Libraries in Action**

In this section, we examine three essential Python libraries (Pandas, NumPy, and Matplotlib) to understand

**Pandas**

Pandas is a powerful library for data manipulation and analysis. It provides data structures like DataFrames

**Example Use Case:**

```python
import pandas as pd

# Loading data
data = pd.read_csv("data.csv")

# Data manipulation
filtered_data = data[data["Age"] > 25]

# Summarizing data
summary = filtered_data.describe()
print(summary)
```

**Typical Workflow:**
- Importing and cleaning datasets
- Data filtering and transformation
- Summarizing and analyzing structured data

**NumPy**

NumPy is a fundamental package for numerical computations in Python. It provides support for arrays, mat

**Example Use Case:**

```python
import numpy as np

# Creating arrays
array = np.array([1, 2, 3, 4, 5])

# Performing operations
mean_value = np.mean(array)
print("Mean:", mean_value)
```

**Typical Workflow:**
- Handling multidimensional arrays
- Performing mathematical computations
- Supporting operations for machine learning models

**Matplotlib**

Matplotlib is a versatile library for creating static, animated, and interactive visualizations in Python.

**Example Use Case:**

```python
import matplotlib.pyplot as plt

# Creating a line plot
x = [1, 2, 3, 4, 5]
y = [10, 20, 25, 30, 40]
plt.plot(x, y, marker="o")
plt.title("Line Plot Example")
plt.xlabel("X-axis")
plt.ylabel("Y-axis")
plt.show()
```

**Typical Workflow:**
- Plotting and visualizing data trends
- Customizing visual elements for clarity
- Presenting insights effectively

---

**Section 2: The Power of Data Visualization**

**Selected Research Paper:** "An Introduction to Data Visualization Tools and Techniques in Various Domains" by Deepmala Srivastava, [ResearchGate Link](https://www.researchgate.net/publication/370593444_An_Introduction_to_Data_Visua

**Overview of the Visualization**

The paper presents a bar chart comparing the effectiveness of different data visualization tools across vario

**Key Elements of the Visualization**
- **Type of Data Represented:**
  - **Categorical Data:** Different domains (e.g., business, healthcare).
  - **Numerical Data:** Effectiveness scores of visualization tools.
- **Visualization Features:**
  - Bars representing effectiveness scores for each tool within specific domains.
  - Color coding to differentiate between tools.
  - Clear axis labels and a legend for easy interpretation.

**Effectiveness of the Visualization**
- **Strengths:**
  - **Comparative Analysis:** Enables quick comparison of tool effectiveness across domains.
  - **Clarity:** Well-labeled axes and legend enhance readability.
  - **Color Differentiation:** Assists in distinguishing between multiple tools.
- **Weaknesses:**
  - **Lack of Interactivity:** A static bar chart limits user engagement.
  - **Data Granularity:** Aggregated effectiveness scores may overlook specific tool features.

**Insights Gained**

The visualization highlights that certain tools excel in specific domains. For instance, Tableau shows high e

---

**Sources:**
- Srivastava, D. (2023). *An Introduction to Data Visualization Tools and Techniques in Various Domains*.
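
**Supplementary Sketch: Grouped Bar Chart**

As a supplement to Section 2, the kind of chart described there (bars per tool within each domain, color coding, labeled axes, and a legend) can be approximated with Matplotlib. This is a minimal sketch under stated assumptions, not a reproduction of the paper's figure: the tool names `Tool A` and `Tool B`, the scores, the helper name `plot_grouped_bars`, and the output filename are all hypothetical placeholders and are not taken from the paper.

```python
import matplotlib.pyplot as plt
import numpy as np

# Placeholder values for illustration only; they are NOT taken from the paper.
domains = ["Business", "Healthcare"]
scores = {
    "Tool A": [8, 6],
    "Tool B": [5, 9],
}


def plot_grouped_bars(domains, scores, bar_width=0.35):
    """Draw one bar per tool within each domain and return (fig, ax)."""
    positions = np.arange(len(domains))  # one slot per domain on the x-axis
    fig, ax = plt.subplots()
    for i, (tool, values) in enumerate(scores.items()):
        # Shift each tool's bars sideways so they sit next to each other
        offset = (i - (len(scores) - 1) / 2) * bar_width
        ax.bar(positions + offset, values, width=bar_width, label=tool)
    ax.set_xticks(positions)
    ax.set_xticklabels(domains)
    ax.set_xlabel("Domain")
    ax.set_ylabel("Effectiveness score (placeholder)")
    ax.set_title("Grouped Bar Chart Sketch")
    ax.legend(title="Tool")  # the legend maps each color to a tool
    return fig, ax


fig, ax = plot_grouped_bars(domains, scores)
fig.savefig("grouped_bar_sketch.png")  # hypothetical output filename
```

Each call to `ax.bar` gets its own color from Matplotlib's default color cycle, which provides the color differentiation noted under Strengths. Replacing `fig.savefig(...)` with `plt.show()` displays the chart in a window instead of writing a file.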