0% found this document useful (0 votes)
15 views

Assignment 3

Uploaded by

wizznajka10
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views

Assignment 3

Uploaded by

wizznajka10
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

**Part One: Exploratory Concepts and Application (50 Marks)**

**Name:** Abdimajiid Mohamed Daahir

**Class:** BCS15_B

In this report, we delve into key data science concepts by exploring essential Python libraries and

analyzing the power of data visualization.

---

### **Section 1: Exploring Data Science Concepts (25 Marks)**

#### **1. Python Libraries in Action**

In this section, we examine three essential Python libraries-Pandas, NumPy, and Matplotlib-to

understand their contributions to a typical data science workflow.

##### **Pandas**

Pandas is a powerful library for data manipulation and analysis. It provides data structures like

DataFrames and Series that simplify handling structured data.

**Example Use Case:**

```python

import pandas as pd
# Loading data

data = pd.read_csv("data.csv")

# Data manipulation

filtered_data = data[data["Age"] > 25]

# Summarizing data

summary = filtered_data.describe()

print(summary)

```

**Typical Workflow:**

- Importing and cleaning datasets

- Data filtering and transformation

- Summarizing and analyzing structured data

##### **NumPy**

NumPy is a fundamental package for numerical computations in Python. It provides support for

arrays, mathematical operations, and linear algebra.

**Example Use Case:**

```python

import numpy as np
# Creating arrays

array = np.array([1, 2, 3, 4, 5])

# Performing operations

mean_value = np.mean(array)

print("Mean:", mean_value)

```

**Typical Workflow:**

- Handling multidimensional arrays

- Performing mathematical computations

- Supporting operations for machine learning models

##### **Matplotlib**

Matplotlib is a versatile library for creating static, animated, and interactive visualizations in Python.

**Example Use Case:**

```python

import matplotlib.pyplot as plt

# Creating a line plot

x = [1, 2, 3, 4, 5]

y = [10, 20, 25, 30, 40]

plt.plot(x, y, marker="o")

plt.title("Line Plot Example")


plt.xlabel("X-axis")

plt.ylabel("Y-axis")

plt.show()

```

**Typical Workflow:**

- Plotting and visualizing data trends

- Customizing visual elements for clarity

- Presenting insights effectively

---

### **Section 2: The Power of Data Visualization**

#### **Selected Research Paper:**

**"An Introduction to Data Visualization Tools and Techniques in Various Domains" by Deepmala

Srivastava, published in the International Journal of Computer Trends and Technology (2023).**

[ResearchGate

Link](https://www.researchgate.net/publication/370593444_An_Introduction_to_Data_Visualization_

Tools_and_Techniques_in_Various_Domains)

#### **Overview of the Visualization**

The paper presents a bar chart comparing the effectiveness of different data visualization tools

across various domains, such as business, healthcare, and environmental sciences.


#### **Key Elements of the Visualization**

- **Type of Data Represented:**

- **Categorical Data:** Different domains (e.g., business, healthcare).

- **Numerical Data:** Effectiveness scores of visualization tools.

- **Visualization Features:**

- Bars representing effectiveness scores for each tool within specific domains.

- Color coding to differentiate between tools.

- Clear axis labels and a legend for easy interpretation.

#### **Effectiveness of the Visualization**

- **Strengths:**

- **Comparative Analysis:** Enables quick comparison of tool effectiveness across domains.

- **Clarity:** Well-labeled axes and legend enhance readability.

- **Color Differentiation:** Assists in distinguishing between multiple tools.

- **Weaknesses:**

- **Lack of Interactivity:** A static bar chart limits user engagement.

- **Data Granularity:** Aggregated effectiveness scores may overlook specific tool features.

#### **Insights Gained**

The visualization highlights that certain tools excel in specific domains. For instance, Tableau shows

high effectiveness in business analytics, while specialized tools are more effective in healthcare data

visualization. This underscores the importance of selecting appropriate visualization tools tailored to

domain-specific requirements.
---

**Sources:**

- Srivastava, D. (2023). *An Introduction to Data Visualization Tools and Techniques in Various

Domains*. International Journal of Computer Trends and Technology.

You might also like