0% found this document useful (0 votes)
9 views4 pages

Hehehehe

Uploaded by

Rayhan Zheno
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views4 pages

Hehehehe

Uploaded by

Rayhan Zheno
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

10/26/24, 3:14 PM 10623005_2SA05_Abiyyu_Ukbah_Sembiring_Meliala - Colab

!pip install seaborn

# Import libraries
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

Requirement already satisfied: seaborn in /usr/local/lib/python3.10/dist-packages (0.


Requirement already satisfied: numpy!=1.24.0,>=1.20 in /usr/local/lib/python3.10/dist
Requirement already satisfied: pandas>=1.2 in /usr/local/lib/python3.10/dist-packages
Requirement already satisfied: matplotlib!=3.6.1,>=3.4 in /usr/local/lib/python3.10/d
Requirement already satisfied: contourpy>=1.0.1 in /usr/local/lib/python3.10/dist-pac
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.10/dist-package
Requirement already satisfied: fonttools>=4.22.0 in /usr/local/lib/python3.10/dist-pa
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.10/dist-pa
Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-pack
Requirement already satisfied: pillow>=6.2.0 in /usr/local/lib/python3.10/dist-packag
Requirement already satisfied: pyparsing>=2.3.1 in /usr/local/lib/python3.10/dist-pac
Requirement already satisfied: python-dateutil>=2.7 in /usr/local/lib/python3.10/dist
Requirement already satisfied: pytz>=2020.1 in /usr/local/lib/python3.10/dist-package
Requirement already satisfied: tzdata>=2022.7 in /usr/local/lib/python3.10/dist-packa
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.10/dist-packages (f

Klik dua kali (atau tekan Enter) untuk mengedit

Penjelasan Hasil:

1. Univariate Analysis: hours-per-week: Kita mendapatkan statistik deskriptif seperti mean,


median, minimum, maksimum, dan standar deviasi. Visualisasi histogram menunjukkan
distribusi jam kerja di populasi ini. income: Frekuensi dari setiap kategori penghasilan
(<=50K dan >50K) dihitung dan divisualisasikan dengan bar plot.
2. Bivariate Analysis: Boxplot: Visualisasi boxplot menunjukkan perbandingan distribusi
hours-per-week untuk dua kategori income. Boxplot memungkinkan kita melihat median,
interquartile range (IQR), dan outliers untuk setiap kelompok. Mean Comparison: Hasil dari
rata-rata jam kerja untuk setiap kategori penghasilan dihitung dan dibandingkan untuk
melihat apakah ada perbedaan dalam rata-rata jam kerja berdasarkan tingkat penghasilan.
Kesimpulan: Univariate analysis memberikan gambaran umum tentang distribusi data
hours-per-week dan frekuensi untuk income. Bivariate analysis membantu melihat apakah
ada perbedaan jam kerja antara mereka yang memiliki penghasilan lebih tinggi dan yang
lebih rendah.

Klik dua kali (atau tekan Enter) untuk mengedit

# Membuat data sample untuk hours-per-week dan income


data = {'hours_per_week': [40, 35, 50, 60, 40, 38, 45, 55, 37, 42, 48, 41],
'income': ['<=50K', '<=50K', '>50K', '>50K', '<=50K', '<=50K', '>50K', '>50K', '<
https://colab.research.google.com/drive/1Ww6oNv3hm04TEqY8lQoML1U3S9WfJQn3#scrollTo=UImOCwzb1j42&printMode=true 1/4
10/26/24, 3:14 PM 10623005_2SA05_Abiyyu_Ukbah_Sembiring_Meliala - Colab

# Membuat dataframe
df = pd.DataFrame(data)

# Univariate Analysis: Hours-per-Week


print("Univariate Analysis: Hours-per-Week")
print(df['hours_per_week'].describe()) # Statistik deskriptif (mean, median, dll)

# Visualisasi Univariate Analysis untuk Hours-per-Week


plt.figure(figsize=(10, 5))
sns.histplot(df['hours_per_week'], bins=5, kde=True)
plt.title('Distribution of Hours per Week')
plt.xlabel('Hours per Week')
plt.ylabel('Frequency')
plt.show()

# Univariate Analysis: Income


print("\nUnivariate Analysis: Income")
print(df['income'].value_counts()) # Frekuensi untuk setiap kategori income

# Visualisasi Univariate Analysis untuk Income


plt.figure(figsize=(7, 5))
sns.countplot(x='income', data=df)
plt.title('Distribution of Income')
plt.xlabel('Income Category')
plt.ylabel('Count')
plt.show()

# Bivariate Analysis: Relationship between Hours-per-Week and Income


plt.figure(figsize=(10, 6))
sns.boxplot(x='income', y='hours_per_week', data=df)
plt.title('Boxplot of Hours per Week by Income Category')
plt.xlabel('Income')
plt.ylabel('Hours per Week')
plt.show()

# Mean Comparison
mean_hours_per_income = df.groupby('income')['hours_per_week'].mean()
print("\nMean Hours per Week by Income Category")
print(mean_hours_per_income)

https://colab.research.google.com/drive/1Ww6oNv3hm04TEqY8lQoML1U3S9WfJQn3#scrollTo=UImOCwzb1j42&printMode=true 2/4
10/26/24, 3:14 PM 10623005_2SA05_Abiyyu_Ukbah_Sembiring_Meliala - Colab

Univariate Analysis: Hours-per-Week


count 12.00000
mean 44.25000
std 7.62919
min 35.00000
25% 39.50000
50% 41.50000
75% 48.50000
max 60.00000
Name: hours_per_week, dtype: float64

Univariate Analysis: Income


income
<=50K 6
>50K 6
Name: count, dtype: int64

https://colab.research.google.com/drive/1Ww6oNv3hm04TEqY8lQoML1U3S9WfJQn3#scrollTo=UImOCwzb1j42&printMode=true 3/4
10/26/24, 3:14 PM 10623005_2SA05_Abiyyu_Ukbah_Sembiring_Meliala - Colab

Mean Hours per Week by Income Category


income
<=50K 38.666667
>50K 49.833333
Name: hours_per_week, dtype: float64

https://colab.research.google.com/drive/1Ww6oNv3hm04TEqY8lQoML1U3S9WfJQn3#scrollTo=UImOCwzb1j42&printMode=true 4/4

You might also like