Getting Started With Pandas
Creating a Series
• From a List:
import pandas as pd
data = [10, 20, 30, 40, 50]
series = pd.Series(data)
print(series)
• From a Dictionary:
import pandas as pd
data = {"a": 1, "b": 2, "c": 3}
series = pd.Series(data)
print(series)
• With Custom Index:
import pandas as pd
data = [10, 20, 30, 40, 50]
series = pd.Series(data, index=["a", "b", "c", "d", "e"])
print(series)
Creating a DataFrame
import pandas as pd
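Only the import survives in this section. As a minimal sketch, a DataFrame can be built from a dictionary of equal-length lists, where each key becomes a column. The column names and values below are made up for illustration; the Age column is chosen to match the filtering example in the Data Manipulation section.

```python
import pandas as pd

# Hypothetical sample data: each key becomes a column
data = {
    "Name": ["Alice", "Bob", "Charlie", "Diana"],
    "Age": [25, 30, 35, 28],
    "City": ["Paris", "London", "Berlin", "Madrid"],
}
df = pd.DataFrame(data)
print(df)
print(df.shape)  # (number of rows, number of columns)
```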
Data Manipulation
Pandas provides a wide range of methods to manipulate data, such as filtering, sorting,
and grouping.
• Filtering: Select rows based on conditions.
# Filtering rows where Age is greater than 28
filtered_df = df[df['Age'] > 28]
print(filtered_df)
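Sorting and grouping, also mentioned above, follow the same pattern. The sketch below uses sort_values() and groupby() on made-up data; the column names are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical data, invented for this example
df = pd.DataFrame({
    "Name": ["Alice", "Bob", "Charlie", "Diana"],
    "Age": [25, 30, 35, 28],
    "City": ["Paris", "London", "Paris", "London"],
})

# Sorting: order rows by Age, oldest first
sorted_df = df.sort_values(by="Age", ascending=False)
print(sorted_df)

# Grouping: average Age per City
mean_age_by_city = df.groupby("City")["Age"].mean()
print(mean_age_by_city)
```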
Pandas makes it easy to handle missing data with methods like fillna() and dropna().
# Filling missing values with a default value
df.fillna(0, inplace=True)
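For comparison, here is a small sketch that shows fillna() and dropna() side by side on made-up data. Both methods return a new DataFrame by default, so the original is unchanged unless the result is assigned back (or inplace=True is passed, as above).

```python
import numpy as np
import pandas as pd

# Hypothetical data with missing values (NaN)
df = pd.DataFrame({"A": [1.0, np.nan, 3.0], "B": [4.0, 5.0, np.nan]})

# fillna returns a copy with every NaN replaced by 0
filled = df.fillna(0)

# dropna returns a copy without the rows that contain any NaN
dropped = df.dropna()

print(filled)
print(dropped)
```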
Pandas can read from and write to various data sources, such as CSV files, Excel files, and SQL databases.
# Reading from a CSV file
df = pd.read_csv('data.csv')
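Writing works the same way in the other direction. The sketch below saves a made-up DataFrame with to_csv() and reads it back; the file name output.csv and the temporary directory are assumptions chosen so the example leaves no files behind.

```python
import os
import tempfile

import pandas as pd

# Hypothetical data to write out
df = pd.DataFrame({"Name": ["Alice", "Bob"], "Age": [25, 30]})

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "output.csv")
    # index=False leaves the row labels out of the file
    df.to_csv(path, index=False)
    # Read the file back to confirm the round trip
    loaded = pd.read_csv(path)

print(loaded)
```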
Descriptive Statistics
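A quick way to get an overview is describe(), which reports count, mean, standard deviation, minimum, quartiles, and maximum for each numeric column. The sketch below uses made-up data; the column name A is chosen to match the snippets that follow.

```python
import pandas as pd

# Hypothetical numeric data, invented for this example
df = pd.DataFrame({"A": [1, 2, 3, 4, 5], "B": [10, 20, 30, 40, 50]})

# One summary column per numeric column in df
summary = df.describe()
print(summary)
```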
• Calculating Specific Statistics:
# Mean
mean = df['A'].mean()
print('Mean:', mean)
# Standard Deviation
std = df['A'].std()
print('Standard Deviation:', std)
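# Correlation between numeric columns
# (numeric_only=True, available in pandas 1.5+, skips non-numeric columns)
correlation = df.corr(numeric_only=True)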
print('Correlation:\n', correlation)