Python Commands

The document provides a comprehensive guide on using Pandas, Matplotlib, Seaborn, NumPy, and SciPy for data manipulation, visualization, and statistical analysis. It includes functions for reading data, exploring data, cleaning, manipulating, and visualizing it through various types of plots and statistical tests. Each library's key functionalities and methods are outlined for effective data handling and analysis.

Pandas (import pandas as pd):

1. Reading Data:

• pd.read_csv('filename.csv'): Read a CSV file into a DataFrame.

• pd.read_excel('filename.xlsx'): Read an Excel file into a DataFrame.

2. Data Exploration:

• df.head(): Display the first few rows of the DataFrame.

• df.describe(): Summary statistics for numerical columns.

• df.info(): Information about the DataFrame, including data types and non-null counts.

• df.dtypes: Data type of each column (an attribute, not a method); use df['column_name'].dtype for a single column.

• df.shape: Get the dimensions of the DataFrame (rows, columns).
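
The reading and exploration calls above can be tried together in a minimal sketch. The CSV content and column names here are made up for illustration, and io.StringIO stands in for a real file path so the snippet runs without any file on disk.

```python
import io

import pandas as pd

# Hypothetical CSV content; in practice pass a file path to pd.read_csv.
csv_text = """name,age,city
Ana,34,Lisbon
Ben,,Oslo
Cara,29,Lima
"""

df = pd.read_csv(io.StringIO(csv_text))

print(df.head())      # first rows (up to 5 by default)
print(df.shape)       # (3, 3)
print(df.dtypes)      # data type of each column
df.info()             # prints column types and non-null counts
print(df.describe())  # summary statistics for numeric columns
```

Because one age is missing, pandas reads that column as floating point, and df.info() reports two non-null values for it.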

3. Data Selection and Filtering:

• df['column_name'] or df.column_name: Select a single column.

• df[['col1', 'col2']]: Select multiple columns.

• df.loc[row_indexer, col_indexer]: Access a group of rows and columns by labels.

• df.iloc[row_indexer, col_indexer]: Access a group of rows and columns by integer position.
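
A short sketch of the difference between label-based and position-based selection, using a made-up DataFrame with a string index. The boolean-mask filter at the end is not listed above; it is included as one common way to filter rows.

```python
import pandas as pd

# Hypothetical data with string row labels.
df = pd.DataFrame(
    {
        "name": ["Ana", "Ben", "Cara"],
        "age": [34, 41, 29],
        "city": ["Lisbon", "Oslo", "Lima"],
    },
    index=["a", "b", "c"],
)

ages = df["age"]                  # single column, returned as a Series
subset = df[["name", "city"]]     # list of columns, returned as a DataFrame

by_label = df.loc["b", "age"]     # row labelled "b", column "age"
by_position = df.iloc[0, 2]       # first row, third column

label_slice = df.loc["a":"b", ["name", "age"]]  # label slices include the end label
pos_slice = df.iloc[0:2, 0:2]                   # positional slices exclude the end

over_30 = df[df["age"] > 30]      # boolean mask keeps rows where the condition holds
```

Note that df.loc["a":"b"] and df.iloc[0:2] both return two rows here, but for different reasons: the first includes its end label, the second stops before its end position.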

4. Data Cleaning:

• df.isnull(): Check for null values in the DataFrame.

• df.dropna(): Remove rows with null values.

• df.fillna(value): Fill null values with a specified value.

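• df.replace(old_value, new_value): Replace occurrences of a value (for example, a placeholder for missing data) with a new value.

• df['column_name'].astype(dtype): Convert a column to the specified data type.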
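
The cleaning steps above might be chained as in the following sketch. The data, the "?" placeholder, and the fill value are assumptions chosen for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical data: one missing score and a "?" placeholder grade.
df = pd.DataFrame({"score": [1.0, np.nan, 3.0], "grade": ["A", "?", "B"]})

missing_per_column = df.isnull().sum()    # count of nulls in each column
dropped = df.dropna()                     # drops the row with the missing score

filled = df.fillna({"score": 0.0})        # fill nulls in "score" only
cleaned = filled.replace("?", "unknown")  # swap the placeholder for a clearer label
cleaned["score"] = cleaned["score"].astype(int)  # safe once no nulls remain
```

These methods return new objects by default, so df itself is left unchanged. Converting to an integer type with astype fails while the column still contains NaN, which is why the fill comes first.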

5. Data Manipulation:

• df.groupby('column_name').agg(func): Group by a column and apply an aggregation function.

• df['new_column'] = df['col1'] + df['col2']: Create a new column based on existing columns.

• pd.concat([df1, df2], axis=0): Concatenate DataFrames vertically (along rows).

• pd.concat([df1, df2], axis=1): Concatenate DataFrames horizontally (along columns).
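
A minimal sketch of these manipulation calls on two small, made-up DataFrames. The ignore_index=True argument and the add_suffix call are additions not listed above: the first renumbers the rows after stacking, the second avoids duplicate column names when joining side by side.

```python
import pandas as pd

# Hypothetical results for two teams over two seasons.
df1 = pd.DataFrame({"team": ["x", "y"], "wins": [3, 5], "losses": [2, 1]})
df2 = pd.DataFrame({"team": ["x", "y"], "wins": [4, 2], "losses": [1, 3]})

# Vertical concatenation: rows of df2 are appended below df1.
stacked = pd.concat([df1, df2], axis=0, ignore_index=True)

# New column computed from existing columns.
stacked["games"] = stacked["wins"] + stacked["losses"]

# Group by team and sum every remaining column.
totals = stacked.groupby("team").agg("sum")

# Horizontal concatenation: columns of df2 are placed beside df1.
side_by_side = pd.concat([df1, df2.add_suffix("_2")], axis=1)
```

With axis=1, rows are aligned on the index, so both frames should share the same row labels for the result to line up as expected.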

Matplotlib (import matplotlib.pyplot as plt):

1. Basic Plots:

• plt.plot(x, y): Line plot.

• plt.scatter(x, y): Scatter plot.

• plt.bar(x, height): Bar plot.

• plt.hist(data, bins=30): Histogram.

2. Customization:

• plt.xlabel('xlabel'), plt.ylabel('ylabel'): Set axis labels.

• plt.title('title'): Set plot title.

• plt.legend(): Display legend.

3. Saving and Showing:

• plt.savefig('filename.png'): Save the plot to a file.

• plt.show(): Display the plot.
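
The plotting, customization, and saving calls above can be combined as in this sketch. The data and output file name are hypothetical, and the "Agg" backend is an assumption made so the script runs without a display; in an interactive session you would skip that line and call plt.show() at the end.

```python
import os
import tempfile

import matplotlib

matplotlib.use("Agg")  # assumption: non-interactive backend, no window is opened
import matplotlib.pyplot as plt

x = [0, 1, 2, 3, 4]
y = [v ** 2 for v in x]

plt.plot(x, y, label="y = x^2")  # line plot; the label feeds the legend
plt.scatter(x, y)                # scatter points on the same axes
plt.xlabel("x")
plt.ylabel("y")
plt.title("Line and scatter on the same axes")
plt.legend()

# Hypothetical output location in the system temp directory.
out_path = os.path.join(tempfile.gettempdir(), "example_plot.png")
plt.savefig(out_path)
plt.close()  # release the figure once it has been written
```

When both are used, it is usually safer to call plt.savefig() before plt.show(), since the figure may be cleared once the interactive window closes.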

Seaborn (import seaborn as sns):

1. Data Visualization:

• sns.scatterplot(x='col1', y='col2', data=df): Scatter plot.

• sns.lineplot(x='col1', y='col2', data=df): Line plot.

• sns.histplot(data=df, x='column_name', bins=30): Histogram.

• sns.boxplot(x='col1', y='col2', data=df): Box plot.

2. Statistical Estimations:

• sns.regplot(x='col1', y='col2', data=df): Regression plot.

• sns.lmplot(x='col1', y='col2', data=df, hue='category'): Scatter plot with a linear fit for each category.

3. Categorical Plots:

• sns.barplot(x='col1', y='col2', data=df): Bar plot.

• sns.countplot(x='column_name', data=df): Count plot.

4. Heatmaps and Matrices:

• sns.heatmap(corr_matrix, annot=True, cmap='coolwarm'): Heatmap of a correlation matrix.

• sns.clustermap(corr_matrix, cmap='coolwarm'): Hierarchical clustering of a correlation matrix.
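
The list above uses corr_matrix without showing where it comes from. One common way to build it is df.corr() on the numeric columns, as in this sketch. The data and column names are made up, and the "Agg" backend is an assumption so the script runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # assumption: non-interactive backend, no window is opened
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical measurements with a categorical column.
df = pd.DataFrame(
    {
        "height": [150, 160, 170, 180],
        "weight": [50, 58, 69, 80],
        "group": ["a", "a", "b", "b"],
    }
)

# Scatter plot coloured by category.
plt.figure()
scatter_ax = sns.scatterplot(x="height", y="weight", hue="group", data=df)
plt.close()

# Pairwise correlations of the numeric columns, drawn as an annotated heatmap.
corr_matrix = df[["height", "weight"]].corr()
plt.figure()
heatmap_ax = sns.heatmap(corr_matrix, annot=True, cmap="coolwarm")
plt.close()
```

Seaborn draws onto Matplotlib axes, so plt.title(), plt.savefig(), and plt.show() apply to these plots as well.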

NumPy (import numpy as np):

1. Creating Arrays:

• np.array([1, 2, 3]): Create a 1D array.

• np.zeros((3, 3)): Create an array of zeros with the specified shape.

• np.ones((3, 3)): Create an array of ones with the specified shape.

2. Array Operations:

• np.sum(arr): Sum of array elements.

• np.mean(arr): Mean of array elements.

• np.max(arr), np.min(arr): Maximum and minimum values in the array.

• np.arange(start, stop, step): Create an array of evenly spaced values from start up to, but not including, stop.

3. Array Manipulation:

• arr.reshape((rows, cols)): Reshape the array.

• np.vstack((arr1, arr2)): Stack arrays vertically.

• np.hstack((arr1, arr2)): Stack arrays horizontally.
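
A short sketch tying the NumPy calls above together. The axis argument to np.sum is an addition not listed above; it shows how to aggregate along one dimension instead of over the whole array.

```python
import numpy as np

arr = np.arange(0, 6, 1)         # [0 1 2 3 4 5]; the stop value is excluded
grid = arr.reshape((2, 3))       # 2 rows, 3 columns

total = np.sum(grid)             # 15, sum over all elements
mean = np.mean(grid)             # 2.5
col_sums = np.sum(grid, axis=0)  # [3 5 7], one sum per column
largest, smallest = np.max(grid), np.min(grid)

ones = np.ones((2, 3))
stacked_v = np.vstack((grid, ones))  # shape (4, 3): rows added below
stacked_h = np.hstack((grid, ones))  # shape (2, 6): columns added to the right
```

reshape requires the new shape to hold exactly the same number of elements as the original array, which is why six values fit into (2, 3).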

SciPy (from scipy import stats):

1. Statistical Tests:

• stats.ttest_ind(a, b): Independent t-test.

• stats.pearsonr(x, y): Pearson correlation coefficient and p-value.

• stats.norm.pdf(x, loc, scale): Probability density function of a normal distribution.

2. Distribution Fitting:

• params = stats.norm.fit(data): Fit a normal distribution to the data; returns the estimated mean (loc) and standard deviation (scale).

3. Descriptive Statistics:

• stats.describe(data): Compute several descriptive statistics.
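
The SciPy calls above can be exercised on synthetic data as in this sketch. The samples are randomly generated with a fixed seed purely for illustration, so the statistics carry no real-world meaning.

```python
import numpy as np
from scipy import stats

# Hypothetical samples drawn from two normal distributions.
rng = np.random.default_rng(0)
a = rng.normal(loc=0.0, scale=1.0, size=200)
b = rng.normal(loc=0.5, scale=1.0, size=200)

# Independent two-sample t-test: test statistic and p-value.
t_stat, p_value = stats.ttest_ind(a, b)

# Pearson correlation coefficient and its p-value (inputs must have equal length).
r, r_p_value = stats.pearsonr(a, b)

# Fit a normal distribution: returns the estimated mean and standard deviation.
mu, sigma = stats.norm.fit(a)

# Evaluate the fitted density at its own mean.
density_at_mean = stats.norm.pdf(mu, loc=mu, scale=sigma)

# Descriptive statistics: nobs, minmax, mean, variance, skewness, kurtosis.
summary = stats.describe(a)
print(summary.nobs, summary.mean, summary.variance)
```

The values returned by stats.norm.fit can be passed straight back as loc and scale to stats.norm.pdf, as done above.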
