[ Interactive Data Analysis with Jupyter Notebooks ] ( CheatSheet )

1. Jupyter Notebook Basics

● Start Jupyter Notebook: jupyter notebook


● Create new notebook: Click "New" > "Python 3"
● Run cell: Shift + Enter
● Insert cell above: A
● Insert cell below: B
● Delete cell: D, D (press twice)
● Change cell type to Markdown: M
● Change cell type to Code: Y
● Toggle line numbers: L
● Toggle output: O
● Clear cell output: Clear > Clear Cell Output
● Restart kernel: 0, 0 (press twice)
● Save notebook: Ctrl + S
● Convert to Python script: jupyter nbconvert --to script notebook.ipynb
● Convert to HTML: jupyter nbconvert --to html notebook.ipynb

2. Magic Commands

● List all magic commands: %lsmagic


● Run Python file: %run script.py
● Time cell execution: %%time
● Time multiple executions: %timeit function()
● Display plots inline: %matplotlib inline
● Display plots in a separate window: %matplotlib qt
● Load extension: %load_ext autoreload
● Autoreload modules: %autoreload 2
● Display all variables: %who
● Display all variables of a specific type: %who_ls str
● Delete variable: %reset_selective variable_name
● Run shell command: !ls -l
● Set environment variable: %env MY_VAR=value
● Debug with pdb: %pdb
● Profile code: %prun function()
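
As a minimal sketch, the three notebook cells below combine a few of these magics; the NumPy computation and the ls call are only placeholders for real work.

# cell 1: re-import edited local modules automatically
%load_ext autoreload
%autoreload 2

%%time
# cell 2: %%time (the line above) must be the very first line of its cell
import numpy as np
total = np.random.rand(1_000_000).sum()

# cell 3: list variables in the namespace, then run a shell command
%who
!ls -l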

3. Data Import and Export

● Import pandas: import pandas as pd


● Read CSV: df = pd.read_csv('file.csv')
● Read CSV with specific encoding: df = pd.read_csv('file.csv',
encoding='utf-8')
● Read CSV with custom delimiter: df = pd.read_csv('file.csv', sep='\t')
● Read Excel: df = pd.read_excel('file.xlsx', sheet_name='Sheet1')
● Read JSON: df = pd.read_json('file.json')
● Read SQL query: df = pd.read_sql_query("SELECT * FROM table", connection)
● Read from URL: df = pd.read_csv('https://example.com/data.csv')
● Read from clipboard: df = pd.read_clipboard()
● Read multiple CSV files: df = pd.concat([pd.read_csv(f) for f in
glob.glob('*.csv')])
● Write to CSV: df.to_csv('output.csv', index=False)
● Write to Excel: df.to_excel('output.xlsx', index=False)
● Write to JSON: df.to_json('output.json')
● Write to SQL: df.to_sql('table_name', connection, if_exists='replace')
● Write to clipboard: df.to_clipboard()
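
A minimal round trip using a hypothetical sales.csv with a date column (the file and column names are assumptions; writing Excel output requires the openpyxl package):

import pandas as pd

# read a CSV, parsing one column as dates
df = pd.read_csv('sales.csv', parse_dates=['date'])

# write the same data back out in two formats
df.to_csv('sales_clean.csv', index=False)
df.to_excel('sales_clean.xlsx', index=False)   # needs openpyxl installed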

4. Data Exploration

● Display first rows: df.head()


● Display last rows: df.tail()
● Display random sample: df.sample(n=5)
● Get dataframe info: df.info()
● Get dataframe statistics: df.describe()
● Get column names: df.columns
● Get data types: df.dtypes
● Get dimensions: df.shape
● Check for null values: df.isnull().sum()
● Get unique values: df['column'].unique()
● Get value counts: df['column'].value_counts()
● Get correlation matrix: df.corr()
● Get covariance matrix: df.cov()
● Display all rows: pd.set_option('display.max_rows', None)
● Display all columns: pd.set_option('display.max_columns', None)
● Reset display options: pd.reset_option('display')
● Get memory usage: df.memory_usage(deep=True)

● Get column data types and non-null counts: df.info(verbose=True, show_counts=True)
● Get basic information about RangeIndex: df.index
● Get summary of a specific column: df['column'].describe()
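
A typical first-look sequence on a freshly loaded dataframe might look like this sketch (the file and column names are placeholders):

import pandas as pd

df = pd.read_csv('sales.csv')

df.head()                              # first five rows
df.info()                              # dtypes and non-null counts
df.describe()                          # summary statistics for numeric columns
df.isnull().sum()                      # missing values per column
df['category'].value_counts()          # frequency of each category
df.select_dtypes('number').corr()      # correlation matrix, numeric columns only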

5. Data Cleaning

● Drop null values: df.dropna()


● Drop null values in specific columns: df.dropna(subset=['column1',
'column2'])
● Fill null values with a specific value: df.fillna(value)
● Fill null values with column mean: df.fillna(df.mean())
● Fill null values with column median: df.fillna(df.median())
● Fill null values with forward fill: df.ffill()
● Fill null values with backward fill: df.bfill()
● Replace values: df.replace(old_value, new_value)
● Replace values using dictionary: df.replace({'old1': 'new1', 'old2':
'new2'})
● Remove duplicates: df.drop_duplicates()
● Remove duplicates based on specific columns:
df.drop_duplicates(subset=['column1', 'column2'])
● Rename columns: df.rename(columns={'old_name': 'new_name'})
● Change data type: df['column'] = df['column'].astype('int64')
● Convert to datetime: df['date'] = pd.to_datetime(df['date'])
● Handle outliers using IQR: df = df[(df['column'] >
df['column'].quantile(0.25) - 1.5 * (df['column'].quantile(0.75) -
df['column'].quantile(0.25))) & (df['column'] <
df['column'].quantile(0.75) + 1.5 * (df['column'].quantile(0.75) -
df['column'].quantile(0.25)))]
● Strip whitespace from string columns: df = df.apply(lambda x:
x.str.strip() if x.dtype == "object" else x)
● Replace inf and -inf with NaN: df = df.replace([np.inf, -np.inf], np.nan)
● Coerce errors to NaN when changing data types: df['column'] =
pd.to_numeric(df['column'], errors='coerce')
● Drop columns: df = df.drop(['column1', 'column2'], axis=1)
● Reset index: df = df.reset_index(drop=True)
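
Put together, a condensed cleaning pass could look like the sketch below (column names are placeholders; the IQR filter is the same rule as above, just written with intermediate variables):

import pandas as pd

df = pd.read_csv('sales.csv')

df = df.drop_duplicates()                                  # drop exact duplicate rows
df['price'] = pd.to_numeric(df['price'], errors='coerce')  # bad values become NaN
df['price'] = df['price'].fillna(df['price'].median())     # fill missing prices

# IQR-based outlier filter on one column
q1, q3 = df['price'].quantile([0.25, 0.75])
iqr = q3 - q1
df = df[(df['price'] >= q1 - 1.5 * iqr) & (df['price'] <= q3 + 1.5 * iqr)]

df = df.reset_index(drop=True)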

6. Data Manipulation

● Select column: df['column']

● Select multiple columns: df[['column1', 'column2']]
● Filter rows: df[df['column'] > value]
● Filter rows with multiple conditions: df[(df['column1'] > value1) &
(df['column2'] < value2)]
● Sort values: df.sort_values('column', ascending=False)
● Sort values by multiple columns: df.sort_values(['column1', 'column2'],
ascending=[True, False])
● Group by: df.groupby('column').agg({'column2': 'mean', 'column3': 'sum'})
● Pivot table: pd.pivot_table(df, values='value', index='index',
columns='columns', aggfunc='mean')
● Melt dataframe: pd.melt(df, id_vars=['id'], value_vars=['column1',
'column2'])
● Merge dataframes: pd.merge(df1, df2, on='key', how='inner')
● Concatenate dataframes: pd.concat([df1, df2], axis=0)
● Apply function to column: df['new_column'] = df['column'].apply(lambda x:
x*2)
● Apply function to multiple columns: df[['col1', 'col2']] = df[['col1',
'col2']].apply(lambda x: x*2)
● Create new column based on conditions: df['new_column'] =
np.where(df['column'] > value, 'High', 'Low')
● Rank values: df['rank'] = df['column'].rank(method='dense',
ascending=False)
● Calculate cumulative sum: df['cumsum'] = df['column'].cumsum()
● Calculate percent change: df['pct_change'] = df['column'].pct_change()
● Shift values: df['previous'] = df['column'].shift(1)
● Get dummies (one-hot encoding): pd.get_dummies(df,
columns=['categorical_column'])
● Bin continuous variable: pd.cut(df['column'], bins=[0, 25, 50, 75, 100],
labels=['Low', 'Medium', 'High', 'Very High'])
● Reshape dataframe: df.pivot(index='date', columns='category',
values='value')
● Explode lists in a column: df = df.explode('list_column')
● Aggregate by time period: df.resample('M', on='date_column').mean()
● Rolling calculations: df['rolling_mean'] =
df['column'].rolling(window=7).mean()
● Expanding calculations: df['expanding_sum'] =
df['column'].expanding().sum()
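
A short sketch chaining several of these steps, assuming placeholder columns region, revenue, and date in a hypothetical sales.csv:

import numpy as np
import pandas as pd

df = pd.read_csv('sales.csv', parse_dates=['date'])

# filter, derive a column, then aggregate by group
high = df[df['revenue'] > 1000].copy()
high['tier'] = np.where(high['revenue'] > 5000, 'High', 'Medium')
summary = (high.groupby(['region', 'tier'])
               .agg(total_revenue=('revenue', 'sum'),
                    avg_revenue=('revenue', 'mean'))
               .reset_index()
               .sort_values('total_revenue', ascending=False))

# monthly revenue totals from the raw data
monthly = df.resample('M', on='date')['revenue'].sum()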

7. Data Visualization with Matplotlib

● Import matplotlib: import matplotlib.pyplot as plt


● Create line plot: plt.plot(x, y)
● Create scatter plot: plt.scatter(x, y)
● Create bar plot: plt.bar(x, height)
● Create horizontal bar plot: plt.barh(y, width)
● Create histogram: plt.hist(data, bins=10)
● Create box plot: plt.boxplot(data)
● Create violin plot: plt.violinplot(data)
● Create pie chart: plt.pie(sizes, labels=labels, autopct='%1.1f%%')
● Create heatmap: plt.imshow(data, cmap='hot'); plt.colorbar()
● Create subplot: fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 5))
● Set title: plt.title('Title')
● Set x-label: plt.xlabel('X-axis')
● Set y-label: plt.ylabel('Y-axis')
● Add legend: plt.legend()
● Set axis limits: plt.xlim(0, 10); plt.ylim(0, 100)
● Add text to plot: plt.text(x, y, 'Text')
● Add annotation: plt.annotate('Annotation', xy=(x, y), xytext=(x+1, y+1),
arrowprops=dict(facecolor='black', shrink=0.05))
● Customize tick labels: plt.xticks(rotation=45, ha='right')
● Add grid: plt.grid(True)
● Set figure size: plt.figure(figsize=(10, 6))
● Save figure: plt.savefig('figure.png', dpi=300, bbox_inches='tight')
● Clear current figure: plt.clf()
● Close all figures: plt.close('all')
● Create 3D plot: from mpl_toolkits.mplot3d import Axes3D; fig =
plt.figure(); ax = fig.add_subplot(111, projection='3d'); ax.scatter(xs,
ys, zs)
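
A small self-contained figure combining several of these calls on synthetic data:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 10, 100)
y = np.sin(x)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))

ax1.plot(x, y, label='sin(x)')             # line plot on the left axes
ax1.set_title('Line plot')
ax1.set_xlabel('x')
ax1.set_ylabel('y')
ax1.legend()
ax1.grid(True)

ax2.hist(np.random.randn(1000), bins=30)   # histogram on the right axes
ax2.set_title('Histogram')

plt.savefig('figure.png', dpi=300, bbox_inches='tight')
plt.show()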

8. Data Visualization with Seaborn

● Import seaborn: import seaborn as sns


● Set seaborn style: sns.set_style('darkgrid')
● Create scatter plot: sns.scatterplot(x='x', y='y', data=df)
● Create line plot: sns.lineplot(x='x', y='y', data=df)
● Create bar plot: sns.barplot(x='x', y='y', data=df)
● Create box plot: sns.boxplot(x='x', y='y', data=df)

● Create violin plot: sns.violinplot(x='x', y='y', data=df)
● Create swarm plot: sns.swarmplot(x='x', y='y', data=df)
● Create count plot: sns.countplot(x='category', data=df)
● Create heatmap: sns.heatmap(df.corr(), annot=True, cmap='coolwarm')
● Create pair plot: sns.pairplot(df)
● Create joint plot: sns.jointplot(x='x', y='y', data=df, kind='scatter')
● Create distribution plot: sns.histplot(df['column'], kde=True) (sns.distplot is deprecated)
● Create cluster map: sns.clustermap(df.corr())
● Create categorical plot: sns.catplot(x='x', y='y', hue='category',
data=df, kind='bar')
● Create regression plot: sns.regplot(x='x', y='y', data=df)
● Create residual plot: sns.residplot(x='x', y='y', data=df)
● Create facet grid: g = sns.FacetGrid(df, col='category');
g.map(plt.scatter, 'x', 'y')
● Set color palette: sns.set_palette('Set2')
● Customize plot appearance: sns.set_context('paper', font_scale=1.5,
rc={'lines.linewidth': 2.5})
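
A quick sketch using seaborn's bundled tips example dataset (fetched on first use by sns.load_dataset), so it runs without any local files:

import matplotlib.pyplot as plt
import seaborn as sns

sns.set_style('darkgrid')
tips = sns.load_dataset('tips')   # small example dataset shipped with seaborn

sns.scatterplot(x='total_bill', y='tip', hue='time', data=tips)
plt.title('Tip vs. total bill')
plt.show()

# correlation heatmap restricted to the numeric columns
sns.heatmap(tips[['total_bill', 'tip', 'size']].corr(), annot=True, cmap='coolwarm')
plt.show()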

9. Statistical Analysis

● Import NumPy and SciPy stats: import numpy as np; from scipy import stats


● Calculate mean: np.mean(data)
● Calculate median: np.median(data)
● Calculate mode: stats.mode(data)
● Calculate standard deviation: np.std(data)
● Calculate variance: np.var(data)
● Calculate skewness: stats.skew(data)
● Calculate kurtosis: stats.kurtosis(data)
● Calculate correlation: df['column1'].corr(df['column2'])
● Calculate Spearman correlation: df['column1'].corr(df['column2'],
method='spearman')
● Calculate covariance: df['column1'].cov(df['column2'])
● Perform t-test: stats.ttest_ind(group1, group2)
● Perform paired t-test: stats.ttest_rel(group1, group2)
● Perform one-way ANOVA: stats.f_oneway(group1, group2, group3)
● Perform chi-square test: stats.chi2_contingency(observed)
● Calculate p-value: stats.norm.sf(abs(z_score)) * 2
● Calculate confidence interval: stats.t.interval(confidence=0.95, df=len(data)-1, loc=np.mean(data), scale=stats.sem(data)) (the keyword was alpha in older SciPy)

● Perform Shapiro-Wilk test for normality: stats.shapiro(data)
● Perform Kolmogorov-Smirnov test: stats.kstest(data, 'norm')
● Perform Mann-Whitney U test: stats.mannwhitneyu(group1, group2)
● Perform Wilcoxon signed-rank test: stats.wilcoxon(group1, group2)
● Perform Kruskal-Wallis H-test: stats.kruskal(group1, group2, group3)
● Perform Friedman test: stats.friedmanchisquare(group1, group2, group3)
● Calculate effect size (Cohen's d): cohens_d = (np.mean(group1) -
np.mean(group2)) / np.sqrt((np.std(group1) ** 2 + np.std(group2) ** 2) /
2)
● Perform linear regression: slope, intercept, r_value, p_value, std_err =
stats.linregress(x, y)
● Calculate Pearson correlation matrix: df.corr(method='pearson')
● Calculate Kendall's Tau: stats.kendalltau(x, y)
● Perform one-sample t-test: stats.ttest_1samp(data, popmean)
● Perform Levene's test for equality of variances: stats.levene(group1,
group2)
● Perform Bartlett's test for equality of variances:
stats.bartlett(group1, group2)
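
A small sketch comparing two synthetic groups with a few of the tests above (the group means and sizes are arbitrary):

import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
group1 = rng.normal(loc=10.0, scale=2.0, size=100)
group2 = rng.normal(loc=10.5, scale=2.0, size=100)

print(stats.shapiro(group1))             # normality check
print(stats.levene(group1, group2))      # equality of variances
print(stats.ttest_ind(group1, group2))   # independent two-sample t-test

# effect size (Cohen's d), as defined above
cohens_d = (np.mean(group1) - np.mean(group2)) / np.sqrt(
    (np.std(group1) ** 2 + np.std(group2) ** 2) / 2)
print(cohens_d)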

10. Machine Learning with Scikit-learn

● Import scikit-learn (import the specific helpers you need, e.g.): from sklearn.model_selection import train_test_split


● Split data: X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.2, random_state=42)
● Scale features: X_scaled = StandardScaler().fit_transform(X)
● Normalize features: X_normalized = Normalizer().fit_transform(X)
● Encode categorical variables: X_encoded =
OneHotEncoder().fit_transform(X)
● Select features: selector = SelectKBest(f_classif, k=5).fit(X, y)
● Perform PCA: pca = PCA(n_components=2).fit_transform(X)
● Train linear regression: model = LinearRegression().fit(X_train, y_train)
● Train logistic regression: model = LogisticRegression().fit(X_train,
y_train)
● Train decision tree: model = DecisionTreeClassifier().fit(X_train,
y_train)
● Train random forest: model = RandomForestClassifier().fit(X_train,
y_train)
● Train SVM: model = SVC().fit(X_train, y_train)
● Train k-nearest neighbors: model = KNeighborsClassifier().fit(X_train,
y_train)

● Train naive Bayes: model = GaussianNB().fit(X_train, y_train)
● Train gradient boosting: model =
GradientBoostingClassifier().fit(X_train, y_train)
● Make predictions: y_pred = model.predict(X_test)
● Calculate accuracy: accuracy_score(y_test, y_pred)
● Calculate precision, recall, f1-score:
precision_recall_fscore_support(y_test, y_pred, average='weighted')
● Create confusion matrix: confusion_matrix(y_test, y_pred)
● Perform cross-validation: cross_val_score(model, X, y, cv=5)
● Perform grid search: GridSearchCV(model, param_grid, cv=5).fit(X, y)
● Plot ROC curve: fpr, tpr, _ = roc_curve(y_test, y_pred_proba);
plt.plot(fpr, tpr)
● Calculate AUC: roc_auc_score(y_test, y_pred_proba)
● Plot learning curve: learning_curve(model, X, y, cv=5)
● Plot validation curve: validation_curve(model, X, y, param_name=param_name, param_range=param_range, cv=5) (param_name and param_range are keyword-only in recent scikit-learn)
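
An end-to-end classification sketch on scikit-learn's bundled iris data, spelling out the imports that the shorthand above assumes:

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# fit the scaler on the training split only, then apply it to both splits
scaler = StandardScaler().fit(X_train)
X_train_s, X_test_s = scaler.transform(X_train), scaler.transform(X_test)

model = RandomForestClassifier(random_state=42).fit(X_train_s, y_train)
y_pred = model.predict(X_test_s)

print(accuracy_score(y_test, y_pred))
print(confusion_matrix(y_test, y_pred))
print(cross_val_score(model, X, y, cv=5).mean())   # 5-fold cross-validation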

11. Deep Learning with TensorFlow and Keras

● Import TensorFlow and Keras: import tensorflow as tf; from tensorflow import keras

● Create sequential model: model = keras.Sequential()
● Add dense layer: model.add(keras.layers.Dense(64, activation='relu',
input_shape=(input_dim,)))
● Add dropout layer: model.add(keras.layers.Dropout(0.5))
● Add convolutional layer: model.add(keras.layers.Conv2D(32, (3, 3),
activation='relu'))
● Add max pooling layer: model.add(keras.layers.MaxPooling2D((2, 2)))
● Add LSTM layer: model.add(keras.layers.LSTM(64))
● Compile model: model.compile(optimizer='adam',
loss='binary_crossentropy', metrics=['accuracy'])
● Train model: history = model.fit(X_train, y_train, epochs=10,
batch_size=32, validation_split=0.2)
● Evaluate model: model.evaluate(X_test, y_test)
● Make predictions: y_pred = model.predict(X_test)
● Save model: model.save('model.h5')
● Load model: loaded_model = keras.models.load_model('model.h5')
● Plot training history: plt.plot(history.history['accuracy']); plt.plot(history.history['val_accuracy'])

● Use early stopping: early_stopping =
keras.callbacks.EarlyStopping(patience=3)
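
A minimal binary-classification sketch on synthetic data (the 20-feature input and the layer sizes are arbitrary; requires TensorFlow installed):

import numpy as np
import matplotlib.pyplot as plt
from tensorflow import keras

# synthetic data: 1000 samples, 20 features, binary labels
X = np.random.rand(1000, 20).astype('float32')
y = (X.sum(axis=1) > 10).astype('float32')

model = keras.Sequential([
    keras.Input(shape=(20,)),
    keras.layers.Dense(64, activation='relu'),
    keras.layers.Dropout(0.5),
    keras.layers.Dense(1, activation='sigmoid'),
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

early_stopping = keras.callbacks.EarlyStopping(patience=3, restore_best_weights=True)
history = model.fit(X, y, epochs=10, batch_size=32,
                    validation_split=0.2, callbacks=[early_stopping])

# plot training and validation accuracy as two separate lines
plt.plot(history.history['accuracy'], label='train')
plt.plot(history.history['val_accuracy'], label='validation')
plt.legend()
plt.show()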

12. Natural Language Processing

● Import NLTK: import nltk


● Download NLTK data: nltk.download('punkt')
● Tokenize text: tokens = nltk.word_tokenize(text)
● Get sentences: sentences = nltk.sent_tokenize(text)
● Remove stopwords: from nltk.corpus import stopwords; tokens = [word for
word in tokens if word.lower() not in stopwords.words('english')]
● Perform stemming: from nltk.stem import PorterStemmer; stemmer =
PorterStemmer(); stems = [stemmer.stem(word) for word in tokens]
● Perform lemmatization: from nltk.stem import WordNetLemmatizer;
lemmatizer = WordNetLemmatizer(); lemmas = [lemmatizer.lemmatize(word)
for word in tokens]
● Perform part-of-speech tagging: pos_tags = nltk.pos_tag(tokens)
● Extract named entities: named_entities = nltk.ne_chunk(pos_tags)
● Calculate term frequency: from nltk.probability import FreqDist;
freq_dist = FreqDist(tokens)
● Calculate TF-IDF: from sklearn.feature_extraction.text import
TfidfVectorizer; tfidf = TfidfVectorizer().fit_transform(documents)
● Perform topic modeling: from gensim import corpora, models; lda_model =
models.LdaMulticore(corpus, num_topics=10)
● Train Word2Vec model: from gensim.models import Word2Vec; model =
Word2Vec(sentences, vector_size=100, window=5, min_count=1, workers=4)
● Perform sentiment analysis: from textblob import TextBlob; sentiment =
TextBlob(text).sentiment
● Perform text classification: from sklearn.naive_bayes import
MultinomialNB; clf = MultinomialNB().fit(X_train, y_train)
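
A short tokenize-and-normalize sketch (the sample sentence is arbitrary; the NLTK corpora must be downloaded once):

import nltk
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer

# one-time downloads (newer NLTK releases may also need 'punkt_tab')
nltk.download('punkt')
nltk.download('stopwords')
nltk.download('wordnet')

text = "Jupyter notebooks make interactive data analysis much easier."
tokens = nltk.word_tokenize(text)

# drop punctuation and stopwords, then lemmatize
stop_words = set(stopwords.words('english'))
words = [w for w in tokens if w.isalpha() and w.lower() not in stop_words]
lemmatizer = WordNetLemmatizer()
lemmas = [lemmatizer.lemmatize(w.lower()) for w in words]
print(lemmas)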

13. Time Series Analysis

● Import statsmodels: import statsmodels.api as sm


● Create time series object: ts = pd.Series(data,
index=pd.date_range(start='2023-01-01', periods=len(data)))
● Resample time series: ts_monthly = ts.resample('M').mean()
● Calculate rolling mean: rolling_mean = ts.rolling(window=7).mean()
● Calculate exponential moving average: ema = ts.ewm(span=7).mean()
● Perform seasonal decomposition: result = sm.tsa.seasonal_decompose(ts)

● Check stationarity: from statsmodels.tsa.stattools import adfuller;
result = adfuller(ts)
● Make time series stationary: ts_diff = ts.diff().dropna()
● Create ACF plot: from statsmodels.graphics.tsaplots import plot_acf;
plot_acf(ts)
● Create PACF plot: from statsmodels.graphics.tsaplots import plot_pacf;
plot_pacf(ts)
● Fit ARIMA model: model = sm.tsa.ARIMA(ts, order=(1,1,1)).fit()
● Make ARIMA predictions: predictions = model.forecast(steps=5)
● Fit SARIMA model: model = sm.tsa.SARIMAX(ts, order=(1,1,1),
seasonal_order=(1,1,1,12)).fit()
● Perform Granger causality test: from statsmodels.tsa.stattools import
grangercausalitytests; grangercausalitytests(data[['y', 'x']], maxlag=5)
● Create Prophet model (df must have 'ds' and 'y' columns; newer releases install as prophet rather than fbprophet): from prophet import Prophet; model = Prophet().fit(df)
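
A compact sketch on a synthetic daily series (the dates and values are made up):

import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.tsa.stattools import adfuller

# synthetic daily series: slow trend plus noise
idx = pd.date_range(start='2023-01-01', periods=365, freq='D')
ts = pd.Series(np.arange(365) * 0.1 + np.random.randn(365), index=idx)

ts_monthly = ts.resample('M').mean()          # monthly averages
rolling_mean = ts.rolling(window=7).mean()    # 7-day rolling mean

adf_stat, p_value = adfuller(ts)[:2]          # stationarity check
model = sm.tsa.ARIMA(ts, order=(1, 1, 1)).fit()
forecast = model.forecast(steps=5)
print(p_value, forecast)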

By: Waleed Mousa
