Data Science With Python Workflow
Data Science With Python Workflow
Python Workflow
If you want to learn Python, then join our course: Python for
Data Science Automation (DS4B 101-P).
CS = Cheat Sheet
matplotlib
plotnine
seaborn plotly (CS)
Pandas
text
time series
Visualize
Pandas
categorical
(CS) missing
---
Numpy Transform
Pandas
Pandas
Dash
I/O tools
Model JupyterLab
Streamlit
data structures
group by
Jupyter
reshape (pivot) Pycaret
RStudio
VSCode
Scikit-Learn
TensorFlow
Spyder
Statsmodels
Keras
Important Resources
Anaconda Distribution: https://www.anaconda.com/download/
Python Documentation: https://docs.python.org/
Python Standard Library: https://docs.python.org/3/library
version: 2.0
Data Science with Text Analysis & NLP Machine Learning
Special Topics Scikit-Learn - ML in Python
NLTK - Text Tokenization & Modeling H2O - Scalable & AutoML
spaCy - NLP using Cython for Speed TPOT - TPOT Automated ML Tool
fuzzywuzzy - Fuzzy String Matching PyCaret - PyCaret Low Code ML
Time Series Forecasting Dask ML - Scalable ML with Dask
ML Packages: XGBoost, LightGBM, CatBoost
sktime - Scikit-Learn Extension for Time Series
Recommendation
statsmodels - Time Series Analysis
GluonTS - MXNet/Gluon Deep Learning for Time
Systems Feature Engineering
Series Annoy - Approximate Nearest Neighbors Sklearn Data Transformations
LightFM - Popular recommendation algo's. sklearn-pandas - Sklearn Extension for Pandas