0% found this document useful (0 votes)
11 views4 pages

Mind Mapping v1.2

The document outlines various topics related to data science and machine learning, including functions, data handling, machine learning algorithms, and deep learning concepts. It also lists tools and libraries such as Pandas, TensorFlow, and AWS services, along with applications in natural language processing and image processing. Additionally, it provides references for further learning and practice resources in Python and SQL.

Uploaded by

gowthamgdggenai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views4 pages

Mind Mapping v1.2

The document outlines various topics related to data science and machine learning, including functions, data handling, machine learning algorithms, and deep learning concepts. It also lists tools and libraries such as Pandas, TensorFlow, and AWS services, along with applications in natural language processing and image processing. Additionally, it provides references for further learning and practice resources in Python and SQL.

Uploaded by

gowthamgdggenai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

• Functions

Engineering
• Functions & Types of Functions • Handling missing data & Impute missing
• Arguments & Types of Arguments

Pandas

Data
values
Python

• Local & Global Variable


To Explore:
• Class & Objects • Encoding the data
• File & Folder Handling • Pyspark (Distributed Operations)
• Algorithmic Thinking • Outlier detection and correction
• Redshift
Problem Solving – Codekata
• Databricks • Meaningful data transformation
To Explore:
• Super Class, Meta class, Magic Class • AI Tools for Data Warehousing
• Data Structure & Algorithms

Visualization
• Buil APIs

Probability

Data
• Power BI
• General Probability (Distributions) • Looker Studio
• DDL (Create,Drop,Truncate,Alter)
• DML (Select,Delete,Update,Insert) • Machine Learning Algorithms

• JOINS
SQL

• Constraints(Primary key, Foreign • EC2


Key,Unique,Not NULL, CHECK,DEFAULT) Statistics • Descriptive Statistics • RDS
• Operators (Arithmetic, Logical, Bitwise, o Measure of Central Tendency • S3

AWS
Comparison, Compound) (Mean, Mode, Median) • Lambda
• Clauses in SQL(Where,Having,Group by, o Measure of Variation (Range,
Order by) Variance, Standard deviation) To Explore:
To Explore: TCL, DCL, Views, o Measure Distribution • VPC
Procedures/Functions o Covariance & Corelation • Security Groups
Procedures/Functions • Inferential Statistics • ML relevant services (Docker, Partyrock,
CTE / Analytical queries o Covariance & Corelation Sagemaker)
ML Algorithms & Model Optimization - Supervised
• Data Analysis & Cleansing
• Exploratory Data Analysis Supervised Learning

ML Algorithms - Unsupervised
Unsupervised Learning
Machine Learning Lifecycle

• Features & Target Selection Regression


• Preprocessing (Not much scope for Optimization)
• Linear Regression
o Data Balancing • Clustering
• Ridge & Lasso Regression
o Standard Scalar & Min Max Scalar • Dimension Reduction
o Standardization in Deep
o Encoding • Association(Pattern Search)
• Split Train & Test Dataset Leaning
o K Means
• Train the model Classification
o Hierarchical
o Algorithms • K Nearest Neighbours
o Parameters o K Value Adjustment
o Cross Validation – Random/Grid • Support Vector Machine
• Test/ Evaluate the model
o Kernels & Parameters
o Validation – Test Data
tuning
o Evaluation – Training Data
• Logistic Regression
(Confusion Matrix or Classification
Report) o Threshold Adjustment

• Re-Optimization (As needed) (Activation Function)


• Pickling
o Versioning
Applications:
• Speech Recognition
• ANN/ Neurons/ Perceptron
Deep Learning Terms/Concepts

Architecture • Text Prediction


Input Layer • Text Classification

Natural Language Processing


o
▪ Weights & Bias
• Semantic Analysis

Image Processing
▪ number of neurons equals number
of features • Naive Bayes
o Hidden Layer(Linear Function)
▪ Activation Function is imperative
to bring non-linearity into the Image Processing
network.
o Output Layer • CNN (Convolutional Neural Networks)
Recurrent Neural Network
o Feature Extraction/ Feature Maps/
• One to One, One to Many, Many to One,
Activation Functions & Output Range
Pooling/ Classification
• Many to Many
o Sigmoid (0 to 1) • GANs (Generative Adversarial Network)
o TanH (-1 to 1) o Diffusers
o ReLU (0 to infinity) LSTM(Long Short Term Memory)
o Leaky ReLU (0.01*x to x) o Generators & Discriminator
• Forget Gate, Input Gate, Output Gare
• Multi-Layer Perceptron – Deep Neural Network
GRU (Gated Recurrent Unit)
• Feedforward
• Back Propagation
• Update Gate, Reset Gate
To Explore: Pretrained Networks
Optimizers VAE (Variational AutoEncoders)
o Adam (Adaptive Moment Estimation)
o RMSprop(Root Mean Square Propagation)
Transformers
To Explore: Free Sites/ Services to deploy AI Models
ML Ops & AI Ops
To Explore: Publicly available pretrained
models(Hugging Face, Pytorch etc)
Python • Python Study Materials
• Python Tutor: Visualize code
• Data Processing and Modelling
• Python Exercises
o Standardization in Deep
o Pandas - Data Analysis • Python Keywords – Lexical • Python Cheat sheet

Practice Reference Links


o Scikit-learn – ML Library • Python Programming Exercises
o TensorFlow - ML Framework • Realpython Keywords
SQL Quick Reference

Reference Links

o PyTorch – DL Framework
o PySpark • Hugging Face Platform • SQL Cheat Sheet
Libraries

o XG Boost – Gradient Boosting • SQL Cheat sheet - Interviewbit


• Academo – Logicgate Stimulator • SQL Data Analytics Practice Exercises
o App Build
o Streamlit • SQL Case Study
• Naftali Harris.com/Visualize K-Means
o Gradio • SQL Practice Exercises
• API Build & Serve Clustering • SQL Query Analyzer
o Flask
o Fast API • SQL Query Tuning Tool
• Projector Tensorflow
• Eversql - Analyse SQL Query
• Data Visualization
• Betalist - AI Tool • SQL - Order of execution of Query
o Matplotlib
o Seaborn • SQL ShackSkip - Query Performance
o Plotly • Text Embeddings - Evolution Poster Tuning
o GG Plot
• SQLFlow - Visualize the flow
• Neutron App - Visualize Neural Network
• AWS • DB Diagram - DraQw Entity-Relationship
o Boto3 • Gradio App - ML Model Demo Diagrams for SQL queries
• SQL DBM
• Generative AI • Leonardo AI App - Image Generation
• Dofactory – SQL
• Partyrock.aws • Pandas Data Frames

You might also like