0% found this document useful (0 votes)

16 views20 pages

Models

Uploaded by

Ganesh Tangudu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as XLSX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views20 pages

Models

Uploaded by

Ganesh Tangudu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as XLSX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Regression

Classical Learning Classification

Clustering
Clustering

Pattern search

Bagging

Ensemble methods Boosting

Stacking

CNN
CNN

Machine learning

Nueral Nets

RNN

Generative Adversial Network

Auto Encoders

Reinforcement
Time series

Linear

Decomposition
Non-Linear
Decomposition
Non-Linear

Matrix Factorization

Manifold Learning
Data Science

Linear Regression

Polynomial Regression

Ridge Regression

Lasso

Elastic Net

Support Vector Machine

XGBoost

Decision Trees

Random forest

Bayesian Linear Regression

K-Nearest neighbours

Naïve Bayes

Support Vector Machine

Decision Trees

Logistic

Gradient Boost

Random forest

K-Means

Mean shift
Heirarchial clustering

Density-Based spatial clustering of applications with noise (DBSCAN)

Agglomerative

Apriori
Euclat

FP-Growth

Random forest

Xtreme Gradient Boosting(XGBoost)

AdaBoost

LightBoost

Gradient Boost Machine

CatBoost

Stochastic Gradient Boosting

Difussion convolution recurrent network

Deep residual network

LeNet-5

AlexNet

VGGNet

GoogLeNet
MobileNet

EfficientNet

Dense Convolution Network

Long Short Term Memory

Gated Recurrent Network

Recurrent Convolution network

Heirarchial Network

Bidirectional

Generator

Discriminator

seq2seq

Genetic Algorithm

Asynchronous Advantage Actor-Critic(A3C)

State Action Reward State Action(SARSA)

Deep Q-Network

Q-Learning
ARIMA

SARIMA

Exponential Smoothing

Seasonal Decomposition

Vector Autoregression(VAR)

State Space Model

Long short term memory

Guassian processes

Prophet

Neural prphet

Wavelet Transforms

Dynamic Linear Models

Principal Component Analysis

Linear Discriminant Analysis

Factor Analysis

Independent Component Analysis

Multidimensional Scaling

t-Distributed Stochastic Neighbor Embedding(t-SNE)

Isomap

Locally Linear Embedding

Kernel PCA
Self Organising Maps

Auto Encoders

Singular Value Decomposition

Non-Negative Matrix Factorization

Guassian process Latent Variable Model

Difussion maps
Predicts the target variable as a linear function of a single predictor variable.

Predicts the target variable as a linear function of multiple predictor variables.

Adds a penalty equal to the square of the magnitude of coefficients to the loss function to prevent
overfitting.
Adds a penalty equal to the absolute value of the magnitude of coefficients to the loss function,
leading to sparse models.
Combines L1 and L2 regularization penalties to take advantage of both ridge and lasso regression.

Uses support vector machine principles for regression, capable of fitting complex, non-linear
functions.
Builds an ensemble of trees sequentially, where each tree corrects errors made by the previous
ones. Combines the predictions of weak learners to form a strong prediction.
Splits the data into regions and fits a simple model (usually a constant) within each region.
Prediction is the mean value of the target variable in each leaf node.
An ensemble of decision trees, where each tree is trained on a random subset of the data and
predictions are averaged. Aggregates predictions from multiple decision trees.

Incorporates prior distributions for the model parameters and updates these distributions based on
the data. Bayesian inference is used to derive the posterior distribution of the model parameters.

Classifies a data point based on the majority class among its k-nearest neighbors in the feature
space. Simple and effective for small datasets with clear class boundaries.

Assumes independence among predictors given the class. Simple, fast, and works well with small
datasets and text classification.
Finds the hyperplane that best separates classes in the feature space. Uses kernel functions to
handle non-linear boundaries.
Splits the data into subsets based on feature values, forming a tree-like structure. Can handle both
numerical and categorical data.
Models the probability of a binary outcome. Uses the logistic function to constrain the output
between 0 and 1.
Builds an ensemble of trees sequentially, where each tree corrects errors made by the previous
ones. Includes popular implementations like XGBoost, LightGBM, and CatBoost.
An ensemble of decision trees, where each tree is trained on a random subset of the data.
Combines predictions from multiple trees to improve accuracy and reduce overfitting.
One of the most popular clustering algorithms, K-means partitions the data into K clusters by
iteratively assigning data points to the nearest cluster centroid and updating the centroids based
on the mean of the points assigned to each cluster. It aims to minimize the within-cluster variance.

Mean Shift is a non-parametric clustering algorithm that doesn't require specifying the number of
clusters beforehand. It works by iteratively shifting each data point towards the mode (peak) of the
kernel density estimate of the data until convergence.
This method creates a hierarchy of clusters by either bottom-up (agglomerative) or top-down
(divisive) approaches. In agglomerative clustering, each data point starts in its own cluster, and
pairs of clusters are merged as one moves up the hierarchy. In divisive clustering, all data points
start in one cluster, which is recursively split as one moves down the hierarchy.

DBSCAN groups together points that are closely packed together, based on a specified distance
(epsilon) and minimum number of points (minPts) within that distance. It can discover clusters of
arbitrary shapes and is robust to noise.

This hierarchical clustering algorithm starts with each point as its own cluster and merges the
closest pair of clusters iteratively until only one cluster remains.

Identifies frequent itemsets and generates association rules. Used in market basket analysis.
Uses a vertical database format for finding frequent itemsets. Efficient for datasets with many
frequent itemsets.
More efficient than Apriori by using a prefix tree to store frequent patterns. Avoids generating
candidate sets explicitly.
An extension of bagged decision trees where each tree is trained on a bootstrap sample of the data
and, additionally, a random subset of features is considered for splitting at each node. This
decorrelates the trees and improves performance.

An optimized and scalable implementation of gradient boosting, XGBoost includes regularization

terms to prevent overfitting and support parallel processing.
AdaBoost adjusts the weights of incorrectly classified instances, increasing their influence on the
learning process. It combines multiple weak classifiers to form a strong classifier by focusing on
difficult cases in each iteration.
A gradient boosting framework that uses tree-based learning algorithms, LightGBM is designed for
efficiency and scalability. It uses a histogram-based approach for faster training.
GBM builds an ensemble of decision trees sequentially, where each tree is trained to correct the
errors of the previous trees. It uses gradient descent to minimize the loss function.
Specifically designed to handle categorical features efficiently, CatBoost uses ordered boosting to
reduce prediction shift and support categorical data directly.

A variant of gradient boosting that introduces randomness by subsampling the training data and
features. This can help reduce overfitting.
Combines multiple base models (e.g., decision trees, logistic regression, SVMs) and trains a meta-
model on their predictions. The base models are typically trained on the original data, and the
meta-model is trained on the outputs (predictions) of the base models.

Introduced the concept of residual learning to address the degradation problem in deep networks.
Uses skip connections to allow gradients to flow more easily through the network.
One of the earliest CNN architectures designed by Yann LeCun for handwritten digit recognition
(MNIST dataset).
A deeper architecture that popularized CNNs in large-scale image classification tasks. It introduced
the use of ReLU activation and dropout for regularization.

Characterized by its use of very small (3x3) convolutional filters and a deep architecture with up to
19 layers.
Introduced the Inception module, which allows for more efficient computation by combining
multiple convolutions with different filter sizes.
Designed for mobile and embedded vision applications, it uses depthwise separable convolutions
to reduce the number of parameters.
Uses a compound scaling method to balance network depth, width, and resolution for improved
efficiency.
Uses dense blocks where each layer receives input from all previous layers, improving gradient flow
and feature reuse.
An extension of RNN designed to overcome the vanishing gradient problem. LSTMs have a more
complex architecture that includes gates (input gate, forget gate, and output gate) to control the
flow of information.
A simplified version of LSTM that combines the input and forget gates into a single update gate and
merges the cell state and hidden state. GRUs are computationally more efficient than LSTMs while
still addressing the vanishing gradient problem.

Combines the strengths of CNNs and RNNs by applying convolutional layers to capture spatial
features and recurrent layers to capture temporal features.
Organizes RNN layers hierarchically to capture different levels of abstraction at different temporal
scales.
Consists of two RNNs running in parallel, one in the forward direction and one in the backward
direction. This allows the network to have both past and future context.
The generator algorithm is responsible for creating new data samples. It takes random noise as
input and generates data that should ideally be indistinguishable from real data.
The discriminator algorithm is like a binary classifier. It tries to distinguish between real data
samples and the fake samples generated by the generator. Its goal is to correctly classify real data
as real (label 1) and fake data as fake (label 0).

An encoder and a decoder, widely used for tasks like machine translation and text summarization,
where it maps input sequences to output sequences, effectively capturing contextual information
and generating meaningful responses. It utilizes recurrent neural networks or transformers to
encode source sequences into fixed-length representations and decode them into target
sequences, enabling the modeling of complex relationships between inputs and outputs.

Genetic Algorithms are optimization techniques inspired by natural selection, using selection,
crossover, and mutation to evolve candidate solutions towards optimal outcomes across diverse
problem domains. They efficiently explore large search spaces, providing robust solutions to
complex optimization problems, albeit with sensitivity to parameter settings and computational
demands.

Actor-Critic methods combine value-based and policy-based approaches by maintaining both a

policy network (actor) and a value network (critic). The critic evaluates actions, providing feedback
to the actor to improve its policy.

DQN is an extension of Q-Learning that uses deep neural networks to approximate the action-value
function. It employs experience replay and target networks to stabilize training and improve
sample efficiency.
Q-Learning is a model-free RL algorithm that learns an action-value function Q(s,a)Q(s, a)Q(s,a),
which estimates the expected cumulative reward of taking action aaa in state sss. It uses the
Bellman equation to update the Q-values iteratively.
ARIMA is a popular linear model used for time series forecasting. It combines autoregression (AR),
differencing (I), and moving average (MA) components to capture different aspects of the time
series data.
SARIMA extends ARIMA by incorporating seasonal components into the model to account for
periodic fluctuations in the data.
Exponential smoothing methods, such as Simple Exponential Smoothing (SES), Double Exponential
Smoothing (DES), and Triple Exponential Smoothing (Holt-Winters), are simple yet effective
techniques for forecasting time series data. They assign exponentially decreasing weights to past
observations.

STL decomposes time series data into seasonal, trend, and remainder components using a process
of repeated filtering.
VAR models are used for multivariate time series forecasting, where multiple time series variables
are modeled simultaneously as a system of equations.
State space models, including the Kalman Filter and Bayesian Structural Time Series (BSTS) models,
represent time series data as a latent state process evolving over time.
LSTM networks are a type of recurrent neural network (RNN) capable of learning long-term
dependencies in sequential data, making them well-suited for time series forecasting tasks.
Gaussian processes are a flexible non-parametric Bayesian approach for modeling time series data.
They model the distribution over functions and can capture complex patterns in the data.
Prophet is a forecasting tool developed by Facebook that decomposes time series data into trend,
seasonality, and holiday effects and utilizes a piecewise linear or logistic growth curve model.

Neural Prophet is an extension of Facebook's Prophet that incorporates neural network

architectures to capture complex patterns in time series data.
Wavelet transforms are used for time-frequency analysis of non-stationary time series data,
allowing for both time and frequency localization.

DLMs are Bayesian state space models that can be used for time series analysis and forecasting by
specifying dynamic relationships between observed and unobserved variables.
Projects data onto the directions of maximum variance. Commonly used for feature extraction and
data visualization.
Projects data to a lower-dimensional space by maximizing class separability. Often used for
supervised classification tasks.
Models observed variables and their linear combinations using latent factors. Used for uncovering
hidden relationships in the data.
Decomposes multivariate signals into additive, independent components. Commonly used in signal
processing and for separating mixed signals.
Projects data into a lower-dimensional space while preserving pairwise distances as well as
possible. Used for data visualization.
Reduces dimensions by modeling pairwise similarities and preserving local structure. Primarily used
for visualization of high-dimensional data.
Preserves geodesic distances between all points. Captures the intrinsic geometry of the data
manifold.
Preserves local linear relationships among neighboring points. Good for unwrapping manifolds in
high-dimensional data.
Extends PCA using kernel methods to handle non-linear relationships. Used for capturing non-
linear structures.
Neural network-based method that reduces dimensions while preserving topological properties.
Used for clustering and visualization.
Neural networks trained to compress and reconstruct data. Capture complex non-linear
relationships and are used for unsupervised learning.
Decomposes a matrix into singular vectors and singular values. Used for latent semantic analysis,
noise reduction, and feature extraction.
Factorizes a matrix into non-negative factors. Useful for parts-based representation in image
processing and text mining.
Uses Gaussian processes to learn a latent space representation. Captures uncertainty in the
embedding process.
Uses diffusion processes to model the data geometry. Preserves the global data structure and is
robust to noise.
Hyperparameters

fit_intercept, nomalize

degree, include_bias

alpha, solver

alpha, max_iter

alpha, l1_ratio

c, kernel, gamma

alpha, lambda, fit_intercept

n_neighbors, weights

c, kernel, gamma

max_depth, min_samples_split, min_samples_leaf

penalty, c, solver

n_estimators, max_features, max_depth

n_clusters, max_iter
n_estimators, learning_rate
units, activation, reccurent_activation, dropout

units, activation, reccurent_activation, dropout

learning_rate, discount_factor, batch_size, memory_size

learning_rate, discount_factor, epsilon

Role and Range
fit_intercept' determines whether the model will include an intercept term (the y-intercept) in the equation. Boolean range
'normalize' determines whether the input features X should be normalized before fitting the model. Boolean range
degree' determines the highest power of x (the independent variable) to include in the polynomial features. Range(Any positiv
'include_bias' determines whether an additional column of ones (the bias term, or the intercept) is included in the transforme
Range (true or false).
alpha' controls the regularization strength applied to the linear regression model. Range(non-negative real number).
specifies the algorithm used to optimize the ridge regression model.
alpha' controls the strength of the regularization applied to the model. Range(non-negative real number).
'max_iter' determines the maximum number of iterations allowed for the solver to converge to a solution. Range(positive valu
Linear
Polynomial
Ridge
Lasso
Regression
Elastic Net
Bayesian
Quantile
Support vector
Logistic
Classification Support vector
KNN

Case Study Week 2
No ratings yet
Case Study Week 2
3 pages
Models
No ratings yet
Models
46 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
10 pages
Unit 4 Introduction To Algorithm
No ratings yet
Unit 4 Introduction To Algorithm
10 pages
Algorithms and Frameworks Used in The Development of Machine Learning Models
No ratings yet
Algorithms and Frameworks Used in The Development of Machine Learning Models
5 pages
ML CheatSheet
No ratings yet
ML CheatSheet
14 pages
Kavin
No ratings yet
Kavin
15 pages
Machine Learning and Deep Learning A Comprehensive Overview
No ratings yet
Machine Learning and Deep Learning A Comprehensive Overview
15 pages
2 Mark DM
No ratings yet
2 Mark DM
1 page
Machine Learning Clustering AlgorithmsI
No ratings yet
Machine Learning Clustering AlgorithmsI
129 pages
Supervised Learning Final With Diagrams Cleaned
No ratings yet
Supervised Learning Final With Diagrams Cleaned
7 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
19 pages
3.popular Machine Learning Algorithm
No ratings yet
3.popular Machine Learning Algorithm
11 pages
Nit ML Sugg
No ratings yet
Nit ML Sugg
5 pages
ML Unit 3
No ratings yet
ML Unit 3
10 pages
M.L. 3,5,6 Unit 3
No ratings yet
M.L. 3,5,6 Unit 3
6 pages
INT 354 CA1 Mokshagna
No ratings yet
INT 354 CA1 Mokshagna
8 pages
Machine Learning Notes ?
No ratings yet
Machine Learning Notes ?
14 pages
Aiml Model
No ratings yet
Aiml Model
13 pages
Interview AI Algo
No ratings yet
Interview AI Algo
3 pages
ML - ML in Nutshell
No ratings yet
ML - ML in Nutshell
7 pages
Httpsforcastify App - Onrender.comlog
No ratings yet
Httpsforcastify App - Onrender.comlog
2 pages
Exercise of Chapter 4 - Data Mining Tools and Techniques Worksheet
No ratings yet
Exercise of Chapter 4 - Data Mining Tools and Techniques Worksheet
4 pages
Classification
No ratings yet
Classification
34 pages
Categorization of ML-DL Algorithms
No ratings yet
Categorization of ML-DL Algorithms
1 page
Unit 3 Ds
No ratings yet
Unit 3 Ds
10 pages
Unit 1
No ratings yet
Unit 1
15 pages
Decision Trees
No ratings yet
Decision Trees
5 pages
AI As Subset
No ratings yet
AI As Subset
16 pages
Machine Learning Algorithms Overview
No ratings yet
Machine Learning Algorithms Overview
7 pages
Learning Techniques For NILMTK
No ratings yet
Learning Techniques For NILMTK
9 pages
001IntroductiontomachinelearningPart I
No ratings yet
001IntroductiontomachinelearningPart I
10 pages
ML 1
No ratings yet
ML 1
17 pages
Unit-5 MECH 3-2
No ratings yet
Unit-5 MECH 3-2
14 pages
Understanding Machine Learning Algorithms - in Depth
No ratings yet
Understanding Machine Learning Algorithms - in Depth
167 pages
Zzplagiarism
No ratings yet
Zzplagiarism
23 pages
Interview Preparing - ML Draft
No ratings yet
Interview Preparing - ML Draft
12 pages
AutoML Tools
No ratings yet
AutoML Tools
2 pages
Unit 4 Learning
No ratings yet
Unit 4 Learning
5 pages
DL
No ratings yet
DL
10 pages
ML 2
No ratings yet
ML 2
3 pages
(IJCST-V10I4P14) :manish Chava, Aman Agarwal, DR Radha K
No ratings yet
(IJCST-V10I4P14) :manish Chava, Aman Agarwal, DR Radha K
5 pages
Module 1 ML Mumbai University
No ratings yet
Module 1 ML Mumbai University
47 pages
Machine Learning Mastery Notes
No ratings yet
Machine Learning Mastery Notes
4 pages
CPEN106 Machine Learning and Predictive Analytics
No ratings yet
CPEN106 Machine Learning and Predictive Analytics
43 pages
Notes
No ratings yet
Notes
35 pages
Mod B Concise Summary
No ratings yet
Mod B Concise Summary
2 pages
Machine Learning
No ratings yet
Machine Learning
4 pages
ML ModuleUntitled 2
No ratings yet
ML ModuleUntitled 2
8 pages
What Are The Common Algorithms in Machine Learning
No ratings yet
What Are The Common Algorithms in Machine Learning
3 pages
ML - Interview Prep
No ratings yet
ML - Interview Prep
9 pages
Introduction To AI and Machine Learning
No ratings yet
Introduction To AI and Machine Learning
21 pages
Lec05 - Supervised
No ratings yet
Lec05 - Supervised
26 pages
Accelerated Data Science Introduction To Machine Learning Algorithms
No ratings yet
Accelerated Data Science Introduction To Machine Learning Algorithms
37 pages
Basic Knowlege
No ratings yet
Basic Knowlege
4 pages
Assignment 2
No ratings yet
Assignment 2
111 pages
Machinelearning GateNotes
No ratings yet
Machinelearning GateNotes
105 pages
Machine Learning Algorithmns.
No ratings yet
Machine Learning Algorithmns.
11 pages
Unit 3 by GPT
No ratings yet
Unit 3 by GPT
10 pages
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Python Machine Learning: Learn how to build powerful Python machine learning algorithms to generate useful data insights with this data analysis tutorial
From Everand
Python Machine Learning: Learn how to build powerful Python machine learning algorithms to generate useful data insights with this data analysis tutorial
Sebastian Raschka
4/5 (20)
An Exploratory Spatial Data Analysis of Income and Education Inequality in Pakistan
No ratings yet
An Exploratory Spatial Data Analysis of Income and Education Inequality in Pakistan
24 pages
Data Analyst Information
No ratings yet
Data Analyst Information
15 pages
English Courses 2014-2015 PDF
No ratings yet
English Courses 2014-2015 PDF
9 pages
Calculation of Regrattion
No ratings yet
Calculation of Regrattion
6 pages
The Effects of Big Data On Forensic Accounting
No ratings yet
The Effects of Big Data On Forensic Accounting
16 pages
Two Sample Z Test
100% (1)
Two Sample Z Test
6 pages
Learning Path Excel
No ratings yet
Learning Path Excel
4 pages
The Effect of Jarimatic Technique On Children
No ratings yet
The Effect of Jarimatic Technique On Children
8 pages
Chapter 3 Forcasting
0% (1)
Chapter 3 Forcasting
82 pages
IOTA Data Driven Tax Administration 2016
No ratings yet
IOTA Data Driven Tax Administration 2016
64 pages
01 Koch PDF
No ratings yet
01 Koch PDF
13 pages
Data Science and Big Data by IBM CE Allsoft Summer Training Final Report
100% (1)
Data Science and Big Data by IBM CE Allsoft Summer Training Final Report
41 pages
Research Papers of Rainfall Ptediction
No ratings yet
Research Papers of Rainfall Ptediction
8 pages
CV0003
No ratings yet
CV0003
43 pages
Association, Correlation and Causation: Points of Significance
No ratings yet
Association, Correlation and Causation: Points of Significance
2 pages
(IJETA-V9I1P2) :yew Kee Wong
No ratings yet
(IJETA-V9I1P2) :yew Kee Wong
7 pages
Decision Support System Assignment 1
No ratings yet
Decision Support System Assignment 1
4 pages
Junior Estimator Cover Letter Examples
100% (2)
Junior Estimator Cover Letter Examples
6 pages
Goals in Statistic
100% (1)
Goals in Statistic
149 pages
Preliminary Selection of Mean Structure
No ratings yet
Preliminary Selection of Mean Structure
9 pages
ML Supervised Full Notes
No ratings yet
ML Supervised Full Notes
62 pages
Sds
No ratings yet
Sds
14 pages
Project Report Format - Draft - BMS
No ratings yet
Project Report Format - Draft - BMS
21 pages
DBSCAN
No ratings yet
DBSCAN
30 pages
05 Contrasts1
No ratings yet
05 Contrasts1
73 pages
Adoption of B2B E-Commerce by The SMEs in Bangladesh, 2011, Al Noor, Arif PDF
No ratings yet
Adoption of B2B E-Commerce by The SMEs in Bangladesh, 2011, Al Noor, Arif PDF
13 pages
Deepanshu Analyst - SQL
No ratings yet
Deepanshu Analyst - SQL
1 page