04_MLModelingBasics
Thanks to Christian Kaestner for preparing the material for these slides
© 2024 Fraunhofer USA, Inc. - Center Mid-Atlantic
© 2024 University of Maryland
Learning Goals
▪ Explain the benefits and drawbacks of notebooks
▪ Demonstrate effective use of computational notebooks
▪ Understand how machine learning learns models from labeled data (basic model)
▪ Explain the steps of a typical machine learning pipeline and their responsibilities and challenges
▪ Understand the role of hyper-parameters
▪ Appropriately use vocabulary for machine learning concepts
▪ Evaluate a machine-learned classifier
Target array
▪ In addition to the feature matrix X, we also generally work with a label or target array, which by convention we will usually call y.
▪ The target array is usually one-dimensional, with length n_samples, and is generally contained in a NumPy array.
▪ The target array may have continuous numerical values, or discrete classes/labels.
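The conventions above can be sketched with a small NumPy example (the values here are illustrative, loosely based on the housing data used later in these slides):

```python
import numpy as np

# Hypothetical feature matrix X: 5 samples, 3 features each
X = np.array([
    [0.006, 18.0, 2.3],
    [0.027,  0.0, 7.0],
    [0.027,  0.0, 7.0],
    [0.032,  0.0, 2.1],
    [0.069,  0.0, 2.1],
])
# Target array y: one label per sample, one-dimensional by convention
y = np.array([240_000, 216_000, 347_000, 334_000, 362_000])

assert X.shape == (5, 3)  # (n_samples, n_features)
assert y.shape == (5,)    # (n_samples,) -- one-dimensional
```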
▪ Most commonly, the steps in using the Scikit-Learn estimator API are as follows:
1. Choose a class of model by importing the appropriate estimator class from Scikit-Learn.
2. Choose model hyperparameters by instantiating this class with desired values.
3. Arrange data into a features matrix and target vector.
4. Fit the model to your data by calling the fit() method of the model instance.
5. Apply the model to new data. For supervised learning, we often predict labels for unknown data using the predict() method.
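The five steps above can be sketched with a simple linear model (the toy data here is made up for illustration):

```python
import numpy as np
from sklearn.linear_model import LinearRegression  # 1. choose a model class

model = LinearRegression(fit_intercept=True)       # 2. choose hyperparameters

X = np.array([[1.0], [2.0], [3.0], [4.0]])         # 3. features matrix ...
y = np.array([2.0, 4.0, 6.0, 8.0])                 #    ... and target vector

model.fit(X, y)                                    # 4. fit the model

pred = model.predict(np.array([[5.0]]))            # 5. apply to new data
# The data follows y = 2x exactly, so pred is (numerically) 10.0
```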
▪ We want to evaluate the model on data it has not seen before, so we split the data into a training set and a testing set.
▪ Some datasets come with a separate test set; other times we need to do the split ourselves. We hold back some subset of the data from the training of the model, and then use this holdout set to check the model performance.
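Doing the split ourselves is a one-liner with scikit-learn's `train_test_split` (the data below is synthetic; `test_size=0.3` holds out 30% for evaluation):

```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(20).reshape(10, 2)  # 10 synthetic samples, 2 features
y = np.arange(10)                 # 10 synthetic labels

# Hold out 30% of the data; fix random_state for a reproducible split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)
# Train on (X_train, y_train); evaluate only on the held-out (X_test, y_test)
```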
Machine Learning: Learning Functions from Data
▪ Machine learning learns a function by observing data
▪ Typically used when writing that function manually is hard because the problem is hard or complex.
▪ Examples:
• Detecting cancer in an image
• Transcribing an audio file
• Detecting spam
• Predicting recidivism
• Detecting suspicious activity on a credit card
Supervised Machine Learning
▪ Given a training dataset containing instances (x_i, y_i)
▪ learn a function f with f(x_i) = y_i
▪ that "fits" the given training set and "generalizes" to other data.
Crime Rate  %Large Lots  %Industrial  Near River  # Rooms  ...  Price
0.006       18           2.3          0           6        ...  240,000
0.027       0            7.0          0           6        ...  216,000
0.027       0            7.0          0           7        ...  347,000
0.032       0            2.1          0           6        ...  334,000
0.069       0            2.1          0           7        ...  362,000
[Figure sources: Geeksforgeeks.org, Towards AI]
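Fitting such a function from the housing table above can be sketched with a decision tree regressor (the rows are copied from the table; a fully grown tree will memorize this tiny training set exactly, which also previews the overfitting concern discussed later):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Rows from the housing table: crime rate, %large lots, %industrial,
# near river, # rooms (remaining columns elided)
X = np.array([
    [0.006, 18, 2.3, 0, 6],
    [0.027,  0, 7.0, 0, 6],
    [0.027,  0, 7.0, 0, 7],
    [0.032,  0, 2.1, 0, 6],
    [0.069,  0, 2.1, 0, 7],
])
y = np.array([240_000, 216_000, 347_000, 334_000, 362_000])  # prices

tree = DecisionTreeRegressor(random_state=0).fit(X, y)
# An unrestricted tree splits until leaves are pure, so it reproduces
# the training labels exactly -- it "fits" but may not "generalize"
```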
▪ Gain Ratio
▪ p_i: the fraction of items labeled with class i in the set
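Using the p_i defined above, the standard Shannon entropy used in decision-tree splitting criteria (H = -sum_i p_i log2 p_i) can be computed as follows; the function name is illustrative:

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy H = -sum_i p_i * log2(p_i),
    where p_i is the fraction of items labeled with class i."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

# A 50/50 split has the maximal entropy of 1 bit; a pure set has 0 bits
assert entropy(["yes", "no", "yes", "no"]) == 1.0
assert entropy(["yes", "yes"]) == 0.0
```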
Example - Golfing: f(Outlook, Temperature, Humidity, Windy) = Play golf or not
Degrees of freedom
Demo
▪ Navigate to:
▪ https://colab.research.google.com/github/jGiltinan/SE4AI_DecisionTree/blob/master/golf_TrainTestSplit.ipynb
Scikit Learn - Model Validation via Cross-Validation
• Method: repeatedly partition the data into training and validation sets, train and evaluate the model on each partition, and average the results
• Many split strategies, including:
• leave-one-out: evaluate on each datapoint, using all other data for training
• k-fold: k equal-sized partitions; train on k−1 partitions, evaluate on the held-out one
• repeated random sub-sampling (Monte Carlo)
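The k-fold strategy is directly supported by scikit-learn's `cross_val_score`; this sketch uses the bundled iris dataset as a stand-in for any classification data:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import KFold, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# k-fold with k=5: five equal-sized partitions; train on four,
# evaluate on the held-out one, once per partition
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=cv)

mean_accuracy = scores.mean()  # average the per-fold results
```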
Srivastava et al., “Dropout: A Simple Way to Prevent Neural Networks from Overfitting,” 2014.
▪ Classification:
▪ Categorical cross-entropy (CCE): L(y, ŷ) = −Σ_i y_i log(ŷ_i), with one-hot true labels y
▪ Sparse CCE: the same loss, but true labels are integer class indices, so it does not require one-hot encoding
https://ml-cheatsheet.readthedocs.io/en/latest/loss_functions.html
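A minimal NumPy sketch of both variants (function names are illustrative); the two compute the same loss, differing only in how the true labels are encoded:

```python
import numpy as np

def categorical_cross_entropy(y_true_onehot, y_pred):
    # CCE = -sum_i y_i * log(p_i), averaged over samples
    return -np.mean(np.sum(y_true_onehot * np.log(y_pred), axis=1))

def sparse_cce(y_true_idx, y_pred):
    # Same loss: pick the predicted probability of the true class
    # directly by integer index -- no one-hot encoding needed
    return -np.mean(np.log(y_pred[np.arange(len(y_true_idx)), y_true_idx]))

y_onehot = np.array([[1, 0, 0], [0, 1, 0]])  # one-hot labels
y_idx = np.array([0, 1])                     # same labels as class indices
p = np.array([[0.7, 0.2, 0.1],               # predicted probabilities
              [0.1, 0.8, 0.1]])

assert np.isclose(categorical_cross_entropy(y_onehot, p), sparse_cce(y_idx, p))
```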
“Why Should I Trust You?”: Explaining the Predictions of Any Classifier, Ribeiro et al., 2016
https://github.com/marcotcr/lime