0% found this document useful (0 votes)

13 views28 pages

Lec 01

The document outlines the course IT549: Deep Learning, taught by Dr. Arpit Rana, covering topics such as neural networks, machine learning tasks, and data mining. It includes course logistics, assessment methods, a preliminary schedule, and case studies demonstrating data-driven solutions. Prerequisites include programming in Python and machine learning, with a focus on practical applications in various domains.

Uploaded by

202411073

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views28 pages

Lec 01

Uploaded by

202411073

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

IT549: Deep Learning

Lecture - 01

A Primer for Deep Learning

[Deﬁnition, Tasks, and Case Studies]

Arpit Rana
2nd January 2025
Course Logistics
Syllabus and Evaluation Scheme
Course Logistics

Dr. Arpit Rana

Instructor Room-3105, Faculty Block-3
Email: [email protected] (with a prior appointment only)

Himanshu Beniwal ([email protected]) - PMRF TA

TA Contact Info
DA-IICT, TBD

Prerequisites Programming in Python, Machine Learning

Eligibility B.Tech. VI Semester, M.Tech. II Semester, and Ph.D. Students

Course Logistics

Credit Weighting 4 Credits (3-0-2)

Lectures Monday, Thursday: 12:00 - 13:00 hrs. and Friday: 08:00 -

[CEP-108] 09:00 hrs.

Lab
Thursday, 14:00 – 16:00
[LT-02]

Private Study At least 4 hrs per week

● Learn how to solve Data-driven Decision-Making

Problems;
● Learn how to work on structured and unstructured
Potential Outcome
(e.g., text, image) data;
● Targeted Jobs: Data Scientist, ML Engineer,
Research Engineer, AI Engineer
Course Logistics

● In-Sem Exams: 30% (15% + 15%)

● End-Sem Exam: 30%
Assessment ● Course Project:: 45%
Extra Credits: ML Challenges, Participate on Google Stream

Skip lectures; avoid private study; cram just before the exam;
How to Fail expect the exam to be a memory test; copy project assignments;
be inactive on the course stream

Attend lectures; summarize the notes; expect a problem-solving

How to Pass exam; do your project yourself; be active and accurate in the
class and on the course stream; and do group study
Preliminary Schedule

Week Lecture Lab Project

Week-1 Course Admin; Fundamentals of Predictive Analytics:
– No lab – –
[1 Jan 2025] Representation, Evaluation, and Optimization

– No lab –
Week-2 Introduction to Neural Networks: Neurons, Activation,
(Group formation and domain –
[6 Jan 2025] Layers, Architecture, and Examples
ﬁnalization through Google form)

Training Deep Neural Networks: Backpropagation,

Week-3 Vanishing Gradient Problem: Deﬁnition, Better Linguistic Preprocessing, Text Project
[13 Jan 2025] Weight Initialization, Non-Saturating Activation Vectorization, and Embeddings Assigned
Functions, and Batch-Normalization

Week-4 Overﬁtting: Reducing the network size, Weight or

Pre-training and Fine-tuning –
[20 Jan 2025] Max-norm regularization, Dropout, Early stopping

Week-5 100-Minute ML Development

Applications in Natural Language Processing –
[27 Jan 2025] Challenge

Week-6 First In-Semester Examination

–
[3 Feb 2025] (7th Feb - Friday to 11th Feb - Tuesday)
Preliminary Schedule

Convolution Neural Networks (CNNs): Introduction,

Week-6 Motivation, Images and Rank-3 Tensors, Convolution Image Vectorization or Image Tensors, Progress
[10 Feb 2025] Layers, Filters, Feature Maps, and Stacking Using Pre-trained CNNs, Fine-tuning, etc. Check I
Convolution Layers

Training CNNs: Memory requirements, overﬁtting,

Week-7 Object Detection and Image
data augmentation, Pre-trained CNNs, Transfer –
[17 Feb 2025] Segmentation
Learning,

Recurrent Neurons and Layers; Training RNNs;

Week-8 Working with Sequences or Time-Series
Forecasting a Time Series using Simple RNN, Deep –
[24 Feb 2025] Data
RNN; Handling Long Sequences: LSTM;

Week-9 Encoder-Decoder Network: Bidirectional RNN, Beam

100-Minute ML Development Challenge –
[3 Mar 2025] Search; Attention Mechanism:

Week-10 In-semester Break

-
[10 Mar 2025] (Entire week)

Week-11 The Transformer Architecture — Training of Progress

Training Transformers
[17 Mar 2025] Transformer Check II

Week-12 Second In-Semester Examination

–
[24 Mar 2025] (22 Mar Saturday to 26th Mar Wednesday)
nd
Preliminary Schedule

Deep learning for unsupervised learning;

Unsupervised Learning using
Week-13 Architecture design of AE: Linear, Stacked,
Auto-encoders: An application for –
[31 Mar 2025] Convolutional; Recurrent; Denoising, Sparse; and
demonstration
Variational; Generative Learning using VAE.

Generative modeling as a game-theoretic approach;

Week-14 Training GANs: An application for
Architecture design of GAN; Training methodology -
[7 Apr 2025] demonstration
of GAN; Applications.

Recommendation Systems: Deﬁnition, objectives,

components, approaches, evaluation, and challenges.
Week-15
Models: Matrix Factorization, Neural Matrix 100-Minute ML Development Challenge -
[14 Apr 2025]
Factorization, and Collaborative Denoising
Auto-Encoders;

Week-16 Last date of Classes, Labs, and Tutorials Progress

[21 Apr 2024] (23rd Apr 2025) Check III

Week-17 End-Semester Examination

[29 Apr 2024] (24 Apr 2025 to 02 May 2025)

Course Project Evaluation

(05 May 2025 to 07 May 2025)
Course Policy

Student Groups

● The course project will be allocated to groups of three/ four members. Each group will
also be involved in 100-minute ML development challenges.
● 45% of your marks will be based on team efforts, so choose your members wisely.
● Teams will remain unchanged throughout the semester once registered. No requests for
changes will be entertained.
● Team registration will open the second week after classes start, and you must register
your team via a Google form within three days of the announcement.
● During lab hours, there will be three machine-learning development challenges. The
three most proﬁcient teams shall be acknowledged with a bonus of up to 3% on their
respective scores.
● Every team member must understand the concepts, code, and claims they submit, as
any member may be asked questions about their project.
Course Policy

Course Project
● There is only one-course project, an End-to-End ML application.
● Student groups must select a thematic domain: Finance, E-commerce, Healthcare,
Pharma, Sports, Entertainment, Renewable Energy, Oil & Gas, Automobile, Agriculture,
FMCG, Security, Social Media, Supply Chain, or any other exciting and valuable domain.
● Each group will deﬁne the problem in their selected domain and collect dataset(s) from
reliable sources, including publicly available ones (no two groups can work on the same
dataset and not more than two groups can work on the same domain). You are
encouraged to gather additional data to enhance your dataset and better address the
problem.
● Each group must develop a multimodal machine-learning application and select a
dataset with all the necessary modalities. You must add a novel contribution to your
project and compare yours with the existing baselines.
● Three progress checks are scheduled to ensure incremental progress, not a last-week
effort.
● The project guideline document will provide information on domain allocation, general
instructions, evaluation criteria, and other protocols.
Course Policy

Submission
● One group member will submit the project report and the code on Google Classroom.
Submission instructions will be provided in the project guideline document.
● Evaluation will primarily be online, reviewing your code. Any group member may be
asked questions about anything in the assignment.
● Late submissions (up to 24 hours) will incur a 20% penalty.
● Plagiarism includes:
○ Copying any segment of code from any source.
○ Submitting code not written by you personally.
● Suspected plagiarism will result in a ZERO for the assignment.
Introduction
Deﬁnition and Tasks
What is Data (Knowledge) Mining?

The process of automatically (or

semi-automatically) discovering interesting
patterns from large amounts of data.

● Implicit (somewhat hidden),

● Non-trivial (not obvious),

● Previously unknown (novel), and

● Potentially useful (for consumers /

sellers / stakeholders)

Image Source: https://www.investopedia.com/terms/d/datamining.asp

Data (Knowledge) Mining: Knowledge Discovery from Data

Databases Extracting
Flat ﬁles Interesting Patterns Novel
Data Warehouses using Actionable
and so on. Intelligent Methods Useful

Data Post-
Data Data Mining Knowledge
Preprocessing processing

Textual data
Cleaning Evaluation w.r.t.
e.g. text, blogs
Reduction
- Interestingness
Multimedia data Transformation
- Completeness
e.g. image, video Discretization
- Optimality
Sequential data Selection etc.
e.g., gene sequence
Spatial data
e.g. maps,
and so on.

DIKW Pyramid: https://www.ontotext.com/knowledgehub/fundamentals/dikw-pyramid/

Data Mining vs. Machine Learning

The process of automatically (or Machine learning (ML) is focused on

semi-automatically) discovering interesting understanding and building methods that
patterns from large amounts of data. 'learn'.

It uses methods at the intersection of It leverages data to improve performance

machine learning, statistics, and database on some set of tasks.
systems.
E.g.: A spam ﬁlter (an ML program)
E.g., customer churn
Data Mining Tasks

Data Mining Tasks

The actual data mining task is the semi-automatic or automatic
analysis of large quantities of data to extract interesting patterns.

Descriptive Predictive
Find human-interpretable patterns Use some variables to predict future
that describe the data. or unknown values of other variables.

● Cluster Analysis ● Regression

● Outlier Analysis ● Classiﬁcation
● Association Rule Mining
● Sequence Pattern Mining

In Machine Learning terminology, these In Machine Learning terminology, these

tasks are categorised as “Unsupervised tasks are categorised as “Supervised
Learning”. Learning”.
Data Mining Tasks

Data Mining Tasks

The actual data mining task is the semi-automatic or automatic
analysis of large quantities of data to extract interesting patterns.

Descriptive Predictive
Find human-interpretable patterns Use some variables to predict future
that describe the data. or unknown values of other variables.

● Cluster Analysis ● Regression

● Outlier Analysis ● Classiﬁcation
● Association Rule Mining
● Sequence Pattern Mining

In Machine Learning terminology, these In Machine Learning terminology, these

tasks are categorised as “Unsupervised tasks are categorised as “Supervised
Learning”. Learning”.
Machine Learning: Deﬁnition

Machine Learning is

● the science (and art) of programming computers

● so they can learn from data. AI

ML
– Aurelien Geron, Google
DL

Gen
-AI
Machine Learning: Example

A Spam Filter,
● a Machine Learning Program, given
○ examples of “spam” emails (e.g. ﬂagged by
users), and
○ examples of “ham” (i.e. regular) emails
● can learn to ﬂag spam
Machine Learning: A New Programming Paradigm

Data Rules Data Answers

Traditional
Programming Machine
Learning
(Symbolic AI)

Answers Rules

● A long list of complex (hard coded) rules ● Automatically learns which words or
phrases are good predictors of spam
● Keep writing new rules as the new
phrases are introduced by spammers
Machine Learning: Deﬁnition Revisited

Machine Learning is the training of a model from data that generalises a decision against a
performance measure.

● Training a model suggests training examples. Data Answers

● A model suggests state acquired through experience.

● Generalises a decision suggests the capability to make a

decision based on inputs and anticipating unseen inputs in
the future for which a decision will be required. Machine
Learning
● against a performance measure suggests a targeted need and
directed quality to the model being prepared.

Model
Learning = Representation + Evaluation + Optimization

Representation
Choosing a representation of the learner: the hypotheses
space or the model class — the set of models that it can
possibly learn.

Evaluation
Choosing an evaluation function (also called objective
function, utility function, loss function, or scoring
function) is needed to distinguish good classiﬁers from
bad ones.

Optimization
��
Choosing a method to search among the models in the
hypothesis space for the highest-scoring one.
Learning = Representation + Evaluation + Optimization

✔
✔ ✔ ✔
✔ ✔ ✔

✔ ✔
✔ ✔ ✔
✔
✔

✔
Business Case Studies
Fakespot, GoKwik, and Intello Labs
Case Study - I: Fakespot

Problem Identiﬁed
● Nearly 93% consumers read reviews before any kind of purchasing decision.
● Out of these, around 91% of 18–34 year olds trust reviews as much as a
recommendation from a friend!
● Over 30% of reviews are found to be fake.

Target Audience
All e-commerce businesses that allow users to write
reviews.

Data-driven Solution
Fakespot reports provide an Adjusted Rating that
weighs reviews based on authenticity and then Courtesy: Fakespot
recalculates it.

For more information, go to Fakespot Blogs: https://www.fakespot.com/posts

Case Study - II: GoKwik

Problem Identiﬁed
● In e-commerce, more than 30% of orders are returned to origin (RTO, i.e. shipped back
to the warehouse) in India.

Target Audience
All e-commerce businesses

Data-driven Solution
● Mostly, CoD orders are converted to
RTO.
● So, analyzing customer behavioural
patterns and disable CoD option for Courtesy: Gokwik
those showing high-risk RTO
behaviour.

For more information, go to GoKwik Blogs: https://www.gokwik.co/blog/

Case Study - III: Intello Labs

Problem Identiﬁed
● One-third of the food produced in the world for human consumption every year gets
lost or wasted.
● Mainly (in some countries) at the early stages of the food value chain.

Target Audience
From growers to packers, from
exporters to food services

Data-driven Solution
Smart, scalable solutions to digitize
food quality, achieve fair pricing and
reduce food wastage. Using AI, ML, and Computer Vision technology

For more information, go to Intello Labs Blogs: https://blogs.intellolabs.com/en

Next lecture
Primer for Deep Learning Contd…
3rd January 2025

Note: Study ﬁrst three chapters of Data Mining and

Machine Learning by Mohammed J. Zaki. Second Edition.

01 - Introduction To Machine Learning
No ratings yet
01 - Introduction To Machine Learning
71 pages
Fall2024 W4995 Lecture1
No ratings yet
Fall2024 W4995 Lecture1
110 pages
Lecture 1
100% (1)
Lecture 1
51 pages
Ad8552 ML Unit Ii
No ratings yet
Ad8552 ML Unit Ii
94 pages
Lecture 1
No ratings yet
Lecture 1
34 pages
SEM 5 Syllabus
No ratings yet
SEM 5 Syllabus
28 pages
Bayesian Statistics Using Python
No ratings yet
Bayesian Statistics Using Python
329 pages
1DataScience MachineLearning AI Syllabus.-1.PDF 20240118 174213 0000
No ratings yet
1DataScience MachineLearning AI Syllabus.-1.PDF 20240118 174213 0000
9 pages
Machine Learning Notes22
No ratings yet
Machine Learning Notes22
45 pages
MLUnit - 1 Share
No ratings yet
MLUnit - 1 Share
162 pages
Mac Unit 3
No ratings yet
Mac Unit 3
65 pages
Lecture 01 - Introduction To AML-Jan24
No ratings yet
Lecture 01 - Introduction To AML-Jan24
66 pages
1 Introduction
No ratings yet
1 Introduction
58 pages
SEng5305-chap-1-Introduction To ML
No ratings yet
SEng5305-chap-1-Introduction To ML
85 pages
MLUnit 1
No ratings yet
MLUnit 1
131 pages
CourseOutline PHD RTML
No ratings yet
CourseOutline PHD RTML
4 pages
ML Cahp 1
No ratings yet
ML Cahp 1
35 pages
Chapter 1
No ratings yet
Chapter 1
62 pages
Lec1 Intro To p556
No ratings yet
Lec1 Intro To p556
29 pages
ML - Lecture - 1 Introduction To ML
No ratings yet
ML - Lecture - 1 Introduction To ML
29 pages
Lecture - 1 Introduction To ML
No ratings yet
Lecture - 1 Introduction To ML
38 pages
Unit 1
No ratings yet
Unit 1
43 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
60 pages
Master Machine Learning in Just 30 Days Version01
No ratings yet
Master Machine Learning in Just 30 Days Version01
25 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
81 pages
ML Notes
No ratings yet
ML Notes
25 pages
Ai&ml 20250109 074314
No ratings yet
Ai&ml 20250109 074314
8 pages
Artificial Intelligence Machine Learning Program Brochure
No ratings yet
Artificial Intelligence Machine Learning Program Brochure
22 pages
Week 1: Python Basics: Class 1: Getting Started With Python
No ratings yet
Week 1: Python Basics: Class 1: Getting Started With Python
6 pages
CAIP Delivery Plan
No ratings yet
CAIP Delivery Plan
7 pages
AI Fellowship Syllabus LATAM
No ratings yet
AI Fellowship Syllabus LATAM
17 pages
CS-871-Lecture 1
No ratings yet
CS-871-Lecture 1
41 pages
Chapter 5 AI
No ratings yet
Chapter 5 AI
40 pages
Lec 01 Introduction
No ratings yet
Lec 01 Introduction
98 pages
AI Fellowship Nepal
No ratings yet
AI Fellowship Nepal
17 pages
Artificial Intelligence & Machine Learning: Post Graduate Program in
No ratings yet
Artificial Intelligence & Machine Learning: Post Graduate Program in
16 pages
PGP Aiml2024
No ratings yet
PGP Aiml2024
22 pages
PGP Machine Learning Brochure
No ratings yet
PGP Machine Learning Brochure
20 pages
CSCE 636: Deep Learning
No ratings yet
CSCE 636: Deep Learning
30 pages
Machine Learning-Updated
No ratings yet
Machine Learning-Updated
4 pages
Artificial Intelligence Machine Learning Program Brochure
No ratings yet
Artificial Intelligence Machine Learning Program Brochure
24 pages
Antern ML002
No ratings yet
Antern ML002
15 pages
AIML V.22 Brochure Newversion22
No ratings yet
AIML V.22 Brochure Newversion22
16 pages
Deep Atlas MLI Syllabus
No ratings yet
Deep Atlas MLI Syllabus
1 page
Unit1-Work Envelope, Workspaceand Full
No ratings yet
Unit1-Work Envelope, Workspaceand Full
13 pages
ML Lesson Plan
No ratings yet
ML Lesson Plan
4 pages
PGP - Unified Brochure
No ratings yet
PGP - Unified Brochure
18 pages
R18B Tech MinorIVYearISemesterTENTATIVESyllabus
No ratings yet
R18B Tech MinorIVYearISemesterTENTATIVESyllabus
22 pages
Lecture1 PDF
No ratings yet
Lecture1 PDF
37 pages
Ai - Introduction: FDP / Short Term Training On Artificial Intelligence & Deep Learning Applications
No ratings yet
Ai - Introduction: FDP / Short Term Training On Artificial Intelligence & Deep Learning Applications
6 pages
Data Science Student Schedule
No ratings yet
Data Science Student Schedule
7 pages
ML and Deep Learning Syllabus
No ratings yet
ML and Deep Learning Syllabus
3 pages
Java ML
No ratings yet
Java ML
7 pages
GreatLearning AI and ML Brochure
No ratings yet
GreatLearning AI and ML Brochure
19 pages
Aiml Online Brochure PDF
No ratings yet
Aiml Online Brochure PDF
23 pages
Syl3 ML
No ratings yet
Syl3 ML
5 pages
Essentials of Deep Learning
No ratings yet
Essentials of Deep Learning
2 pages
Lesson Plan - ML - Spring 2023
No ratings yet
Lesson Plan - ML - Spring 2023
4 pages
Artificial Intelligence Machine Learning Program Brochure
No ratings yet
Artificial Intelligence Machine Learning Program Brochure
24 pages
Week 1 - Chapter 1 - Introduction
No ratings yet
Week 1 - Chapter 1 - Introduction
25 pages
Lahore University of Management Sciences CS 535/EE 514 Machine Learning
No ratings yet
Lahore University of Management Sciences CS 535/EE 514 Machine Learning
3 pages
The Pex: Global State of Process Excellence
100% (1)
The Pex: Global State of Process Excellence
47 pages
AI Facial Recognition System
No ratings yet
AI Facial Recognition System
54 pages
ĐỀ ÔN TẬP CUỐI HỌC KỲ I T ANH 12 2024
No ratings yet
ĐỀ ÔN TẬP CUỐI HỌC KỲ I T ANH 12 2024
29 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
50 pages
Ccs 3350 Artificial Intelligence
No ratings yet
Ccs 3350 Artificial Intelligence
3 pages
Using AI To Detect and Promote Green Skills in Job Market Trends
No ratings yet
Using AI To Detect and Promote Green Skills in Job Market Trends
29 pages
AI Notes Module 1
No ratings yet
AI Notes Module 1
14 pages
ML Imp Ques 2
No ratings yet
ML Imp Ques 2
37 pages
Economics of Artificial Intelligence Implications For The Future of Work
No ratings yet
Economics of Artificial Intelligence Implications For The Future of Work
35 pages
Emotional Robots
100% (1)
Emotional Robots
15 pages
Report
No ratings yet
Report
33 pages
Survey On Large Language Models
No ratings yet
Survey On Large Language Models
52 pages
Tips and Tricks
No ratings yet
Tips and Tricks
5 pages
CHAPTER - 1-3 (Hauwa)
No ratings yet
CHAPTER - 1-3 (Hauwa)
34 pages
Roadmap To GenAi
No ratings yet
Roadmap To GenAi
2 pages
A Review of Traffic Congestion Prediction Using Artificial Intelligence
No ratings yet
A Review of Traffic Congestion Prediction Using Artificial Intelligence
18 pages
AI in Education B1
No ratings yet
AI in Education B1
13 pages
Program CS MS
No ratings yet
Program CS MS
20 pages
Pathfinder Glossary Web Final
No ratings yet
Pathfinder Glossary Web Final
4 pages
Csa4005 Expert-Systems-And-Fuzzy-Logic LT 1.0 6 Csa4005
No ratings yet
Csa4005 Expert-Systems-And-Fuzzy-Logic LT 1.0 6 Csa4005
2 pages
Investigating The Effect of Bd-Craft To Text Detection Algorithms
No ratings yet
Investigating The Effect of Bd-Craft To Text Detection Algorithms
16 pages
Key Track
No ratings yet
Key Track
11 pages
6G Security Paper EUCNC 2024
No ratings yet
6G Security Paper EUCNC 2024
7 pages
Marketing - Strategy - of - Open - AI (2) X
No ratings yet
Marketing - Strategy - of - Open - AI (2) X
4 pages
CNN Text Classification
No ratings yet
CNN Text Classification
12 pages
Two Column Resume Siddharth
No ratings yet
Two Column Resume Siddharth
1 page
Funtoot To Address Indian Education System's One Size Fit All' Challenge
No ratings yet
Funtoot To Address Indian Education System's One Size Fit All' Challenge
2 pages
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
CISSP Domain 1 Study Guide ( Updated 2024 ) With Practice Exam Questions, Quizzes, Flash Cards: CISSP Study Guide - Updated 2024, #1
From Everand
CISSP Domain 1 Study Guide ( Updated 2024 ) With Practice Exam Questions, Quizzes, Flash Cards: CISSP Study Guide - Updated 2024, #1
ADITYA .
No ratings yet