
3 Idiots' Approach for the Display Advertising Challenge
Yu-Chin Juan, Yong Zhuang, and Wei-Sheng Chin
NTU CSIE MLGroup

What Does This Competition Challenge Us to Do?

Predict the click probabilities of impressions.


Dataset

Label   I1   I2   ...   I13    C1         C2         ...   C26
1       3    20   ...   2741   68fd1e64   80e26c9b   ...   4cf72387
0       7    91   ...   1157   3516f6e6   cfc86806   ...   796a1a2e
0       12   73   ...   1844   05db9164   38a947a1   ...   5d93f8ab

#Train: 45M
#Test: 6M
#Features after one-hot encoding: 33M

Evaluation

$$\mathrm{logloss} = -\frac{1}{L}\sum_{i=1}^{L}\big[\,y_i \log \hat{y}_i + (1 - y_i)\log(1 - \hat{y}_i)\,\big],$$

where $L$ is the number of instances, $y_i$ is the true label (0 or 1), and $\hat{y}_i$ is the predicted probability.
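For reference, a minimal Python sketch of this metric (the clipping constant eps is an assumption to keep the logarithms finite; the competition's exact implementation may differ):

```python
import numpy as np

def logloss(y_true, y_pred, eps=1e-15):
    # Clip predictions away from 0 and 1 so the logarithms stay finite.
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.clip(np.asarray(y_pred, dtype=float), eps, 1 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

print(logloss([1, 0, 0], [0.9, 0.2, 0.1]))  # roughly 0.14
```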



These slides introduce our approach, which achieves 0.44488 and 0.44479 on the public and private leaderboards, respectively.

Flowchart

[Pipeline: CSV → Pre-A → GBDT → Pre-B → FFM → Calib. → Rst (final result)]

    Pre-A:  nnz = 13-39,  feat = 39
    GBDT:   nnz = 30,     feat = 30 × 2^7
    Pre-B:  nnz = 69,     feat = 10^6

nnz means the number of non-zero elements of each impression; feat represents
the size of the feature space.

Preprocessing-A

Purpose: generate features for GBDT.


All numerical data are included. (13 features)
Categorical features (after one-hot encoding) that appear more
than 4 million times are also included. (26 features)
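A rough pandas sketch of this selection (the column names I1-I13 and C1-C26 follow the dataset slide; bucketing infrequent categorical values into a single "other" value before encoding is an assumption, not the authors' code):

```python
import pandas as pd

def preprocess_a(df, threshold=4_000_000):
    num_cols = [f"I{i}" for i in range(1, 14)]   # 13 numerical features
    cat_cols = [f"C{i}" for i in range(1, 27)]   # 26 categorical features

    out = df[num_cols].fillna(0).copy()          # keep all numerical data
    for c in cat_cols:
        counts = df[c].value_counts()
        frequent = counts[counts > threshold].index
        # Keep only values frequent enough to one-hot encode; bucket the rest.
        out[c] = df[c].where(df[c].isin(frequent), other="other")
    return out
```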


Gradient Boosting Decision Tree (GBDT)


Purpose: generate GBDT features.


We use trees in GBDT to generate features.
30 trees with depth 7 are used.
30 features are generated for each impression.
This approach was proposed by Xinran He et al. at Facebook.

Gradient Boosting Decision Tree (GBDT)


Example: Assume that we have already trained a GBDT with 3 trees of depth 2.
We feed an impression x into these trees. The first tree assigns x to node 4, the
second to node 7, and the third to node 6. We then generate the features 1:4 2:7 3:6 for
this impression.

[Figure: impression x is fed into the three depth-2 trees; it falls into leaf node 4
of the first tree, node 7 of the second, and node 6 of the third, producing the
features 1:4, 2:7, and 3:6.]
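A rough sketch of this leaf-index encoding, with scikit-learn's GradientBoostingClassifier standing in for the authors' own GBDT implementation and random placeholder data in place of the Preprocessing-A features:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

# Placeholder data; in practice X holds the Preprocessing-A features.
X = np.random.rand(1000, 39)
y = np.random.randint(0, 2, size=1000)

gbdt = GradientBoostingClassifier(n_estimators=30, max_depth=7)
gbdt.fit(X, y)

# apply() returns the index of the leaf reached in every tree:
# shape (n_samples, n_estimators, 1) for binary classification.
leaves = gbdt.apply(X)[:, :, 0]

# Encode each impression as "tree:leaf" tokens, e.g. "1:4 2:7 3:6".
gbdt_features = [
    " ".join(f"{t + 1}:{int(leaf)}" for t, leaf in enumerate(row))
    for row in leaves
]
```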

Preprocessing-B


Purpose: generate features for FFM.


Numerical features (I1-I13) with values greater than 2 are transformed by
$v \leftarrow \lfloor \log(v)^{2} \rfloor$.
Categorical features (C1-C26) that appear fewer than 10 times are
transformed into a special value.
GBDT features are directly included.
These three groups of features are hashed into a 1M-dimensional space
by the hashing trick.
Each impression has 13 (numerical) + 26 (categorical) + 30
(GBDT) = 69 features.
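A minimal sketch of the first two transformations (the natural logarithm, the treatment of missing values, and the name of the special token are assumptions; the authors' preprocessing scripts may differ):

```python
import math

def transform_numerical(v):
    # I1-I13: values greater than 2 become floor(log(v)^2).
    if v in ("", None):
        return ""                      # leave missing values as-is
    v = int(v)
    return int(math.floor(math.log(v) ** 2)) if v > 2 else v

def transform_categorical(value, count, min_count=10):
    # C1-C26: values seen fewer than 10 times collapse to one special token.
    return value if count >= min_count else "rare"
```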

Hashing Trick

Each feature string is passed through a hash function, and the hash value
modulo 10^6 gives the feature index.

text          hash value                mod 10^6 (feature index)
I1:3          739920192382357839297     839297
C1-68fd1e64   839193251324345167129     167129
GBDT1:173     923490878437598392813     392813
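A sketch of the same idea with Python's hashlib (the slide does not specify which hash function was used, so the indices below will not match the example values above):

```python
import hashlib

def hashed_index(text, dim=10**6):
    # Map a feature string such as "C1-68fd1e64" to an index in [0, dim).
    digest = hashlib.md5(text.encode("utf-8")).hexdigest()
    return int(digest, 16) % dim

for token in ("I1:3", "C1-68fd1e64", "GBDT1:173"):
    print(token, "->", hashed_index(token))
```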

Field-aware Factorization Machine (FFM)


For the details of FFM, please check the following slides:


http://www.csie.ntu.edu.tw/~r01922136/slides/ffm.pdf

Calibration

Purpose: calibrate the final result.


The average CTRs on the public / private leaderboards are
0.2632 and 0.2627, respectively.
The average CTR of our submission is 0.2663.
There is a gap, so we subtract 0.003 from every prediction, and
the logloss is reduced by around 0.0001.
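A tiny sketch of this constant shift (the lower clipping bound eps is an assumption so the calibrated values stay valid probabilities):

```python
import numpy as np

def calibrate(preds, shift=0.003, eps=1e-6):
    # Subtract the constant gap and keep predictions inside (0, 1).
    return np.clip(np.asarray(preds) - shift, eps, 1 - eps)
```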


Running Time


Environment: a workstation with two 6-core CPUs.

All processes are parallelized.

Process       Time (min.)   Memory (GB)
Pre-A         8             0
GBDT          29            15
Pre-B         38            0
FFM           100           16
Calibration   1             0
Total         176

Comparison Among Different Methods

Method                     Public    Private
LR-Poly2                   0.44984   0.44954
FFM                        0.44613   0.44598
FFM + GBDT                 0.44497   0.44483
FFM + GBDT (v2)            0.44474   0.44462
FFM + GBDT + calib.        0.44488   0.44479
FFM + GBDT + calib. (v2)   0.44461   0.44449

v2: 50 trees and 8 latent factors
