Konuralp
Get To The Point: Summarization
with Pointer-Generator Networks
Improving Abstractive Summarization through Hybrid Copying and Coverage
Problems:
➢ Repetition
➢ Factual inaccuracy
➢ Out-of-vocabulary (OOV) words
Abstractive: generating words from scratch
➢ + Creating more human-like summaries
➢ - Possibility of inaccurate factual details and repetitive text
➢ - Harder to train and fine-tune

Extractive: copying words from the input text
➢ + Easier to implement than abstractive
➢ - Less human-like summaries
What This Paper Proposes
➢ Generating human-like, factually accurate summaries by seamlessly integrating new word generation with direct word copying from the input text
➢ Handling out-of-vocabulary (OOV) words
➢ Reducing repetitive text generation
Basic Definitions
Tokenization:
➢ Tokens are the smallest units of text processed in natural language processing. They can be words, subwords, characters, or sentences
Word Embedding:
➢ Word embeddings are numerical vector representations of words, sentences, or documents that capture their semantic meaning. Words with related context or meaning have embeddings that cluster together
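A minimal Python sketch of these two ideas; the toy vocabulary, whitespace tokenizer, and 4-dimensional random embeddings below are illustrative assumptions, not part of the paper.

```python
import numpy as np

# Toy vocabulary: token -> integer ID (real systems build this from a training corpus)
vocab = {"<unk>": 0, "the": 1, "cat": 2, "sat": 3}

# Embedding matrix: one row per vocabulary entry (random here; learned in practice)
embedding_dim = 4
embeddings = np.random.randn(len(vocab), embedding_dim)

def tokenize(text):
    """Simple whitespace tokenization into word-level tokens."""
    return text.lower().split()

def embed(tokens):
    """Map each token to its vector, falling back to <unk> for unknown words."""
    ids = [vocab.get(tok, vocab["<unk>"]) for tok in tokens]
    return embeddings[ids]            # shape: (num_tokens, embedding_dim)

print(embed(tokenize("The cat sat")).shape)   # -> (3, 4)
```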
Seq2Seq with Attention
➢ An encoder-decoder architecture with a bidirectional LSTM encoder and an LSTM decoder
➢ The encoder processes the input tokens into hidden states
➢ The decoder generates the summary token by token
Role of attention:
➢ At each decoder time step t, attention weights are computed over the input, helping the decoder focus on the relevant parts of the source (see the sketch below)
Limitations:
➢ Out-of-vocabulary (OOV) words cannot be produced and appear as UNK
➢ Prone to repeating itself
➢ Liable to reproduce factual details inaccurately
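A minimal NumPy sketch of the attention step described above, following the paper's formulation e_i^t = v^T tanh(W_h h_i + W_s s_t + b); the array shapes, weight names, and random toy inputs are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attention(h, s_t, W_h, W_s, v, b):
    """Compute the attention distribution and context vector for one decoder step.

    h:   encoder hidden states, shape (src_len, hid_dim)
    s_t: decoder state at step t, shape (dec_dim,)
    """
    # e_i = v^T tanh(W_h h_i + W_s s_t + b) for every source position i
    scores = np.tanh(h @ W_h.T + W_s @ s_t + b) @ v
    a_t = softmax(scores)        # attention distribution over source tokens
    context = a_t @ h            # context vector h*_t: weighted sum of encoder states
    return a_t, context

# Tiny usage example with random weights (attn_dim is an arbitrary choice)
src_len, hid_dim, dec_dim, attn_dim = 6, 8, 8, 10
rng = np.random.default_rng(0)
h = rng.normal(size=(src_len, hid_dim))
s_t = rng.normal(size=dec_dim)
W_h = rng.normal(size=(attn_dim, hid_dim))
W_s = rng.normal(size=(attn_dim, dec_dim))
v = rng.normal(size=attn_dim)
b = rng.normal(size=attn_dim)
a_t, context = attention(h, s_t, W_h, W_s, v, b)
print(a_t.sum(), context.shape)   # attention weights sum to 1; context has shape (hid_dim,)
```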
Encoders and Decoders
Encoders
➢ Process the input tokens one by one
➢ Convert each word into a vector representation using a bidirectional LSTM
➢ These vectors, called hidden states, encode each word's meaning in context
Context Vector
➢ A weighted combination of the encoder hidden states
➢ Created by applying attention over the encoder outputs
➢ Tells the decoder which parts of the input to focus on
Attention distribution
➢ A set of scores showing how much focus to put on each word in the input
Vocabulary distribution
➢ Gives the probability of every word in the vocabulary being the next word (see the sketch after this slide)
➢ If the model decides to generate rather than copy, the word with the highest probability is selected
Decoders
➢ Generate the summary one word at a time
➢ Use the previous word, the context vector, and the previous decoder hidden state to predict the next word
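A minimal NumPy sketch of the decoder's vocabulary distribution, P_vocab = softmax(V'(V[s_t; h*_t] + b) + b'); the two-layer form follows the paper, while the weight names and toy sizes below are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def vocab_distribution(s_t, context, V1, b1, V2, b2):
    """P_vocab = softmax(V2 (V1 [s_t; h*_t] + b1) + b2)."""
    features = np.concatenate([s_t, context])    # [s_t; h*_t]
    hidden = V1 @ features + b1
    return softmax(V2 @ hidden + b2)             # shape: (vocab_size,)

# Toy usage: decoder state and context vector of size 8, fixed vocabulary of 20 words
rng = np.random.default_rng(1)
dec_dim, hid_dim, vocab_size = 8, 8, 20
s_t = rng.normal(size=dec_dim)
context = rng.normal(size=hid_dim)
V1 = rng.normal(size=(hid_dim, dec_dim + hid_dim))
b1 = rng.normal(size=hid_dim)
V2 = rng.normal(size=(vocab_size, hid_dim))
b2 = rng.normal(size=vocab_size)
p_vocab = vocab_distribution(s_t, context, V1, b1, V2, b2)
print(p_vocab.argmax(), p_vocab.sum())   # most probable next-word ID; probabilities sum to 1
```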
Seq2Seq
Pointer-Generator Network
A Hybrid Approach:
➢ At each decoder step, a generation probability p_gen decides whether to generate a new word from the vocabulary or to copy a word directly from the input text (see the sketch below)
This solves:
➢ Out-of-vocabulary (OOV) words, since any source word can be copied
➢ Factual inaccuracy, since names and numbers can be reproduced verbatim from the source
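A minimal sketch of the mixing step, following the paper's final distribution P(w) = p_gen · P_vocab(w) + (1 − p_gen) · Σ_{i: w_i = w} a_i^t; the helper names and the extended-vocabulary bookkeeping below are illustrative assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def generation_probability(context, s_t, x_t, w_h, w_s, w_x, b_ptr):
    """How p_gen is computed: p_gen = sigmoid(w_h . h*_t + w_s . s_t + w_x . x_t + b_ptr)."""
    return sigmoid(w_h @ context + w_s @ s_t + w_x @ x_t + b_ptr)

def final_distribution(p_vocab, a_t, src_ids, p_gen, extended_vocab_size):
    """Mix the vocabulary distribution with the copy (attention) distribution.

    src_ids: position i -> ID of the i-th source token in the *extended* vocabulary,
             where in-article OOV words get temporary IDs beyond the fixed vocabulary.
    """
    p_final = np.zeros(extended_vocab_size)
    p_final[: len(p_vocab)] = p_gen * p_vocab            # generation part
    for i, token_id in enumerate(src_ids):
        p_final[token_id] += (1.0 - p_gen) * a_t[i]      # copy part
    return p_final

# Toy usage: 4 fixed-vocab words plus 1 in-article OOV word (ID 4), 3 source tokens
p_vocab = np.array([0.1, 0.4, 0.3, 0.2])
a_t = np.array([0.6, 0.3, 0.1])
src_ids = [2, 4, 1]                     # the second source token is the OOV word
p_final = final_distribution(p_vocab, a_t, src_ids, p_gen=0.7, extended_vocab_size=5)
print(p_final.round(3), p_final.sum())  # OOV word (ID 4) now has non-zero probability
```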
Pointer-Generator Network
Coverage Mechanism
Role of coverage:
➢ A coverage vector, the sum of the attention distributions over all previous decoder steps, tracks how much attention each source word has received so far
➢ The coverage vector is fed back into the attention mechanism and penalized by a coverage loss, so the decoder does not keep attending to the same words again and again (see the sketch below)
This solves:
➢ Reduced repetition in the generated summaries
➢ Better tracking of which parts of the source have already been covered
➢ Improved ROUGE & METEOR scores
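A minimal NumPy sketch of the coverage bookkeeping, following c^t = Σ_{t'&lt;t} a^{t'} and covloss_t = Σ_i min(a_i^t, c_i^t); in the paper the coverage vector also enters the attention scores, but here only the accumulation and loss are shown, and the uniform toy attention distributions are illustrative assumptions.

```python
import numpy as np

src_len = 5
coverage = np.zeros(src_len)        # c^0 = 0: no source word has been attended to yet
total_cov_loss = 0.0

# Toy attention distributions for three decoder steps (uniform here just for illustration)
attention_steps = [np.full(src_len, 1.0 / src_len) for _ in range(3)]

for a_t in attention_steps:
    # Coverage loss penalizes re-attending: covloss_t = sum_i min(a_i^t, c_i^t)
    total_cov_loss += np.minimum(a_t, coverage).sum()
    # Coverage vector accumulates past attention: c^{t+1} = c^t + a^t
    coverage = coverage + a_t

print(coverage, total_cov_loss)
```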
Results
➢ This graph shows how the coverage mechanism affects repetition. Without it, the model keeps repeating words and phrases; with coverage, repetition decreases and the summaries become much more fluent
Results
➢ This graph shows how much of the generated summaries is novel (new) rather than copied from the article. Compared to the reference summaries, the final pointer-generator model copies more and produces fewer novel phrases and sentences
Results