
ml6team/keyphrase-generation-keybart-inspec · Hugging Face

Text2Text Generation · Transformers · PyTorch · midas/inspec · English · bart · keyphrase-generation · arxiv:2112.08547 · License: MIT

Inference API example (Text2Text Generation)

Input:
The contextualized representation of pre-trained language models (PLMs) has been highly successful in Keyphrase Extraction (KPE) tasks, achieving state-of-the-art (SotA) results. However, due to the limited context within which they operate, extracting keyphrases from lengthy documents poses a challenge, as the importance of the keyphrase may rely on long-term dependencies that the PLM is not able to capture. To overcome this limitation, we present an attention expansion mechanism that leverages pre-trained word embeddings to allow the PLM to consider words beyond its contextual boundaries, thereby enhancing the representation of words for KPE. To evaluate the efficacy of our approach, we fine-tuned multiple PLMs on publicly available long document KPE datasets, comparing results with and without our attention expansion mechanism. PLMs with the expansion mechanism consistently outperformed state-of-the-art models, exhibiting significant improvements in their F1 score (a metric that harmoniously combines precision and recall to provide a comprehensive measure of a model's accuracy) across all datasets.

Generated keyphrases: contextualized representation ; pre-trained language models ; keyphrase extraction ;


Evaluation results (self-reported)

F1@M (Present) on Inspec: 0.361
F1@O (Present) on Inspec: 0.329
F1@M (Absent) on Inspec: 0.083
F1@O (Absent) on Inspec: 0.080


🔑 Keyphrase Generation Model: KeyBART-inspec


Keyphrase extraction is a technique in text analysis where you extract the important keyphrases from a document. Thanks to these keyphrases, humans can understand the content of a text very quickly and easily without reading it completely. Keyphrase extraction was originally done primarily by human annotators, who read the text in detail and then wrote down the most important keyphrases. The disadvantage is that if you work with a lot of documents, this process can take a lot of time ⏳.

Here is where Artificial Intelligence 🤖 comes in. Currently, classical machine learning methods that use statistical and linguistic features are widely used for the extraction process. Now, with deep learning, it is possible to capture the semantic meaning of a text even better than with these classical methods. Classical methods look at the frequency, occurrence and order of words in the text, whereas these neural approaches can capture long-term semantic dependencies and the context of words in a text.

📓 Model Description

https://huggingface.co/ml6team/keyphrase-generation-keybart-inspec?text=The+contextualized+representation+of+pre-trained+language+models+%… 2/9
3/13/24, 7:51 AM ml6team/keyphrase-generation-keybart-inspec · Hugging Face

This model uses KeyBART as its base model and fine-tunes it on the Inspec dataset. KeyBART focuses on learning a better representation of keyphrases in a generative setting: it reproduces the keyphrases associated with the input document from a corrupted version of that input, where the corruption consists of token masking, keyphrase masking and keyphrase replacement. This model can already be used without any fine-tuning, but can be fine-tuned if needed. You can find more information about the architecture in this paper.

Kulkarni, Mayank, Debanjan Mahata, Ravneet Arora, and Rajarshi Bhowmik. "Learning
Rich Representation of Keyphrases from Text." arXiv preprint arXiv:2112.08547 (2021).
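
The checkpoint is a standard seq2seq model on the Hugging Face Hub, so as a minimal sketch (not taken from the model card) it can also be loaded with the generic transformers text2text-generation pipeline; the custom pipeline shown under "How To Use" below only adds splitting of the generated string on the separator token.

# Minimal sketch, not from the model card: load the checkpoint with the generic
# text2text-generation pipeline and split the generated string on ";" manually.
from transformers import pipeline

generator = pipeline(
    "text2text-generation",
    model="ml6team/keyphrase-generation-keybart-inspec",
)

abstract = (
    "Keyphrase extraction is a technique in text analysis where you extract "
    "the important keyphrases from a document."
)
generated = generator(abstract)[0]["generated_text"]
keyphrases = [kp.strip() for kp in generated.split(";")]
print(keyphrases)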

✋ Intended Uses & Limitations

🛑 Limitations

This keyphrase generation model is very domain-specific and will perform very well on abstracts of scientific papers. It is not recommended to use this model for other domains, but you are free to test it out.

It only works for English documents.

❓ How To Use

# Model parameters
from transformers import (
    Text2TextGenerationPipeline,
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
)


# Custom pipeline: generates one string of keyphrases per document and splits it
# on the separator token into a list of keyphrases.
class KeyphraseGenerationPipeline(Text2TextGenerationPipeline):
    def __init__(self, model, keyphrase_sep_token=";", *args, **kwargs):
        super().__init__(
            model=AutoModelForSeq2SeqLM.from_pretrained(model),
            tokenizer=AutoTokenizer.from_pretrained(model),
            *args,
            **kwargs
        )
        self.keyphrase_sep_token = keyphrase_sep_token

    def postprocess(self, model_outputs):
        results = super().postprocess(
            model_outputs=model_outputs
        )
        return [
            [
                keyphrase.strip()
                for keyphrase in result.get("generated_text").split(
                    self.keyphrase_sep_token
                )
            ]
            for result in results
        ]


# Load pipeline
model_name = "ml6team/keyphrase-generation-keybart-inspec"
generator = KeyphraseGenerationPipeline(model=model_name)

# Inference
text = """
Keyphrase extraction is a technique in text analysis where you extract the
important keyphrases from a document. Thanks to these keyphrases humans can
understand the content of a text very quickly and easily without reading it
completely. Keyphrase extraction was first done primarily by human annotators,
who read the text in detail and then wrote down the most important keyphrases.
The disadvantage is that if you work with a lot of documents, this process
can take a lot of time.

Here is where Artificial Intelligence comes in. Currently, classical machine
learning methods, that use statistical and linguistic features, are widely used
for the extraction process. Now with deep learning, it is possible to capture
the semantic meaning of a text even better than these classical methods.
Classical methods look at the frequency, occurrence and order of words
in the text, whereas these neural approaches can capture long-term
semantic dependencies and context of words in a text.
""".replace("\n", " ")

keyphrases = generator(text)

print(keyphrases)

# Output
[['keyphrase extraction', 'text analysis', 'keyphrases', 'human annotators', ...]]
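
Because the custom pipeline inherits from Text2TextGenerationPipeline, standard generation arguments can be passed at call time. A small sketch (the parameter values are illustrative choices, not settings from the model card):

# Illustrative only: beam search and a larger output budget can surface more
# keyphrases; these values are assumptions, not recommendations from the card.
keyphrases = generator(
    text,
    num_beams=4,     # beam search instead of greedy decoding
    max_length=64,   # allow a longer generated keyphrase string
)
print(keyphrases)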


📚 Training Dataset

Inspec is a keyphrase extraction/generation dataset consisting of 2000 English scientific papers from the scientific domains of Computers and Control and Information Technology, published between 1998 and 2002. The keyphrases are annotated by professional indexers or editors.

You can find more information in the paper.
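
As a quick way to inspect the data (a minimal sketch; the subset name and column names are the ones used in the preprocessing code below), each sample stores the document as a list of words together with its extractive and abstractive keyphrases:

# Sketch: load the Inspec dataset and look at one test sample.
from datasets import load_dataset

inspec = load_dataset("midas/inspec", "raw")

sample = inspec["test"][0]
print(sample["document"][:20])            # the document is a list of words
print(sample["extractive_keyphrases"])    # keyphrases that appear in the text
print(sample["abstractive_keyphrases"])   # keyphrases that do not appear verbatim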

👷‍♂️ Training Procedure

Training Parameters

Parameter                  Value
Learning Rate              5e-5
Epochs                     15
Early Stopping Patience    1
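
The training script itself is not part of the model card; a minimal sketch of how these parameters could be wired into a Hugging Face Seq2SeqTrainer (everything beyond the three values above is an assumption):

# Minimal sketch only; the actual training setup is not documented in the card.
from transformers import (
    AutoModelForSeq2SeqLM,
    EarlyStoppingCallback,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

model = AutoModelForSeq2SeqLM.from_pretrained("bloomberg/KeyBART")  # base model

training_args = Seq2SeqTrainingArguments(
    output_dir="keyphrase-generation-keybart-inspec",
    learning_rate=5e-5,           # Learning Rate from the table above
    num_train_epochs=15,          # Epochs from the table above
    evaluation_strategy="epoch",  # assumption: evaluate and save once per epoch
    save_strategy="epoch",
    load_best_model_at_end=True,  # required for early stopping
)

trainer = Seq2SeqTrainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_dataset["train"],      # produced by the preprocessing code below
    eval_dataset=tokenized_dataset["validation"],
    callbacks=[EarlyStoppingCallback(early_stopping_patience=1)],  # patience from the table
)
# trainer.train()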

Preprocessing

The documents in the dataset are already preprocessed into lists of words with the corresponding keyphrases. The only thing that still needs to be done is tokenization and joining all keyphrases into one string with a chosen separator (here: ;).

from datasets import load_dataset
from transformers import AutoTokenizer

# Tokenizer
tokenizer = AutoTokenizer.from_pretrained("bloomberg/KeyBART", add_prefix_space=True)

# Dataset parameters
dataset_full_name = "midas/inspec"
dataset_subset = "raw"
dataset_document_column = "document"

keyphrase_sep_token = ";"


def preprocess_keyphrases(text_ids, kp_list):
    # Order the keyphrases by their first occurrence in the (lowercased) text
    # and separate present from absent keyphrases.
    kp_order_list = []
    kp_set = set(kp_list)
    text = tokenizer.decode(
        text_ids, skip_special_tokens=True, clean_up_tokenization_spaces=True
    )
    text = text.lower()
    for kp in kp_set:
        kp = kp.strip()
        kp_index = text.find(kp.lower())
        kp_order_list.append((kp_index, kp))

    kp_order_list.sort()
    present_kp, absent_kp = [], []

    for kp_index, kp in kp_order_list:
        if kp_index < 0:
            absent_kp.append(kp)
        else:
            present_kp.append(kp)
    return present_kp, absent_kp


def preprocess_function(samples):
    processed_samples = {"input_ids": [], "attention_mask": [], "labels": []}
    for i, sample in enumerate(samples[dataset_document_column]):
        input_text = " ".join(sample)
        inputs = tokenizer(
            input_text,
            padding="max_length",
            truncation=True,
        )
        present_kp, absent_kp = preprocess_keyphrases(
            text_ids=inputs["input_ids"],
            kp_list=samples["extractive_keyphrases"][i]
            + samples["abstractive_keyphrases"][i],
        )
        keyphrases = present_kp
        keyphrases += absent_kp

        target_text = f" {keyphrase_sep_token} ".join(keyphrases)

        with tokenizer.as_target_tokenizer():
            targets = tokenizer(
                target_text, max_length=40, padding="max_length", truncation=True
            )
            targets["input_ids"] = [
                (t if t != tokenizer.pad_token_id else -100)
                for t in targets["input_ids"]
            ]
        for key in inputs.keys():
            processed_samples[key].append(inputs[key])
        processed_samples["labels"].append(targets["input_ids"])
    return processed_samples


# Load dataset
dataset = load_dataset(dataset_full_name, dataset_subset)
# Preprocess dataset
tokenized_dataset = dataset.map(preprocess_function, batched=True)

Postprocessing

For the post-processing, you will need to split the string based on the keyphrase
separator.

def extract_keyphrases(examples):
    return [example.split(keyphrase_sep_token) for example in examples]
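
For example (the input strings below are made up for illustration, not actual model output):

# Illustrative usage of extract_keyphrases with keyphrase_sep_token = ";".
generated = ["keyphrase extraction;deep learning", "semantic search;embeddings"]
print(extract_keyphrases(generated))
# [['keyphrase extraction', 'deep learning'], ['semantic search', 'embeddings']]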

📝 Evaluation results

Traditional evaluation methods are precision, recall and F1-score @k,m, where k stands for the first k predicted keyphrases and m for the average number of predicted keyphrases. In keyphrase generation you also look at F1@O, where O stands for the number of ground-truth keyphrases.
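
A toy sketch of how F1@k is computed (the lists and scores below are illustrative, not evaluation data):

# Toy illustration of F1@k; the keyphrase lists are made up.
def f1_at_k(predicted, gold, k):
    topk = predicted[:k]
    matched = len(set(topk) & set(gold))
    if matched == 0:
        return 0.0
    precision = matched / len(topk)
    recall = matched / len(gold)
    return 2 * precision * recall / (precision + recall)

gold = ["keyphrase extraction", "deep learning", "semantic search", "embeddings"]
predicted = ["keyphrase extraction", "nlp", "deep learning", "text", "models"]

print(round(f1_at_k(predicted, gold, 5), 2))          # F1@5 -> 0.44
print(round(f1_at_k(predicted, gold, len(gold)), 2))  # F1@O (O = 4) -> 0.5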

The model achieves the following results on the Inspec test set:

Extractive Keyphrases


Dataset           P@5    R@5    F1@5   P@10   R@10   F1@10   P@M    R@M    F1@M   P@O    R@O    F1@O
Inspec Test Set   0.40   0.37   0.35   0.20   0.37   0.24    0.42   0.37   0.36   0.33   0.33   0.33

Abstractive Keyphrases

Dataset           P@5    R@5    F1@5   P@10   R@10   F1@10   P@M    R@M    F1@M   P@O    R@O    F1@O
Inspec Test Set   0.07   0.12   0.08   0.03   0.12   0.05    0.08   0.12   0.08   0.08   0.12   0.08

🚨 Issues

Please feel free to start discussions in the Community Tab.
