LLM Aiml

Transformer models have achieved state-of-the-art performance on NLP tasks. For code summarization, approaches include using encoder-decoder frameworks to encode code into a hidden space and decode into text, graph neural networks to model code structure, and tree-LSTMs to understand code hierarchy. Challenges include handling diverse coding styles and complexity, which may be addressed through larger training datasets, more advanced techniques, and human feedback mechanisms.


Literature Review:

In the NLP domain, models based on Transformer architectures have shown
state-of-the-art performance on a broad set of NLP tasks. These models can be
categorized into three groups: encoder-only models such as BERT, RoBERTa, and
ELECTRA; decoder-only models such as GPT; and encoder-decoder models such as
MASS, BART, and T5.
On the other hand, pre-training on programming languages is a nascent field in
which recent work attempts to extend NLP pre-training methods to source code.
Two pioneering models in this field are CuBERT and CodeBERT. CuBERT employs
BERT's powerful masked language modeling objective to derive generic
code-specific representations, and CodeBERT further adds a replaced token
detection task to learn NL-PL (natural language-programming language)
cross-modal representations.
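To make the masked language modeling objective concrete, the following is a
minimal sketch of how code tokens can be masked for pre-training. The
whitespace tokenizer, the 15% masking ratio, and the mask_code_tokens helper
are illustrative assumptions, not CuBERT's actual setup.

import random

MASK_TOKEN = "[MASK]"
MASK_RATIO = 0.15  # fraction of tokens hidden from the model (assumption)

def mask_code_tokens(code: str, seed: int = 0):
    """Return (masked_tokens, labels); labels hold the original masked tokens."""
    rng = random.Random(seed)
    tokens = code.split()              # stand-in for a real code tokenizer
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < MASK_RATIO:
            masked.append(MASK_TOKEN)  # the model must recover the original token
            labels.append(tok)
        else:
            masked.append(tok)
            labels.append(None)        # no loss on unmasked positions
    return masked, labels

print(mask_code_tokens("def add ( a , b ) : return a + b"))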
Despite these advancements, there are still challenges in the field of autonomous
code summarization. One of the main challenges is the complexity of code and the
difficulty of understanding the semantics of different programming constructs.

Approaches for Autonomous Code Summarization:


Encoder-Decoder Framework: As described earlier, this is a common approach in
which the code is encoded into a hidden space and then decoded into the natural
language space. However, this approach may suffer from the limitation of
considering only the sequential content of the code, ignoring the tree
structure, which is also critical for code summarization.
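A minimal sketch of this encode-then-decode flow, using the Hugging Face
transformers library; the Salesforce/codet5-base-multi-sum checkpoint is an
assumption, and any pre-trained seq2seq checkpoint could be substituted. Note
that the code is fed in purely as a token sequence, which is exactly the
limitation mentioned above.

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

checkpoint = "Salesforce/codet5-base-multi-sum"  # assumed summarization checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

code = "def add(a, b):\n    return a + b"

# Encoder side: map the code token sequence into a hidden representation.
inputs = tokenizer(code, return_tensors="pt", truncation=True)

# Decoder side: generate natural-language tokens conditioned on that representation.
summary_ids = model.generate(**inputs, max_length=32, num_beams=4)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))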
Graph Neural Networks (GNN): Some studies propose using Graph Neural Networks
to capture the structural information in the Abstract Syntax Tree (AST) of the
code. This approach can handle the hierarchical nature of the code and might
capture its semantic meaning better.
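As a sketch of how an AST can be turned into graph inputs for a GNN, the
snippet below uses Python's built-in ast module to produce a node list and a
parent-child edge list. Using node-type names as features and the ast_to_graph
helper are illustrative assumptions; a real model would also embed identifiers
and literals and feed the resulting graph to a GNN library.

import ast

def ast_to_graph(source: str):
    """Convert Python source into (node_features, edges) for a GNN."""
    tree = ast.parse(source)
    nodes, edges = [], []

    def visit(node, parent_id=None):
        node_id = len(nodes)
        nodes.append(type(node).__name__)       # node "feature": AST node type
        if parent_id is not None:
            edges.append((parent_id, node_id))  # directed parent -> child edge
        for child in ast.iter_child_nodes(node):
            visit(child, node_id)

    visit(tree)
    return nodes, edges

nodes, edges = ast_to_graph("def add(a, b):\n    return a + b")
print(nodes)
print(edges)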
Tree-LSTM: Another approach is to use a Tree-LSTM, which can handle the
hierarchical structure of the code more effectively. This approach treats each
node in the AST as a unit of LSTM computation, composing a node's state from
the states of its children, which allows the model to understand the code in a
more structured manner.
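A compact sketch of a Child-Sum Tree-LSTM cell in PyTorch, following the
standard Tai et al. formulation; the dimensions, the ChildSumTreeLSTMCell name,
and the way AST nodes would be embedded are assumptions, and a full model would
apply this cell bottom-up over the whole AST.

import torch
import torch.nn as nn

class ChildSumTreeLSTMCell(nn.Module):
    """Combine one AST node's embedding with its children's LSTM states."""

    def __init__(self, in_dim: int, hid_dim: int):
        super().__init__()
        self.W_iou = nn.Linear(in_dim, 3 * hid_dim)              # input/output/update gates
        self.U_iou = nn.Linear(hid_dim, 3 * hid_dim, bias=False)
        self.W_f = nn.Linear(in_dim, hid_dim)                     # one forget gate per child
        self.U_f = nn.Linear(hid_dim, hid_dim, bias=False)

    def forward(self, x, child_h, child_c):
        # x: (in_dim,) node embedding; child_h, child_c: (num_children, hid_dim)
        h_sum = child_h.sum(dim=0)
        i, o, u = torch.chunk(self.W_iou(x) + self.U_iou(h_sum), 3)
        i, o, u = torch.sigmoid(i), torch.sigmoid(o), torch.tanh(u)
        f = torch.sigmoid(self.W_f(x) + self.U_f(child_h))        # (num_children, hid_dim)
        c = i * u + (f * child_c).sum(dim=0)
        h = o * torch.tanh(c)
        return h, c

# Usage sketch: a leaf node has no children, so the child tensors are empty.
cell = ChildSumTreeLSTMCell(in_dim=16, hid_dim=32)
h, c = cell(torch.randn(16), torch.zeros(0, 32), torch.zeros(0, 32))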
Neural Attention Models: Some models utilize neural attention mechanisms to
focus on important parts of the code when generating the summary. The attention
mechanism assigns weights to different parts of the code based on their
importance.
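The idea can be illustrated with a few lines of scaled dot-product attention
over code-token vectors; the toy dimensions and random embeddings below are
assumptions purely for illustration.

import torch
import torch.nn.functional as F

torch.manual_seed(0)
d = 8                                  # toy hidden size (assumption)
code_tokens = torch.randn(5, d)        # 5 encoded code-token vectors (stand-ins)
query = torch.randn(1, d)              # current decoder state

scores = query @ code_tokens.T / d ** 0.5  # similarity of the query to each token
weights = F.softmax(scores, dim=-1)        # attention weights, summing to 1
context = weights @ code_tokens            # weighted summary of the code tokens

print(weights)  # which code tokens the decoder "focuses" on for this step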
Implementation Strategy (using CodeT5):
1. For a given piece of source code, generate its AST.
2. Pass the root of the AST to a summarize_function routine.
3. Initialise a CodeT5 model and build a pipeline for the summarization task.
4. Traverse the AST and extract the functions using a parser.
5. Use the CodeT5 summarization pipeline to generate a summary for each
extracted function.
A minimal sketch of these steps is given below.
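The sketch assumes Python source as input, the built-in ast module as the
parser, and the Hugging Face transformers summarization pipeline with a CodeT5
checkpoint; the checkpoint name and the helper function names are assumptions.

import ast
from transformers import pipeline

CHECKPOINT = "Salesforce/codet5-base-multi-sum"  # assumed CodeT5 summarization checkpoint

def extract_functions(source: str):
    """Steps 1, 2 and 4: parse the source into an AST and pull out each function's text."""
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            yield node.name, ast.get_source_segment(source, node)

def summarize_source(source: str):
    """Steps 3 and 5: build the CodeT5 pipeline and summarize each extracted function."""
    summarizer = pipeline("summarization", model=CHECKPOINT)
    return {
        name: summarizer(body, max_length=32)[0]["summary_text"]
        for name, body in extract_functions(source)
    }

print(summarize_source("def add(a, b):\n    return a + b\n"))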

Future Challenges/Solutions:
Future challenges in the field of autonomous code summarization include the following:
1. Handling Diverse Coding Styles: Different programmers have different coding
styles, which can make code difficult to read and summarise. Future
models should be able to handle code written in a variety of styles.
2. Increasing Code Complexity: Code can become increasingly complex as
software development practices evolve. Future models will need to be able to
understand and summarise complex code structures, including recursion and
advanced data structures.

Possible solutions to these challenges could include:


1. Larger Datasets: Training models on larger datasets could help them better
understand and handle diverse coding styles.
2. Advancements in Deep Learning and the Evolution of LLMs: With evolving
techniques such as Mixture of Experts (MoE) and further advancements in deep
learning and computing power, the ability of models to understand and
summarise code could improve.
3. Feedback Mechanisms (RLHF): Reinforcement Learning from Human
Feedback (RLHF) is a promising approach to tackling the challenges in
autonomous code summarization. It combines reinforcement
learning with human feedback to improve the quality of generated summaries.

Credits:
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code
Understanding and Generation.
A Transformer-based Approach for Source Code Summarization
