
LongCoder: Open-Source Model for Code Completion with Long Code Input

Introduction

Code completion is a task that aims to generate the next token or statement given a partial code input. It is widely used in modern integrated development environments (IDEs) and code editors to assist programmers in writing code. Code completion can help programmers save time, avoid typos, and discover new APIs or libraries.

However, most existing code completion models are based on standard Transformer models, which have limitations in handling long code input. Transformer models use a self-attention mechanism to compute the relevance between every pair of tokens in the input sequence. This mechanism has two drawbacks: (1) it has quadratic complexity with respect to the sequence length, which makes it computationally expensive and memory-intensive for long sequences; (2) it treats every token equally, which may introduce noise or redundancy for long sequences.
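To see why the quadratic cost matters at long lengths, here is a quick back-of-the-envelope comparison. The 4,096-token input matches LongCoder's maximum length mentioned below; the 512-token window is an illustrative choice for this article, not necessarily the paper's setting:

```python
# Back-of-the-envelope comparison of attention costs (illustrative only).
seq_len = 4096
window = 512  # hypothetical local window size, chosen for illustration

full_attention_scores = seq_len * seq_len   # every token attends to every token
sliding_window_scores = seq_len * window    # every token attends to a local window

print(f"full self-attention: {full_attention_scores:,} pair scores")   # 16,777,216
print(f"sliding window:      {sliding_window_scores:,} pair scores")   # 2,097,152
print(f"reduction:           {full_attention_scores / sliding_window_scores:.0f}x")  # 8x
```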

To overcome these limitations, researchers from Microsoft Research Asia and the University of California San Diego proposed LongCoder, a long-range pre-trained language model for code completion.

What is LongCoder?

LongCoder is a sparse Transformer model that can handle long code input for code completion tasks and capture both local and global information. It employs a sliding window mechanism for self-attention and introduces two types of globally accessible tokens, bridge tokens and memory tokens, to improve performance and efficiency.

Key Features of LongCoder

LongCoder has several key features that make it a novel and effective
model for code completion tasks.

1. LongCoder can handle long code input of up to 4,096 tokens, far longer than previous models, which typically handle only up to 512 tokens.
2. LongCoder can capture both local and global information in the code using a sliding window mechanism and bridge tokens. It can also memorize important statements using memory tokens.
3. LongCoder is pre-trained on a large-scale corpus of Python code from GitHub repositories with a left-to-right (causal) language modeling objective, so it can leverage the general knowledge and syntax of Python code learned during pre-training.
4. LongCoder can be fine-tuned on specific code completion tasks using different datasets, allowing it to adapt to different domains and scenarios of code completion.
5. LongCoder achieves superior performance on code completion tasks compared to previous models while maintaining comparable computational efficiency during inference.


Capabilities/Use Cases of LongCoder

LongCoder can be used for various code completion tasks, such as:

● Token-level code completion: given a partial code input, generate the next token that is most likely to follow.
● Statement-level code completion: given a partial code input, generate the next statement that is most likely to follow (the short sketch after this list illustrates the difference between the two granularities).
● Code refinement: given a partial or incorrect code input, generate the correct or improved code output.
● Code suggestion: given a partial or incomplete code input, generate multiple possible code outputs that can complete or extend the input.
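To make the first two granularities concrete, here is a purely hypothetical illustration; the prefix and completions are invented for this article and are not actual LongCoder output:

```python
# Hypothetical illustration of completion granularities (not real model output).
partial_code = (
    "def area(radius):\n"
    "    return 3.14159 * radius *"
)

# Token-level completion predicts only the next token:
#     "radius"
#
# Statement-level completion predicts the rest of the statement, yielding:
#     "return 3.14159 * radius * radius"
```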

LongCoder can also be used for other related tasks, such as:

● Code summarization: given a code input, generate a natural-language summary that describes its functionality or purpose.
● Code documentation: given a code input, generate natural-language documentation that explains its usage or parameters.
● Code generation: given a natural-language input, generate a code output that implements its functionality or logic.

How does LongCoder work?

LongCoder is a model for code completion that can handle long code input. It is a decoder-only sparse Transformer: the model encodes the input code tokens into hidden states and generates output code tokens autoregressively, conditioning on those hidden states and the previously generated tokens.

Each sparse Transformer block combines three components: (1) self-attention with a sliding window; (2) globally accessible bridge tokens and memory tokens that participate in the same attention; (3) a feed-forward network.


In the sliding window self-attention, each token attends only to a fixed-size window of nearby tokens rather than to the entire sequence. This reduces the computation and memory cost of self-attention from quadratic to linear in the sequence length, while preserving the local information around each token.
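As a minimal sketch of the idea, the mask below lets each position attend only to itself and the few positions before it; the tiny window size is chosen for readability, and the real model uses a much larger one:

```python
import torch

# Causal sliding-window attention mask: mask[i, j] is True when query
# position i may attend to key position j.
def sliding_window_mask(seq_len: int, window: int) -> torch.Tensor:
    i = torch.arange(seq_len).unsqueeze(1)  # query positions (column vector)
    j = torch.arange(seq_len).unsqueeze(0)  # key positions (row vector)
    causal = j <= i                 # never attend to future tokens
    local = (i - j) < window        # only the most recent `window` tokens
    return causal & local

print(sliding_window_mask(seq_len=8, window=4).int())
```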

Bridge tokens and memory tokens capture the global information that the local window misses. They are special, globally accessible tokens that can attend to, and be attended by, the other tokens in the input code. Bridge tokens are inserted throughout the sequence to aggregate local information and facilitate global interaction, while memory tokens highlight important statements, such as imports and class or function definitions, that may be referenced much later.
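Continuing the sketch above, the helper below marks a few positions as globally accessible; the positions are hypothetical, and for readability the sketch ignores the causal restriction a real decoder would still enforce on these tokens:

```python
import torch

# Make selected positions globally accessible, as bridge and memory
# tokens are. Builds on sliding_window_mask() from the previous sketch.
def add_global_tokens(mask: torch.Tensor, global_positions: list[int]) -> torch.Tensor:
    mask = mask.clone()
    for p in global_positions:
        mask[:, p] = True  # every token can attend to the global token
        mask[p, :] = True  # the global token can attend to every token
    return mask

local_mask = sliding_window_mask(seq_len=8, window=4)
print(add_global_tokens(local_mask, global_positions=[0, 5]).int())
```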

The feed-forward network applies a non-linear transformation to each token after the attention sub-layer, helping the model learn richer features from the token representations. As in standard Transformers, each block also uses layer normalization and residual connections, and the model ends with an output layer that predicts the next token.
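For orientation, this is what a standard position-wise feed-forward block looks like; the hidden sizes are illustrative defaults, not LongCoder's actual configuration:

```python
import torch.nn as nn

# Position-wise feed-forward block, applied independently to each token.
class FeedForward(nn.Module):
    def __init__(self, d_model: int = 768, d_ff: int = 3072):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(d_model, d_ff),
            nn.GELU(),                 # non-linear transformation
            nn.Linear(d_ff, d_model),
        )

    def forward(self, x):
        return self.net(x)
```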

Performance evaluation with other models

(performance comparison table; source - https://arxiv.org/pdf/2306.14893.pdf)

On the LCC benchmark (shown in the table above), the sparse models (Longformer and LongCoder) outperform the non-sparse models on both the Exact Match (EM) and Edit Similarity metrics, while maintaining a similar inference speed. Here, Longformer refers to a version of UniXcoder modified to use a sliding window attention mechanism, which helps the model process longer code input faster and more accurately. This confirms the usefulness of sliding window attention for code completion tasks.

LongCoder improves upon Longformer by adding bridge tokens and memory tokens, which help the model capture more global information and important statements in the code. LongCoder improves the EM score by 0.8%–1.3% and the Edit Similarity score by 4.0%–6.0% compared to other sparse models (as shown in the table above), demonstrating the effectiveness of the proposed tokens.

(CodeXGLUE results table; source - https://arxiv.org/pdf/2306.14893.pdf)

LongCoder also achieves the best performance on the CodeXGLUE code completion benchmarks (shown in the table above), even though these benchmarks involve much shorter code inputs. Holding an advantage over UniXcoder even in this short-input setting suggests that LongCoder has potential for more complex scenarios as well.

How to access and use LongCoder?

LongCoder is available on GitHub, where you can find the codes and
data for pre-training and fine-tuning LongCoder, as well as the
instructions for running the experiments.

LongCoder is also available on Hugging Face, where you can load and
use LongCoder using PyTorch Transformers library. You can also use
LongCoder for feature extraction or fine-tuning on your own datasets.
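As a minimal sketch, the snippet below loads the checkpoint named in the Source section for feature extraction, assuming it loads through the standard Transformers Auto classes; for the authors' exact fine-tuning and generation scripts, see the GitHub repo:

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Load the pre-trained LongCoder checkpoint from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained("microsoft/longcoder-base")
model = AutoModel.from_pretrained("microsoft/longcoder-base")

code = "def fibonacci(n):\n    if n <= 1:\n        return n\n"
inputs = tokenizer(code, return_tensors="pt")

# Extract contextual token representations without computing gradients.
with torch.no_grad():
    outputs = model(**inputs)

print(outputs.last_hidden_state.shape)  # (batch, num_tokens, hidden_size)
```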


LongCoder is open-source and free to use for research purposes. However, if you want to use LongCoder for commercial purposes, you need to obtain a license from Microsoft.

If you are interested in learning more about the LongCoder model, all relevant links are provided under the 'Source' section at the end of this article.

Limitations

LongCoder is a novel and effective model for code completion tasks, but it also has some limitations that need to be addressed in future work.

● LongCoder is currently only pre-trained on Python code, which may limit its generalization to other programming languages. It would be interesting to explore how to pre-train LongCoder on multiple languages or cross-lingual code corpora.
● LongCoder uses a heuristic rule to select memory tokens based on keywords (a simplified version is sketched after this list), which may not capture all the important statements in the code. More sophisticated selection methods based on semantic or syntactic analysis would be worth exploring.
● LongCoder uses a fixed number of bridge tokens and memory tokens, which may not adapt well to code inputs of different lengths or complexities. Dynamically adjusting the number or position of these tokens based on the input context would be another interesting direction.
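For a sense of what such a keyword heuristic looks like, here is a simplified, hypothetical version; the paper describes selecting important statements such as imports and class or function definitions, but the exact rule below is invented for illustration:

```python
# Hypothetical keyword-based heuristic for choosing "memory" statements.
MEMORY_KEYWORDS = ("import ", "from ", "def ", "class ")

def select_memory_lines(code: str) -> list[int]:
    """Return the indices of lines that look worth memorizing."""
    return [
        idx
        for idx, line in enumerate(code.splitlines())
        if line.lstrip().startswith(MEMORY_KEYWORDS)
    ]

sample = (
    "import math\n"
    "\n"
    "class Circle:\n"
    "    def area(self, r):\n"
    "        return math.pi * r * r\n"
)
print(select_memory_lines(sample))  # [0, 2, 3]
```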

Conclusion

LongCoder is a promising model that can handle long code input and generate accurate and relevant code outputs for code completion tasks. It can help programmers write code faster and with fewer errors, as well as discover new APIs or libraries. However, LongCoder also has some limitations that need to be addressed in future work, such as generalizing to other languages, selecting memory tokens more effectively, and adapting to code inputs of different lengths and complexities.
Source
research paper - https://arxiv.org/abs/2306.14893
GitHub repo - https://github.com/microsoft/CodeBERT/tree/master/LongCoder
parent GitHub repo - https://github.com/microsoft/CodeBERT/
Hugging Face LongCoder base - https://huggingface.co/microsoft/longcoder-base
Microsoft Research - https://www.microsoft.com/en-us/research/publication/longcoder-a-long-range-pre-trained-language-model-for-code-completion/

To read more such articles, please visit our blog https://socialviews81.blogspot.com/
