(2023) An Empirical Comparison of Pre-Trained Models of Source Code
Abstract—While a large number of pre-trained models of source code have been successfully developed and applied to a variety of software engineering (SE) tasks in recent years, our understanding of these pre-trained models is arguably fairly limited. With the goal of advancing our understanding of these models, we perform the first systematic empirical comparison of 19 recently-developed pre-trained models of source code on 13 SE tasks. To gain additional insights into these models, we adopt a recently-developed 4-dimensional categorization of pre-trained models, and subsequently investigate whether there are correlations between different categories of pre-trained models and their performances on different SE tasks.

Index Terms—Pre-training of Source Code, AI for SE

I. INTRODUCTION

Despite the successful application of deep learning to various Artificial Intelligence (AI) subfields such as natural language processing (NLP) and computer vision in recent years, a large amount of annotated training data is typically needed to train the millions or even billions of network parameters in a deep neural model. For many learning tasks, including those in software engineering (SE), obtaining annotated data is costly. To address this data annotation bottleneck, NLP researchers have come up with an idea that can arguably be considered one of the most exciting developments in recent deep learning research, namely pre-training [1]–[4]. Rather than training a model from scratch (i.e., with randomly initialized network weights), which typically requires a lot of task-specific annotated data, one can first pre-train it on one or more so-called self-supervised tasks (i.e., tasks for which annotated data can be automatically generated and therefore large amounts of training data are readily available) so that its weights encode general linguistic and commonsense knowledge about language. The resulting pre-trained model can then be fine-tuned to learn the target task using (a potentially small amount of) task-specific annotated training data in the usual supervised manner. A large number of pre-trained models of natural language (PTM-NLs) have been developed and widely used in NLP, such as BERT [5], XLNet [6], RoBERTa [7], ELECTRA [8], GPT-2 [9], T5 [10], and BART [11].

Soon thereafter, pre-trained models made their way into SE research. Initial applications of pre-trained models in SE have primarily involved retraining PTM-NLs on source code [12]–[16]. Nevertheless, employing the resulting retrained models (henceforth PTM-Cs) for SE tasks is not ideal, as there are code-specific characteristics that may not be properly taken into account by these models, such as the syntactic [17], [18] and semantic structures [19] inherent in source code [20]. Consequently, SE researchers have developed a number of pre-trained models of source code (henceforth CodePTMs) that take code-specific characteristics into account in the past few years [21]–[26].

Despite the fact that a large number of CodePTMs have been successfully developed and applied to a variety of SE tasks in recent years, our understanding of CodePTMs is arguably fairly limited. Currently, only one survey of pre-trained models of source code is available, from Niu et al. [27], but it merely summarizes and analyzes the results reported in the original model papers. While pre-trained models are task-agnostic and therefore can be applied to different SE tasks by design, virtually all CodePTMs have been evaluated on only a handful of SE tasks. For instance, TreeBERT [28] has only been evaluated on code summarization and method name generation. This is by no means ideal: without knowing how TreeBERT performs on the remaining SE tasks, we do not know whether it can achieve state-of-the-art results on any of those tasks. This in turn implies that our understanding of these models could be partial and that the current state of the art could have been very different had we evaluated the existing models on most, if not all, of the available SE tasks. Even when two pre-trained models are evaluated on the same SE task, a head-to-head comparison of these models could still be complicated if they are evaluated on different datasets available for this task [29].

With the goal of advancing our understanding of existing pre-trained models of source code, we conduct the first systematic empirical comparison of 19 recently-developed CodePTMs on 13 popular SE tasks. To gain additional insights into these CodePTMs, we employ a recently-developed four-dimensional categorization of CodePTMs [27] to categorize the 19 existing CodePTMs used in our study, and subsequently investigate whether there are correlations between categories of CodePTMs and their performances on SE tasks.

II. EXPERIMENTAL SETUP

A. SE Tasks

Table I enumerates the 13 SE tasks we will use in our comparative experiments. These are also the SE tasks that
TABLE II
CATEGORIZATION OF EXISTING PRE-TRAINED MODELS AND THEIR PERFORMANCE ON SE TASKS AS REPORTED IN THEIR ORIGINAL PAPERS. THE STRONGEST RESULT FOR EACH DATASET IS BOLDFACED.

TABLE III
CATEGORIZATION AND DESCRIPTION OF THE PRE-TRAINING TASKS MENTIONED IN TABLE II.
adapted, T5-learning, PLBART, ProphetNet-Code, CoTexT, CodeT5, SPT-Code and UniXcoder are in this category. If more than one version of a model is provided, we choose the "base" version, consistent with the approach in the original paper.
(2) Of the remaining PTMs, if the source code and datasets are provided, we re-train them according to the settings introduced in the original papers to get the pre-trained models and the tokenizers. TreeBERT is the only model in this category.
(3) For those that have the source code but not the datasets, we collect the required datasets ourselves in the same way as the original authors did, and re-train them according to the settings in the original papers. Only CugLM is in this category.
(4) If no source code is provided, we re-implement and pre-train according to the settings (e.g., tokenizer, hyperparameters, and dataset) described in the original papers. They are GPT-C, C-BERT, DeepDebug and SynCoBERT.⁴
⁴ To verify the validity of the latter two types of models pre-trained by us, we perform fine-tuning on the downstream tasks corresponding to the original paper and use pair-wise t-tests to ensure that the differences between our results and those reported in the original papers are statistically indistinguishable. Details can be found in the supplementary materials.
When evaluating on a downstream SE task, each of the 19 models is fine-tuned on the training data available for that task.
b) The 5 non-PTMs: As noted above, we also include four PTMs-NL (RoBERTa, GPT-2, BART, and T5) and a vanilla Transformer model in our comparison. For the four PTMs-NL, we use their publicly available implementations. Like the 19 PTMs of source code, these five models are fine-tuned on task-specific data [66] before being applied to each downstream task.

E. Application to SE Tasks
Two aspects need to be considered when applying PTMs to SE tasks, namely Inputs and Outputs.
1) Inputs: The inputs for different SE tasks are different. When applying a PTM to a SE task, the input of the task should be organized into the form needed by the PTM. The inputs of the SE tasks in Table II belong to three types:
(1) Using only a code snippet as input: Tasks such as Defect Detection and Code Translation assume input that belongs to this category. Here, we follow the input representation defined by each PTM. For example, for TreeBERT, we parse the code into an AST and encode each path in the AST before passing it to the Transformer, as described in the original paper; and for PLBART, we add a special symbol indicating the programming language, e.g., "[java]", to the input sequence.
(2) Using only a natural language description as input: This is used by tasks such as Code Search and Code Generation. In this case, we input the text sequence directly. But for PLBART, we follow the approach described in its paper and add a special symbol "[en]" to the input.
(3) Using a code-code pair or a code-NL pair as input: Tasks like Clone Detection (inputs: code-code) and Code Question Answering (inputs: code-NL) belong to this type. In this case, we prepare the inputs for the two parts separately and then concatenate them to obtain the final input representation.
2) Outputs: The output required by a SE task may not be the same as the output produced by the PTMs. Hence, additional modules or operations may be needed in order to get the output required by the SE tasks. The outputs that need to be provided by PTMs for different SE tasks can be divided into two types:
(1) Output based on the input representation: Among the SE tasks, Code Search and Code Question Answering use the input representation directly (to calculate the similarity between two sequences), while the others need a fully connected layer and a softmax layer to be added to obtain a probability distribution. PTMs with different architectures use different ways to get the representation vector for the input. For TE-based models, we use the vector that corresponds to the position of the classification symbol in the input (typically "[CLS]") as the representation vector. For TD-based models, we use the last time step of the output hidden states (i.e., the position of the special symbol "[endoftext]" in the input sequence). For TF-based models, it depends on the model family. Since T5-based models (i.e., T5, T5-learning, DeepDebug, CoTexT and CodeT5) formalize all tasks as text-to-text tasks, for classification tasks we map all categories to text (e.g., for a binary classification task, 0 is mapped to "false" and 1 to "true"), while for retrieval tasks, we use the output hidden state of the decoder corresponding to the "[EOS]" symbol as the representation vector. In contrast, for BART-based models (i.e., BART, PLBART and SPT-Code), we keep the input of the decoder the same as the input of the encoder and use the decoder hidden state of the last time step as the representation vector. For the other TF-based models, we use only the encoder and adopt the same method as used for the TE-based models.
(2) Output based on the ultimate output sequence: For TE-based models, we follow Lu et al. [35] and randomly initialize a Transformer decoder of the same size as the model to form an encoder-decoder architecture. For TD-based models, we follow GPT-2 [9]: for training, we concatenate the input and output sequences using a special symbol; and for evaluation, we pass the input sequence concatenated with this special symbol into the model and use the sequence predicted by the model as the output. TF-based models can be applied directly to this type of task. The Code Completion task deserves special mention. Recall that it requires a model to complete an unfinished line given the previous context. During training, however, it follows the GPT-like causal language modeling manner, which is not applicable to TE- and TF-based PTMs that adopt the encoder-decoder architecture for this task. Therefore, when training TE- and TF-based PTMs, we randomly extract the first 15%-85% of the entire sequence as the input of the encoder (since the input context in the test data is ensured to be at least 15% of the whole length [35]), and the rest is used as the input of the decoder.

F. Other Settings and Data Availability
For other settings, e.g., the hyperparameters and the optimizer, we adopt those used in the provided source code or mentioned in the original paper. If neither of the above is
available, we perform parameter tuning ourselves to maximize model performance on held-out development data.

TABLE IV
CURRENT SOTAS AND NEW SOTAS.
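To make the output handling described in Section II-E concrete, the following is a minimal PyTorch sketch of how a single representation vector can be pooled from a PTM's token-level outputs for the different architecture types and then fed to a classification head or a similarity computation. It is illustrative only: hidden_states stands in for whatever per-token outputs a concrete PTM returns, and names such as pool_representation, arch, and num_labels are ours, not part of any model's API.

```python
# Minimal sketch of the output handling in Section II-E (illustrative, not the exact experiment code).
import torch
import torch.nn as nn

def pool_representation(hidden_states: torch.Tensor,
                        attention_mask: torch.Tensor,
                        arch: str) -> torch.Tensor:
    """Select one representation vector per sequence.

    hidden_states: (batch, seq_len, dim) token-level outputs of the PTM.
    attention_mask: (batch, seq_len), 1 for real tokens and 0 for padding.
    arch: "TE" for encoder-only models; anything else is treated as a
          decoder-style model whose last time step is used.
    """
    if arch == "TE":
        # Encoder-only models: use the vector at the classification symbol's
        # position (typically "[CLS]", i.e., the first token).
        return hidden_states[:, 0]
    # Decoder-style models: use the last non-padding time step, where a
    # special symbol such as "[endoftext]" or "[EOS]" sits.
    last = attention_mask.sum(dim=1) - 1
    batch_idx = torch.arange(hidden_states.size(0))
    return hidden_states[batch_idx, last]

class ClassificationHead(nn.Module):
    """Fully connected layer + softmax on top of the pooled vector."""
    def __init__(self, dim: int, num_labels: int):
        super().__init__()
        self.fc = nn.Linear(dim, num_labels)

    def forward(self, pooled: torch.Tensor) -> torch.Tensor:
        return torch.softmax(self.fc(pooled), dim=-1)

def retrieval_score(query_vec: torch.Tensor, code_vec: torch.Tensor) -> torch.Tensor:
    """Similarity used directly for Code Search / Code Question Answering."""
    return torch.cosine_similarity(query_vec, code_vec, dim=-1)

if __name__ == "__main__":
    # Toy usage with random tensors standing in for real PTM outputs.
    batch, seq_len, dim = 2, 16, 8
    states = torch.randn(batch, seq_len, dim)
    mask = torch.ones(batch, seq_len, dtype=torch.long)
    pooled = pool_representation(states, mask, arch="TE")
    probs = ClassificationHead(dim, num_labels=2)(pooled)  # e.g., defect vs. no defect
    print(probs.shape, retrieval_score(pooled, pooled).shape)
```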
TABLE V
EXPERIMENTAL RESULTS ON CODE UNDERSTANDING TASKS.
(Cur = currently reported results; New = results from our experiments; "-" = no currently reported result.)

Model | DD Acc (Cur / New) | CD F1 (Cur / New) | ET Acc (Cur / New) | CR MAP (Cur / New) | CS MRR (Cur / New) | QA MRR (Cur / New)
Transformer | - / 64.40 | - / 89.27 | - / 48.98 | - / 64.27 | - / 3.12 | - / 52.89
RoBERTa | 61.05 / 64.47 | 94.9 / 95.35 | - / 76.94 | 76.67 / 80.20 | 18.33 / 18.82 | 60.3 / 60.28
GPT-2 | - / 63.22 | - / 96.22 | - / 75.54 | - / 53.30 | - / 16.38 | - / 58.06
BART | - / 63.81 | - / 95.11 | - / 73.68 | - / 79.63 | - / 16.65 | - / 55.57
T5 | 61.93 / 61.87 | - / 94.86 | - / 74.75 | - / 69.16 | - / 16.97 | - / 45.63
CuBERT | - / 64.25 | - / 94.78 | 79.12 / 79.90 | - / 76.87 | - / 22.26 | - / 54.33
GPT-C | - / 63.77 | - / 95.46 | - / 78.26 | - / 55.23 | - / 24.39 | - / 50.32
C-BERT | 57.4 / 64.05 | - / 95.00 | - / 74.57 | - / 72.91 | - / 25.34 | - / 54.81
JavaBERT | - / 64.50 | - / 96.57 | - / 67.66 | - / 77.44 | - / 25.02 | - / 54.04
CodeGPT-adapted | - / 65.64 | - / 96.65 | - / 76.71 | - / 72.63 | - / 25.97 | - / 54.24
DeepDebug | - / 64.18 | - / 95.90 | - / 73.50 | - / 73.51 | - / 30.58 | - / 57.39
CodeBERT | - / 65.02 | - / 96.77 | - / 81.25 | 82.67 / 85.61 | - / 38.21 | 65.7 / 65.90
GraphCodeBERT | - / 65.92 | 97.1 / 97.11 | - / 83.26 | 85.16 / 87.73 | - / 38.76 | 68.4 / 68.55
CugLM | - / 64.19 | - / 96.44 | - / 79.01 | - / 83.32 | - / 36.20 | - / 61.44
DOBF | - / 63.86 | 95.9 / 96.84 | - / 79.04 | - / 87.31 | 38.3 / 38.56 | - / 61.31
T5-learning | - / 63.60 | - / 96.38 | - / 69.85 | - / 80.82 | - / 37.98 | - / 60.21
PLBART | 63.18 / 64.21 | 97.2 / 97.01 | - / 77.93 | - / 85.02 | - / 38.70 | 65.0 / 65.01
ProphetNet-Code | - / 63.57 | - / 96.05 | - / 79.37 | - / 79.82 | - / 37.64 | - / 63.73
CoTexT | 65.99 / 65.68 | - / 95.96 | - / 77.21 | - / 86.65 | - / 38.13 | - / 68.70
TreeBERT | - / 65.76 | - / 96.51 | - / 78.08 | - / 85.54 | - / 39.60 | - / 64.98
CodeT5 | 65.78 / 65.82 | 97.2 / 97.18 | - / 85.00 | - / 87.53 | - / 40.03 | 67.8 / 67.91
SynCoBERT | 64.5 / 66.25 | 97.4 / 97.55 | - / 82.70 | 88.24 / 88.52 | 38.1 / 39.99 | - / 69.19
SPT-Code | - / 64.88 | - / 96.40 | - / 77.11 | - / 86.54 | - / 37.05 | - / 64.55
UniXcoder | - / 65.64 | 95.2 / 96.32 | - / 83.47 | 90.52 / 90.55 | 41.3 / 41.57 | 70.1 / 70.30
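Footnote 4 in Section II-D mentions that we used pair-wise t-tests to check that the models we pre-trained ourselves are statistically indistinguishable from the numbers reported in their original papers. The snippet below is a minimal sketch of such a paired comparison with SciPy; the two score lists are hypothetical placeholders, not our actual measurements.

```python
# Minimal sketch of a pair-wise (paired) t-test as mentioned in footnote 4.
from scipy import stats

# Hypothetical per-dataset scores: originally reported vs. our replication.
reported = [65.8, 97.2, 67.8, 88.2]
replicated = [65.5, 97.0, 67.9, 88.0]

t_stat, p_value = stats.ttest_rel(reported, replicated)
# A large p-value (e.g., > 0.05) means the difference between the two sets of
# results is not statistically significant, i.e., the replication is
# statistically indistinguishable from the originally reported results.
print(f"t = {t_stat:.3f}, p = {p_value:.3f}")
```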
the Code Completion task whose SOTA model, CodeGPT-adapted, is of type PTM-C, which covers models designed for natural language but pre-trained on source code.
Second, while many PTMs have been proposed, only five of them have managed to achieve SOTA performance on at least one SE task. They are CodeT5 (SOTA on 5 tasks), UniXcoder (SOTA on 3 tasks), PLBART (SOTA on 2 tasks), SynCoBERT (SOTA on 2 tasks), and CodeGPT-adapted (SOTA on 1 task).
Third, vanilla Transformer's performance relative to the PTMs differs across SE tasks: (1) on Clone Detection (CD), Error Type prediction (ET), Code Search (CS), Code Translation, Assert Generation, and Code Summarization, vanilla Transformer is surpassed in performance by all types of PTMs (i.e., PTM-NL, PTM-C, and CodePTM); (2) on Code Completion and Mutant Generation, vanilla Transformer is beaten by all PTMs-C and CodePTMs but it outperforms two PTMs-NL, BART and T5; (3) on Code-to-Code Retrieval (CR) and Code Question Answering (QA), vanilla Transformer not only surpasses a PTM-NL (GPT-2 on CR and T5 on QA), but also beats one PTM-C (GPT-C for both tasks); and (4) on Defect Detection (DD) and Bug Fixing, vanilla Transformer even outperforms some CodePTMs in addition to PTMs-NL and PTMs-C, beating CugLM, DOBF, T5-learning, PLBART, and ProphetNet-Code on DD, and CugLM on Bug Fixing.
In the following, we discuss in detail the observations obtained from the current and the new results on each task.

A. Defect Detection
SynCoBERT defeats CoTexT and becomes the new SOTA PTM for this task, and Accuracy improves by 0.26.
1) Architecture: While the Top-2 models on this classification task, SynCoBERT and GraphCodeBERT, are both TE-based, there is not enough empirical evidence for us to conclude that TE is a better architecture for this task than TD or TF, for several reasons. First, to draw this conclusion, we need to compare the results of two models that differ only w.r.t. architecture, but there do not exist two PTMs on our list that differ only w.r.t. architecture. Second, TF-based CoTexT, which uses MLM as the only pre-training task, outperforms TE-based UniXcoder, which uses three more complex pre-training tasks, ULM, MCL and CMG. Finally, TF-based DeepDebug achieves better results than TE-based C-BERT when using only code as input and MLM as its only pre-training task.
2) Modality: Both code structure and NL are shown to have a positive effect on the performance of the models on this task, but the way they are used also matters. As an example, TF-based TreeBERT significantly outperforms some of the TF-based models that use code and NL (e.g., DOBF, T5-learning, PLBART), owing to its use of ASTs. As another example, TF-based CoTexT outperforms TF-based T5-learning considerably: CoTexT concatenates the code and the corresponding documentation as one single input, whereas T5-learning treats the features derived from these two modalities as separate data instances. This suggests that how the information derived from these modalities is used has an impact on performance.
3) Pre-training Tasks: First, the results in the New column of this task in Table V reveal that the most influential pre-training tasks are cross-modal-aware classification tasks, as they are used by the Top-5 models. These tasks include TEP/EP (used by SynCoBERT and GraphCodeBERT), MCL (used by SynCoBERT and UniXcoder), and NA (adopted by GraphCodeBERT). This observation is different from the conclusion derived from the Cur column, where Seq2Seq MLM (the only pre-training task used by the old SOTA model, CoTexT) seems to have the greatest impact on defect detection.
B. Clone Detection
The new results on this task do not change significantly from the current ones, except that PLBART, which is currently tied for second place, has slipped to fourth place. The drop in PLBART's rank seems to suggest that using multiple pre-training tasks is better than using a single pre-training task on this task: while the Top-2 models, SynCoBERT and CodeT5, employ four distinct pre-training tasks, PLBART uses DAE as its only pre-training task. Besides, the new results also enable us to see the performance of TD-based models on this task; in particular, the best TD-based PTM, CodeGPT-adapted, ranks 7th.

C. Exception Type
This task is the only multi-label classification task among our 13 SE evaluation tasks. Currently, only one model (i.e., CuBERT) has been applied to this task, which prevents us from drawing any conclusions about the relative performance of different types of models on a multi-label classification task like this. Fortunately, our results enable us to draw several new conclusions:
1) Architecture: Most notably, according to the new results, the SOTA performance on this task is not achieved by a TE-based model. Instead, TF-based CodeT5, which turns the task into a text-to-text form, achieves the best results. The best TE-based model (UniXcoder) and the best TD-based model (GPT-C) rank second and tenth respectively, and their accuracies are 1.53 and 6.74 points lower than that of CodeT5. Recall that in Section II-E, we mentioned that as a T5-based model, CodeT5, when applied to a classification task, maps each label to a unique text string. Specifically for Exception Type, it does not predict the index of each exception, but rather the text string of that exception. In this way, CodeT5 turns this classification task into one of generating NL, which is exactly what CodeT5 is good at. In contrast, for TE-based models (e.g., SynCoBERT, UniXcoder, GraphCodeBERT), most of the tasks they use in pre-training are binary classification tasks (e.g., MCL, TEP/EP, NA), so they may lack the knowledge needed for multi-label classification.
2) Modality: The impact of each modality on this task becomes clear as well. All of the Top-3 models (i.e., CodeT5, UniXcoder, and GraphCodeBERT) use NL as one of the input modalities, while code and code structure are each used by only two of them (CodeT5 and UniXcoder). This seems to suggest that NL has a stronger positive impact on this task than the other two modalities.
3) Pre-training Tasks: Both the classification pre-training task NSP and the generative pre-training task FNP seem to have positive impacts on this task. To exemplify, while CuBERT and C-BERT are both TE-based models that use code as the only modality and differ only in their pre-training tasks (CuBERT uses both MLM and NSP whereas C-BERT uses only MLM), CuBERT outperforms C-BERT by as many as 5 percentage points in accuracy. As another example, while ProphetNet-Code and PLBART are both TF-based models that use code and NL as input modalities and differ only in terms of their pre-training tasks (ProphetNet-Code uses FNP whereas PLBART uses DAE), ProphetNet-Code surpasses PLBART in performance.

D. Code-to-Code Retrieval
Currently, the relative advantages and disadvantages of different model architectures are not clear, since only four TE-based models have been evaluated on this task. However, with the new results, the conclusion that TE-based models have more advantages over the other architectures on this task can be verified, since the Top-3 models on this task are all TE-based (i.e., UniXcoder, SynCoBERT, and GraphCodeBERT). Besides, the performance of the TF- and TD-based models is also measurable: the best performing TF-based model (CodeT5) and TD-based model (CodeGPT-adapted) rank 4th and 20th, respectively.

E. Code Search
1) Architecture: Although the SOTA model on this task is still UniXcoder (TE-based), the rank of CodeT5 (TF-based) improved from third to second in the new results, and the third position is taken by SynCoBERT (TE-based). TreeBERT (TF-based) ranks fourth, GraphCodeBERT (TE-based) ranks fifth, and PLBART (TF-based) ranks sixth. These results seem to suggest that TE-based and TF-based models perform comparably on this task, as they alternate in the Top-6. In addition, the performance of TD-based models on this task is now measurable: the best TD-based PTM (CodeGPT-adapted) ranks 15th.
2) Pre-training Tasks: The MLM pre-training task and its variants, as well as cross-modal-aware tasks, demonstrate their necessity in achieving top performance on this task. Specifically, the pre-training tasks used by the top-ranked models all include MLM (and its variants such as Seq2seq MLM), as well as cross-modal-aware tasks (e.g., MCL, BDG, EP). On one hand, MLM and its variants enable a model to generate better input representations. On the other hand, the cross-modal-aware tasks typically allow a model to learn the alignment between different input modalities with the same semantics. These two types of pre-training tasks therefore allow a model to generate a more uniform input representation for multimodal inputs, which is exactly what a model needs for Code Search.
3) Modality: Pre-training on multiple modalities appears to benefit this task, since all of the Top-6 models are pre-trained on two or three modalities. Concretely, UniXcoder is pre-trained on NL and Structure, TreeBERT is pre-trained on Code and Structure, CodeT5 and PLBART are both pre-trained on Code and NL, while SynCoBERT and GraphCodeBERT are pre-trained on all three modalities. It is hard to tell which modality has the largest impact on performance, because the absence of any one of them would not prevent a model from entering the Top-6.

F. Code Question Answering
The new SOTA model remains the same as the current one, i.e., UniXcoder. But our newly reported SOTA performance
TABLE VI
EXPERIMENTAL RESULTS ON CODE TRANSLATION AND ASSERT GENERATION.

TABLE VII
EXPERIMENTAL RESULTS ON BUG FIXING, CODE COMPLETION AND MUTANT GENERATION.
has an improvement of 0.2 MRR points. Note that MCL, the pre-training task used by the SOTA model UniXcoder, aims to distinguish whether two inputs match each other, which is also the goal of the Code Question Answering task.

G. Code Translation
Although the models are ranked according to their average EM value on the "Java to C#" and the "C# to Java" sub-datasets, we find that the Top-2 models on the two sub-datasets are both PLBART and CodeT5.¹⁰ Besides, the Top 3–6 models on this task are CoTexT, SPT-Code, DOBF, and UniXcoder, respectively.
¹⁰ For a discussion of the results w.r.t. other evaluation metrics, see the supplementary file.
1) Architecture: According to the new results, the conclusion that TF-based models take the absolute lead on this task can be verified, since the Top-5 models are all TF-based. Moreover, given that we have more TF-based models in our comparison than before, the rank of the best performing TE-based model (i.e., SynCoBERT in Current and UniXcoder in New) drops from fourth to sixth.
2) Modality: The importance of NL is well validated, due to the fact that the Top-4 performers in both the current (i.e., CodeT5, PLBART, SPT-Code and SynCoBERT) and the new results (i.e., PLBART, CodeT5, CoTexT and SPT-Code)
TABLE VIII
EXPERIMENTAL RESULTS ON CODE GENERATION TASKS INVOLVING NATURAL LANGUAGE.
use NL. The role of code structure, on the other hand, is less clear, since the Top-2 models (i.e., PLBART and CodeT5) are not pre-trained on the Structure modality, and the best model using code structure drops from third place in "Cur" (i.e., SynCoBERT) to fourth place in "New" (i.e., SPT-Code).
3) Pre-training Tasks: The new results show that the pre-training objective DAE has a more significant impact than BDG/CMG. To exemplify, consider PLBART and CodeT5, both of which are TF-based and employ the same modalities (code and NL). They differ only in terms of the pre-training tasks: the former uses DAE and the latter uses BDG/CMG. The fact that PLBART outperforms CodeT5 can therefore be attributed to DAE being a better pre-training task for Code Translation than BDG/CMG. This conclusion is contrary to the conclusion drawn from the Cur results, where BDG is believed to have a stronger influence than DAE on Code Translation due to the fact that CodeT5 beat PLBART by 1.3 EM points.

H. Bug Fixing
Considering the EM value averaged over the "small" and "medium" datasets, the Top-4 models change from CodeT5, CoTexT, DeepDebug, and SPT-Code (listed in decreasing order of performance) to CodeT5, CoTexT, TreeBERT, and UniXcoder. A closer examination of the sub-datasets reveals that UniXcoder outperforms CoTexT and TreeBERT, achieving the second best performance on the "medium" dataset while ranking 4th on the "small" one.
1) Architecture: The Top-3 performance is achieved by three TF-based models (i.e., CodeT5, CoTexT, and TreeBERT), the best and second best TE-based models (i.e., UniXcoder and SynCoBERT) rank 4th and 5th respectively, and the best TD-based model (i.e., CodeGPT-adapted) only ranks 14th. This seems to suggest that the TF architecture should be considered first when designing high-performance pre-trained models for this task.
2) Pre-training Tasks: The most useful pre-training tasks for Bug Fixing are the sequence-to-sequence variants of MLM adapted to the Transformer decoder. They enable a model to acquire the ability to generate target sequences from an incomplete one. As an example, consider the top-3 TF-based models, which all use such pre-training tasks: Seq2seq MLM in CodeT5 and CoTexT, TMLM in TreeBERT, and Seq2seq IMLM in CodeT5. Moreover, by using Seq2seq MLM as the only pre-training task, the second-highest ranked model, CoTexT, achieves better performance than TreeBERT, which uses NOP in addition to TMLM for pre-training.

I. Code Completion
This is the only SE task among the ones we consider where SOTA performance is achieved by the TD-based model CodeGPT-adapted, and it is the SOTA model in both the current and new results. This seems to suggest the absolute dominance of the TD architecture on this task. Our new results further suggest that the pre-training objective ULM (adopted by the Top-3 models on this task, i.e., CodeGPT-adapted, CugLM, and GPT-C), whose goal is similar to that of code completion, plays an influential role in Code Completion. As an example, consider the TE-based model CugLM, which outperforms another TE-based model, CuBERT (pre-trained on MLM and NSP), and achieves the second best performance by using ULM in addition to MLM and NSP. Moreover, in terms of modality, Code Completion is the only task where neither NL nor code structure plays a positive role, since all of the Top-3 models use code as the only input
modality. We speculate the reason is that there is currently no effective way to combine these two modalities with ULM.

J. Assert Generation
Since only T5-learning has been evaluated on this task currently, all conclusions drawn from the new results can be viewed as new findings. First, the Top-5 performers are all TF-based models (i.e., PLBART, TreeBERT, SPT-Code, T5-learning, and CodeT5, in order). The best performing TE-based (i.e., UniXcoder) and TD-based (i.e., CodeGPT-adapted) models rank 8th and 16th, respectively. As far as modality is concerned, NL seems to have a greater impact than the other modalities, as four of the Top-5 models (i.e., PLBART, SPT-Code, T5-learning, and CodeT5) use NL, whereas only two (i.e., TreeBERT and SPT-Code) use code structure.

K. Mutant Generation
NL and code structure appear to have positive impacts, since the Top-3 models (i.e., CodeT5, TreeBERT, and PLBART, in order) use either NL or code structure as one of the inputs in addition to the code. As for pre-training tasks, DAE alone is able to help a model (i.e., PLBART) achieve the third best performance. The structure-aware pre-training tasks, such as TMLM and NOP used by TreeBERT (the second best) and CAP used by SPT-Code (the fourth best), clearly have positive impacts on this task.

L. Code Summarization
1) Architecture: The best TE-based model (UniXcoder) ranks 5th, with the Top-4 being TF-based models (i.e., CodeT5, ProphetNet-Code, CoTexT, and SPT-Code), which suggests a strong positive influence of the TF architecture on this task. This is not in line with the current results, in which the best TE-based model ranked second. With the new results, the best TD-based model (GPT-2) ranks 15th.
2) Modality: The highest ranks achieved by models pre-trained on NL only (e.g., T5 and BART) are 5th and 9th in the current and new results, respectively. The reasons why they perform even better than some of the models pre-trained on code or structure in addition to NL (e.g., CodeBERT, GraphCodeBERT, etc.) are two-fold. First, they acquire the ability to generate NL during pre-training, which is required by Code Summarization. Second, because of the "naturalness" of source code [13], [68], they are able to understand the code to some extent even though they only have the ability to understand NL.
3) Pre-training Tasks: Cross-modal (Code and Natural Language)-aware generation tasks such as BDG/CMG and MNG have positive impacts on a model's performance on this task. As an example, CodeT5, which utilizes BDG, and SPT-Code, which utilizes MNG, are the top performers among TF-based models, and UniXcoder, which utilizes CMG, is the top performer among TE-based models.

M. Code Generation
The SOTA model changes from TE-based (UniXcoder) to TF-based (CodeT5), and the best TE-based (UniXcoder) and TD-based (CodeGPT-adapted) models rank 2nd and 11th, respectively. While the new SOTA model (i.e., CodeT5) for Code Generation is also the SOTA model for Code Summarization, the ranks of T5 and BART (pre-trained only on NL) on this task are lower than their ranks on Code Summarization, because understanding code and generating code are fundamentally different in nature. In addition, the importance of the code and NL modalities for this task is not as clear as that for Code Summarization, considering that among the Top-3 models (CodeT5, UniXcoder, and TreeBERT), only CodeT5 uses both code and NL: UniXcoder uses code structure and NL, and TreeBERT uses code structure and code instead. Moreover, although only NL is the input of this task, pre-training on code structure has a positive impact on this task, since both the second and third best models (UniXcoder and TreeBERT) are pre-trained on tasks such as MCL, CMG, and NOP.

VI. INSIGHTS AND TAKEAWAYS
After the task-by-task analysis and discussion, we have some insights and takeaways to offer to subsequent researchers.
• When designing a new model to solve multiple tasks, look up the current SOTA model's architecture, features, and pre-training tasks for each task, and use such information as a starting point.
• Always pre-train on multiple programming languages.
• Always pre-train with NL, since all of the new SOTAs use NL.
• Utilize structure information in PTMs for code understanding tasks.
• Ensure the pre-training tasks are as similar in form as possible to the target downstream task.
• Use different CodePTMs for different target task types, since there is no almighty CodePTM, as per our results and Zeng et al. [69].
Particularly, for the following tasks, we have additional takeaways:
• Clone Detection: Although the best performance is achieved by a TE-based model, comparable results are achieved with TF-based models. Besides, the use of NL and code structure is beneficial. Finally, MLM and its variants yield better results on this task.
• Code-to-Code Retrieval: Utilize NL and code structure following the "Altogether" strategy. Besides, MLM and its variants, as well as structure-aware pre-training tasks, have positive effects on this task.
• Code Question Answering: Prefer TE models and use NL whenever possible.
• Assert Generation: NL is not a required modality. The reason is that although the model with the best performance uses NL, NL is not used in the same data sample as other modalities (because of the Standalone strategy).
Seq2seq pre-training tasks, such as DAE, MASS and MNG, should be prioritized.
• Code Generation: Besides TF, TE is worth trying. The use of NL-to-code generation pre-training tasks (e.g., BDG and CMG) is mandatory.
Finally, based on our experiments, we propose several possible directions for future research:
• Design more efficient pre-training tasks to make CodePTMs learn source code features better [20].
• Improve the efficiency of CodePTMs for fine-tuning on downstream tasks [70].
• Make the large CodePTMs lighter [71], [72].
• Improve the robustness of CodePTMs.

VII. THREATS TO VALIDITY
Construct Validity: As discussed in Section II-D, we have re-implemented some PTMs (category IV) or re-collected some datasets (category III and some in IV). The replication may not be perfect, but we have tried our best in re-implementing the models and collecting the datasets to minimize deviations from the original models (see Section II-D). Besides, we adopt statistical significance testing to measure the differences between our implementations and the original ones.
Internal Validity: It is widely agreed that, during fine-tuning, hyperparameters have a significant impact on the performance of pre-trained models. For models where the hyperparameters for fine-tuning are not available (see Section II-D), the settings we obtain by hyperparameter search may introduce some bias relative to the performance reported in the original papers. But we have tried our best to derive the best performance of these models on each SE task.
External Validity: The results and observations we obtained in this work may apply only to the downstream tasks and corresponding datasets we have evaluated. For other SE tasks and datasets, we cannot guarantee exactly the same results and observations.

VIII. CONCLUSION
We conducted the first systematic empirical comparison of existing pre-trained models of source code.¹¹ We believe that the results of our large-scale evaluation and the associated discussion can provide SE researchers with a better understanding of existing PTMs and their relative strengths and weaknesses, as well as a better characterization of the state of the art on each SE task on which PTMs are commonly evaluated.
This paper provides many valuable findings that are either not available based on the existing results alone or completely contrary to current findings. For example, we found that TF-based models have clear advantages not only for code generation tasks but also for code understanding tasks. We hope that this paper can provide interested researchers with comprehensive and comparable insights into the current state of this domain and inspire them to design more powerful pre-trained models of source code.
¹¹ All materials used in our experiments are available at https://github.com/NougatCA/FineTuner and https://doi.org/10.5281/zenodo.7318110.

ACKNOWLEDGMENT
This work was supported by the National Natural Science Foundation of China (61802167), the Natural Science Foundation of Jiangsu Province, China (BK20201250), the Cooperation Fund of Huawei-NJU Creative Laboratory for the Next Programming, and NSF award 2034508. We also thank the reviewers for their helpful comments. Chuanyi Li and Jidong Ge are the corresponding authors.

REFERENCES
[1] A. M. Dai and Q. V. Le, "Semi-supervised sequence learning," Advances in neural information processing systems, vol. 28, 2015.
[2] J. Howard and S. Ruder, "Universal language model fine-tuning for text classification," in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2018, pp. 328–339.
[3] M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark, K. Lee, and L. Zettlemoyer, "Deep contextualized word representations," in Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). New Orleans, Louisiana: Association for Computational Linguistics, Jun. 2018, pp. 2227–2237. [Online]. Available: https://aclanthology.org/N18-1202
[4] A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, "Improving language understanding by generative pre-training," 2018.
[5] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "Bert: Pre-training of deep bidirectional transformers for language understanding," in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 2019, pp. 4171–4186.
[6] Z. Yang, Z. Dai, Y. Yang, J. Carbonell, R. R. Salakhutdinov, and Q. V. Le, "Xlnet: Generalized autoregressive pretraining for language understanding," Advances in neural information processing systems, vol. 32, 2019.
[7] Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, and V. Stoyanov, "Roberta: A robustly optimized bert pretraining approach," arXiv preprint arXiv:1907.11692, 2019.
[8] K. Clark, M.-T. Luong, Q. V. Le, and C. D. Manning, "Electra: Pre-training text encoders as discriminators rather than generators," in International Conference on Learning Representations, 2019.
[9] A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever, "Language models are unsupervised multitask learners," 2019.
[10] C. Raffel, N. Shazeer, A. Roberts, K. Lee, S. Narang, M. Matena, Y. Zhou, W. Li, and P. J. Liu, "Exploring the limits of transfer learning with a unified text-to-text transformer," Journal of Machine Learning Research, vol. 21, pp. 1–67, 2020.
[11] M. Lewis, Y. Liu, N. Goyal, M. Ghazvininejad, A. Mohamed, O. Levy, V. Stoyanov, and L. Zettlemoyer, "Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension," in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020, pp. 7871–7880.
[12] A. Kanade, P. Maniatis, G. Balakrishnan, and K. Shi, "Learning and evaluating contextual embedding of source code," in International Conference on Machine Learning. PMLR, 2020, pp. 5110–5121.
[13] L. Buratti, S. Pujar, M. Bornea, S. McCarley, Y. Zheng, G. Rossiello, A. Morari, J. Laredo, V. Thost, Y. Zhuang et al., "Exploring software naturalness through neural language models," arXiv preprint arXiv:2006.12641, 2020.
[14] A. Svyatkovskiy, S. K. Deng, S. Fu, and N. Sundaresan, "Intellicode compose: Code generation using transformer," in Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020, pp. 1433–1443.
[15] N. T. De Sousa and W. Hasselbring, "Javabert: Training a transformer-based model for the java programming language," in 2021 36th IEEE/ACM International Conference on Automated Software Engineering Workshops (ASEW). IEEE, 2021, pp. 90–95.
[16] D. Drain, C. Wu, A. Svyatkovskiy, and N. Sundaresan, "Generating bug-fixes using pretrained transformers," in Proceedings of the 5th ACM SIGPLAN International Symposium on Machine Programming, 2021, pp. 1–8.
[17] U. Alon, M. Zilberstein, O. Levy, and E. Yahav, "code2vec: Learning distributed representations of code," Proceedings of the ACM on Programming Languages, vol. 3, no. POPL, pp. 1–29, 2019.
[18] J. Zhang, X. Wang, H. Zhang, H. Sun, K. Wang, and X. Liu, "A novel neural source code representation based on abstract syntax tree," in 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 2019, pp. 783–794.
[19] T. Ben-Nun, A. S. Jakobovits, and T. Hoefler, "Neural code comprehension: A learnable representation of code semantics," Advances in Neural Information Processing Systems, vol. 31, 2018.
[20] A. Karmakar and R. Robbes, "What do pre-trained code models know about code?" in 2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 2021, pp. 1332–1336.
[21] M. Allamanis, M. Brockschmidt, and M. Khademi, "Learning to represent programs with graphs," in International Conference on Learning Representations, 2018.
[22] C. Cummins, H. Leather, Z. Fisches, T. Ben-Nun, T. Hoefler, and M. O'Boyle, "Deep data flow analysis," 2020. [Online]. Available: https://arxiv.org/abs/2012.01470
[23] T. Hoang, H. J. Kang, D. Lo, and J. Lawall, "Cc2vec: Distributed representations of code changes," in Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, 2020, pp. 518–529.
[24] W. Ma, M. Zhao, E. Soremekun, Q. Hu, J. M. Zhang, M. Papadakis, M. Cordy, X. Xie, and Y. L. Traon, "Graphcode2vec: generic code embedding via lexical and program dependence analyses," in Proceedings of the 19th International Conference on Mining Software Repositories, 2022, pp. 524–536.
[25] N. D. Bui, Y. Yu, and L. Jiang, "Infercode: Self-supervised learning of code representations by predicting subtrees," in 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE). IEEE, 2021, pp. 1186–1197.
[26] K. Zhang, W. Wang, H. Zhang, G. Li, and Z. Jin, "Learning to represent programs with heterogeneous graphs," in Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022, pp. 378–389.
[27] C. Niu, C. Li, B. Luo, and V. Ng, "Deep learning meets software engineering: A survey on pre-trained models of source code," in Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, 2022, pp. 5546–5555.
[28] X. Jiang, Z. Zheng, C. Lyu, L. Li, and L. Lyu, "Treebert: A tree-based pre-trained model for programming language," in Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, vol. 161. PMLR, 27–30 Jul 2021, pp. 54–63.
[29] C. Niu, C. Li, V. Ng, J. Ge, L. Huang, and B. Luo, "Spt-code: Sequence-to-sequence pre-training for learning source code representations," in 2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE), 2022, pp. 01–13.
[30] Y. Zhou, S. Liu, J. Siow, X. Du, and Y. Liu, "Devign: Effective vulnerability identification by learning comprehensive program semantics via graph neural networks," Advances in neural information processing systems, vol. 32, 2019.
[31] M. Pradel and K. Sen, "Deepbugs: A learning approach to name-based bug detection," Proceedings of the ACM on Programming Languages, vol. 2, no. OOPSLA, pp. 1–25, 2018.
[32] J. Svajlenko, J. F. Islam, I. Keivanloo, C. K. Roy, and M. M. Mia, "Towards a big data curated benchmark of inter-project code clones," in Proceedings of the 2014 IEEE International Conference on Software Maintenance and Evolution, 2014, pp. 476–480.
[33] K. W. Nafi, T. S. Kar, B. Roy, C. K. Roy, and K. A. Schneider, "Clcdsa: cross language code clone detection using syntactical features and api documentation," in 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 2019, pp. 1026–1037.
[34] L. Mou, G. Li, L. Zhang, T. Wang, and Z. Jin, "Convolutional neural networks over tree structures for programming language processing," in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016, pp. 1287–1293.
[35] S. Lu, D. Guo, S. Ren, J. Huang, A. Svyatkovskiy, A. Blanco, C. Clement, D. Drain, D. Jiang, D. Tang, G. Li, L. Zhou, L. Shou, L. Zhou, M. Tufano, M. Gong, M. Zhou, N. Duan, N. Sundaresan, S. K. Deng, S. Fu, and S. Liu, "CodeXGLUE: A machine learning benchmark dataset for code understanding and generation," in Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1), 2021.
[36] J. Huang, D. Tang, L. Shou, M. Gong, K. Xu, D. Jiang, M. Zhou, and N. Duan, "Cosqa: 20,000+ web queries for code search and question answering," in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021, pp. 5690–5700.
[37] B. Roziere, M.-A. Lachaux, L. Chanussot, and G. Lample, "Unsupervised translation of programming languages," Advances in Neural Information Processing Systems, vol. 33, pp. 20601–20611, 2020.
[38] M. Tufano, C. Watson, G. Bavota, M. D. Penta, M. White, and D. Poshyvanyk, "An empirical study on learning bug-fixing patches in the wild via neural machine translation," ACM Transactions on Software Engineering and Methodology (TOSEM), vol. 28, no. 4, pp. 1–29, 2019.
[39] V. Raychev, P. Bielik, and M. Vechev, "Probabilistic model for code with decision trees," in Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, 2016, pp. 731–747.
[40] F. Liu, G. Li, Y. Zhao, and Z. Jin, "Multi-task learning based pre-trained language model for code completion," in Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020, pp. 473–485.
[41] U. Alon, R. Sadaka, O. Levy, and E. Yahav, "Structural language models of code," in International conference on machine learning. PMLR, 2020, pp. 245–256.
[42] M. Tufano, C. Watson, G. Bavota, M. Di Penta, M. White, and D. Poshyvanyk, "Learning how to mutate source code from bug-fixes," in 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE Computer Society, 2019, pp. 301–312.
[43] C. Watson, M. Tufano, K. Moran, G. Bavota, and D. Poshyvanyk, "On learning meaningful assert statements for unit test cases," in Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, 2020, pp. 1398–1409.
[44] S. Haque, A. LeClair, L. Wu, and C. McMillan, "Improved automatic summarization of subroutines via attention to file context," in Proceedings of the 17th International Conference on Mining Software Repositories, 2020, pp. 300–310.
[45] X. Hu, G. Li, X. Xia, D. Lo, and Z. Jin, "Deep code comment generation," in 2018 IEEE/ACM 26th International Conference on Program Comprehension (ICPC). IEEE, 2018, pp. 200–210.
[46] U. Alon, S. Brody, O. Levy, and E. Yahav, "code2seq: Generating sequences from structured representations of code," in International Conference on Learning Representations, 2019.
[47] X. Hu, G. Li, X. Xia, D. Lo, S. Lu, and Z. Jin, "Summarizing source code with transferred api knowledge," in Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018, 2018, pp. 2269–2275.
[48] A. V. Miceli-Barone and R. Sennrich, "A parallel corpus of python functions and documentation strings for automated code documentation and code generation," in Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers), 2017, pp. 314–319.
[49] S. Iyer, I. Konstas, A. Cheung, and L. Zettlemoyer, "Mapping language to code in programmatic context," in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018, pp. 1643–1652.
[50] K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, "Bleu: a method for automatic evaluation of machine translation," in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, 2002, pp. 311–318.
[51] S. Ren, D. Guo, S. Lu, L. Zhou, S. Liu, D. Tang, N. Sundaresan, M. Zhou, A. Blanco, and S. Ma, "Codebleu: a method for automatic evaluation of code synthesis," arXiv preprint arXiv:2009.10297, 2020.
[52] R.-M. Karampatsis and C. Sutton, "Scelmo: Source code embeddings from language models," arXiv preprint arXiv:2004.13214, 2020.
[53] Z. Feng, D. Guo, D. Tang, N. Duan, X. Feng, M. Gong, L. Shou, B. Qin, T. Liu, D. Jiang, and M. Zhou, "Codebert: A pre-trained model for programming and natural languages," in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, 2020, pp. 1536–1547.
[54] D. Guo, S. Ren, S. Lu, Z. Feng, D. Tang, S. Liu, L. Zhou, N. Duan, A. Svyatkovskiy, S. Fu, M. Tufano, S. K. Deng, C. Clement, D. Drain, N. Sundaresan, J. Yin, D. Jiang, and M. Zhou, "Graphcodebert: Pre-training code representations with data flow," in International Conference on Learning Representations, 2021.
[55] B. Roziere, M.-A. Lachaux, M. Szafraniec, and G. Lample, “Dobf: A
deobfuscation pre-training objective for programming languages,” arXiv
preprint arXiv:2102.07492, 2021.
[56] A. Mastropaolo, S. Scalabrino, N. Cooper, D. N. Palacio, D. Poshy-
vanyk, R. Oliveto, and G. Bavota, “Studying the usage of text-to-text
transfer transformer to support code-related tasks,” in 2021 IEEE/ACM
43rd International Conference on Software Engineering (ICSE). IEEE,
2021, pp. 336–347.
[57] W. Ahmad, S. Chakraborty, B. Ray, and K.-W. Chang, “Unified pre-
training for program understanding and generation,” in Proceedings of
the 2021 Conference of the North American Chapter of the Association
for Computational Linguistics: Human Language Technologies, 2021,
pp. 2655–2668.
[58] W. Qi, Y. Gong, Y. Yan, C. Xu, B. Yao, B. Zhou, B. Cheng, D. Jiang,
J. Chen, R. Zhang et al., “Prophetnet-x: Large-scale pre-training models
for english, chinese, multi-lingual, dialog, and code generation,” arXiv
preprint arXiv:2104.08006, 2021.
[59] L. Phan, H. Tran, D. Le, H. Nguyen, J. Annibal, A. Peltekian, and Y. Ye,
“Cotext: Multi-task learning with code-text transformer,” in Proceedings
of the 1st Workshop on Natural Language Processing for Programming
(NLP4Prog 2021), 2021, pp. 40–47.
[60] D. Peng, S. Zheng, Y. Li, G. Ke, D. He, and T.-Y. Liu, “How could
neural networks understand programs?” in International Conference on
Machine Learning. PMLR, 2021, pp. 8476–8486.
[61] J. Zhang, H. Hong, Y. Zhang, Y. Wan, Y. Liu, and Y. Sui, “Disentangled
code representation learning for multiple programming languages,” in
Findings of the Association for Computational Linguistics: ACL-IJCNLP
2021, 2021, pp. 4454–4466.
[62] Y. Wang, W. Wang, S. Joty, and S. C. Hoi, “Codet5: Identifier-aware
unified pre-trained encoder-decoder models for code understanding
and generation,” in Proceedings of the 2021 Conference on Empirical
Methods in Natural Language Processing, 2021, pp. 8696–8708.
[63] X. Wang, Y. Wang, F. Mi, P. Zhou, Y. Wan, X. Liu, L. Li, H. Wu,
J. Liu, and X. Jiang, “Syncobert: Syntax-guided multi-modal contrastive
pre-training for code representation,” arXiv preprint arXiv:2108.04556,
2021.
[64] D. Guo, S. Lu, N. Duan, Y. Wang, M. Zhou, and J. Yin, “Unixcoder:
Unified cross-modal pre-training for code representation,” in Proceed-
ings of the 60th Annual Meeting of the Association for Computational
Linguistics (Volume 1: Long Papers), 2022, pp. 7212–7225.
[65] S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural
computation, vol. 9, no. 8, pp. 1735–1780, 1997.
[66] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez,
Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances
in Neural Information Processing Systems, vol. 30. Curran Associates,
Inc., 2017.
[67] E. S. Edgington, “Approximate randomization tests,” The Journal of
Psychology, vol. 72, no. 2, pp. 143–149, 1969.
[68] M. D. Ernst, “Natural language is a programming language: Applying
natural language processing to software development,” in 2nd Summit
on Advances in Programming Languages (SNAPL 2017). Schloss
Dagstuhl-Leibniz-Zentrum fuer Informatik, 2017.
[69] Z. Zeng, H. Tan, H. Zhang, J. Li, Y. Zhang, and L. Zhang, “An extensive
study on pre-trained models for program understanding and generation,”
in Proceedings of the 31st ACM SIGSOFT International Symposium on
Software Testing and Analysis, 2022, pp. 39–51.
[70] D. Wang, Z. Jia, S. Li, Y. Yu, Y. Xiong, W. Dong, and X. Liao,
“Bridging pre-trained models and downstream tasks for source code
understanding,” in Proceedings of the 44th International Conference on
Software Engineering, 2022, pp. 287–298.
[71] Z. Zhang, H. Zhang, B. Shen, and X. Gu, “Diet code is healthy:
Simplifying programs for pre-trained models of code,” in Proceedings
of the 30th ACM Joint European Software Engineering Conference
and Symposium on the Foundations of Software Engineering, 2022, pp.
1073–1084.
[72] J. Shi, Z. Yang, B. Xu, H. J. Kang, and D. Lo, “Compressing pre-
trained models of code into 3 mb,” in The 37th IEEE/ACM International
Conference on Automated Software Engineering, ASE 2022, 2022.