
Code Language Models

Guest Lecture @ HKU

Yale NLP
Ansong Ni
Yale University
[email protected]

04/05/2024
Why Build Code Language Models
• Quick Poll
• GitHub Copilot
• OpenAI ChatGPT

1
Why Build Code Language Models
• How to automatically write programs is one of the oldest and hardest
problems in AI and CS:

This process of constructing instruction tables should be very fascinating. There need be no real
danger of it ever becoming a drudge, for any processes that are quite mechanical may be turned
over to the machine itself. — Alan Turing (1945)

2
Why Build Code Language Models
• They relate to several important areas in CS
• Programming Languages (PL)
• Software Engineering (SE)
• Machine Learning (ML)
• Natural Language Processing (NLP)
• Human-Computer Interaction (HCI)
• …

[Figure: Venn diagram of PL & SE, ML, NLP, and HCI, with Code AI at their intersection]

3
Why Build Code Language Models
• Code generation is a great testbed for intelligence:
• language understanding
• symbolic reasoning
• planning & search
• interactive learning
• …

[Figure: Code & Math as a testbed spanning language understanding, symbolic reasoning, planning & search, and interactive learning]

4
Why Build Code Language Models

• They empower many real-world applications:

• FlashFill in Excel
• AI-assisted programming
• Virtual assistants
• Robotics control
• Database query and visualization

Images from: https://developers.googleblog.com/2018/03/new-creative-ways-to-build-with-actions.html; https://support.microsoft.com/en-us/office/save-time-with-flash-fill-9159216a-75a0-4c11-82e6-8eca29cb3b89; https://github.com/features/copilot; https://code-as-policies.github.io/; https://www.tableau.com/blog/ask-data-simplifying-analytics-natural-language-98655

5
Before we start…

6
Preliminaries
• Assume basic knowledge of NLP and LLM terminology
• E.g., BERT, GPT, prompting, autoregressive, retrieval, etc.
• Mixing of terms
• Foundation Models ≈ LM ≈ LLM
• Code LM/LLM: language models that have seen code during training
• Code and Math LMs
• They are deeply connected:
• Both are formal languages;
• Both require symbolic reasoning
• This lecture mostly focuses on code LMs, but many methods apply to math LMs as well

7
Outline
• A brief history of code LMs
• Data collection, filtering and tokenization
• Training of code LLMs
• Decoder-only models and code infilling
• Encoder-only models;
• Encoder-decoder models;
• Reinforcement Learning
• Post-training methods for code LLMs
• Neuro-symbolic approaches
• Prompting methods for code
• Retrieval-augmented generation for code

8
A Brief History of LMs for Code

9
Key Events (2020-2021)
• Feb 2020: CodeBERT [1]
• First attempt -- 16 months after original BERT paper
• 125M parameters
• May 2020: GPT-3 [2]
• People find that GPT-3 has some coding abilities
• Though it is not specifically trained on code
• Jun 2021: GitHub Copilot
• Revolutionary performance
• Multi-line, whole function completion for the first time
• Jul 2021: Codex [3]
• First 10B+ model trained specifically for code
• Hero behind GitHub Copilot
[1] Feng et al. (2020), “CodeBERT: A Pre-Trained Model for Programming and Natural Languages.”
[2] Brown et al. (2020), “Language Models are Few-Shot Learners.”
10
[3] Chen et al. (2021), “Evaluating Large Language Models Trained on Code.”
Key Events (2022)
• Feb 2022: AlphaCode [1]
• Claims to rank within the top 54.3% in programming competitions with human participants
• Up to 41B parameters; model neither released nor publicly accessible
• Mar 2022: CodeGen [2]
• Open-source 10B+ code LM
• Later found to be severely under-trained (addressed in CodeGen2)
• Apr 2022: PaLM [3]
• PaLM-Coder is a 540B code model
• The models are also severely under-trained (addressed in PaLM 2)
• Nov 2022: The Stack [4]
• 3TB of permissively licensed code data
• Foundational data work for many code LMs in the future

[1] Li et al. (2022), “Competition-Level Code Generation with AlphaCode.”


[2] Nijkamp et al. (2022), “CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis.”
[3] Chowdhery et al. (2022), “PaLM: Scaling Language Modeling with Pathways.”
11
[4] Kocetkov et al. (2022), “The Stack: 3 TB of permissively licensed source code.”
Key Events (2023)
• Feb 2023: LLaMA [1]
• Trained with more data (1T tokens)
• Smaller, yet outperforms much larger models
• Mar 2023: GPT-4 [2]
• State-of-the-art in every aspect, coding included
• May 2023: StarCoder [3]
• SoTA in open-source, matches Codex-12B in performance
• Trained on the Stack
• Aug 2023: CodeLLaMA [4]
• Shortly after the release of LLaMA 2 in Jul 2023
• Continued training of LLaMA 2 on code
• Dec 2023: Gemini [5] and AlphaCode 2 [6]
• AlphaCode 2 scores 85th percentile on codeforces
[1] Touvron et al. (2023), “LLaMA: Open and Efficient Foundation Language Models.”
[2] OpenAI (2023), “GPT-4 Technical Report.”
[3] BigCode (2023), “StarCoder: May the source be with you!”
[4] Rozière et al. (2023), “Code Llama: Open Foundation Models for Code.”
12
[5] Gemini Team (2023), “Gemini: a family of highly capable multimodal models.”
[6] AlphaCode Team (2023), “AlphaCode 2 Technical Report.”
Entering 2024…
• Feb 2024: StarCoder 2 and Stack v2 [1]
• Add more data (notebooks, PRs, Code docs…)
• Improved performance (StarCoder2-15B rivals CodeLLaMA-34B)
• Mar 2024: Devin [2]
• Coding agent
• “First AI software engineer”

[1] Lozhkov et al. (2024), “StarCoder 2 and The Stack v2: The Next Generation.”
[2] Cognition AI (2024), https://www.cognition-labs.com/introducing-devin

13
Data Collection, Filtering and
Tokenization

14
Code Data Collection and Filtering
• Data Sources:
• Mostly GitHub and similar platforms;
• More recently:
• Kaggle Notebooks
• Software Documentation
• Commits, issues, pull requests
• Quality Filtering (take [1] as an example):
• GitHub stars >= 5
• 1% <= Comment-to-code ratio <= 80%
• License:
• Only permissively licensed open-source repos may be used;
• E.g., MIT, Apache 2.0

15
[1] Ben Allal et al. (2023), “SantaCoder: Don’t Reach for the Stars!”
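As an illustration of the filtering criteria above, here is a minimal file-level sketch. The record fields (stars, license, content) and the comment heuristic are assumptions for this example, not the actual SantaCoder pipeline.

PERMISSIVE_LICENSES = {"mit", "apache-2.0", "bsd-3-clause"}

def comment_to_code_ratio(content: str) -> float:
    """Rough Python-only heuristic: fraction of non-empty lines that are comments."""
    lines = [l.strip() for l in content.splitlines() if l.strip()]
    if not lines:
        return 0.0
    comments = sum(1 for l in lines if l.startswith("#"))
    return comments / len(lines)

def keep_file(record: dict) -> bool:
    # Drop repos with too few stars or a non-permissive license,
    # and files whose comment-to-code ratio is outside [1%, 80%].
    if record["stars"] < 5:
        return False
    if record["license"].lower() not in PERMISSIVE_LICENSES:
        return False
    return 0.01 <= comment_to_code_ratio(record["content"]) <= 0.80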
Deduplication and De-contamination
• Deduplication:
• Remove (near-)duplicated files from the training data;
• Why: repeated training data can significantly hurt the performance [1]
• Decontamination:
• Remove the files that contain solutions to benchmarks used for evaluation;
• Why: better measure generalization ability of trained LMs
• Methods:
• Exact match
• Near-deduplication

[1] Hernandez et al. (2022), “Scaling laws and interpretability of learning from repeated data.”
16
[2] Ben Allal et al. (2023), “SantaCoder: Don’t Reach for the Stars!”
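A minimal sketch of exact-match deduplication and substring-based decontamination, assuming training files and benchmark solutions are available as plain strings; real pipelines add MinHash-style near-deduplication on top of this.

import hashlib

def normalize(code: str) -> str:
    # Strip whitespace-only differences so trivially re-formatted copies collide.
    return "\n".join(line.strip() for line in code.splitlines() if line.strip())

def exact_dedup(files: list[str]) -> list[str]:
    seen, kept = set(), []
    for code in files:
        digest = hashlib.sha256(normalize(code).encode()).hexdigest()
        if digest not in seen:
            seen.add(digest)
            kept.append(code)
    return kept

def decontaminate(files: list[str], benchmark_solutions: list[str]) -> list[str]:
    # Drop any training file whose normalized content contains a benchmark solution.
    solutions = [normalize(s) for s in benchmark_solutions]
    return [f for f in files if not any(s in normalize(f) for s in solutions)]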
Tokenization for Code LM (1)
• Tokenization for LMs

• Tokenization is a big deal for coding tasks

17
Tokenization for Code LM (2)
• Tokenization is a big deal for coding tasks
• Code is in some ways very similar to, and in others very different from, natural language:
• Similar: semantic meaning of variable/function/class names
• E.g., ”is_correct”, “AttentionLayer”, “compute_perplexity”
• Different: Whitespace characters, punctuation, indentations
• E.g., “df.shape[1]”, “def f(x):\n\tif x>0:\n\t\treturn x\n\telse:\n\t\treturn x+1”
• Trade-off between:
• Vocabulary size
• # tokens needed to encode the same sequence
• Generalization ability for different tasks

18
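To see the trade-off concretely, here is a small sketch that tokenizes the indented snippet above with two BPE vocabularies using the tiktoken library (assuming it is installed); it is an illustration, not the tokenizer of any particular code LM. Newer vocabularies with dedicated whitespace/indentation merges typically need noticeably fewer tokens for the same snippet.

import tiktoken  # pip install tiktoken

snippet = "def f(x):\n\tif x > 0:\n\t\treturn x\n\telse:\n\t\treturn x + 1"

for name in ("gpt2", "cl100k_base"):  # older vs. newer BPE vocabulary
    enc = tiktoken.get_encoding(name)
    tokens = enc.encode(snippet)
    print(f"{name}: {len(tokens)} tokens -> {[enc.decode([t]) for t in tokens]}")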
Tokenization for Code LM (3)
• Trade-off between:
• Vocabulary size
• # tokens needed to encode the same sequence
• Generalization ability for different tasks → downstream performance

[1] Chirkova and Troshin (2023), “CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code.”
19
Training of Code LLMs

20
Decoder-only (GPT) Models
• Model architecture and pretraining objectives:
• Mostly follow those of general-purpose LLMs, e.g., Codex follows the GPT-3
• Multi-stage training:
• Some models are based on a general-purpose LM
• E.g., [1] CodeGen-NL → CodeGen-Multi → CodeGen-Mono
• E.g., [2] LLaMA 2 → CodeLLaMA

[1] Nijkamp et al. (2023), “CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis.”
21
[2] Rozière et al. (2023), “Code Llama: Open Foundation Models for Code.”
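For reference, a minimal PyTorch sketch of the standard autoregressive (next-token prediction) objective that these decoder-only models are trained with; the tensor shapes are assumptions for the example.

import torch
import torch.nn.functional as F

def causal_lm_loss(logits: torch.Tensor, input_ids: torch.Tensor) -> torch.Tensor:
    """Next-token prediction loss.
    logits: (batch, seq_len, vocab); input_ids: (batch, seq_len)."""
    # Predict token t+1 from positions <= t: shift logits left, labels right.
    shift_logits = logits[:, :-1, :].contiguous()
    shift_labels = input_ids[:, 1:].contiguous()
    return F.cross_entropy(
        shift_logits.view(-1, shift_logits.size(-1)),
        shift_labels.view(-1),
    )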
Code Infilling: Fill in the middle
• Infilling task:
• <prefix>, <suffix> à <middle>
• Trained via data augmentation [1] (see the sketch below):
• Preprocessing:
• Split each document into <prefix>, <middle>, <suffix>
• Insert special sentinel tokens (denoted <IF> here)
• Rearrange as <prefix>, <IF>, <suffix>, <IF>, <middle>
• Mixing the transformed data with the original data
• Training with the normal autoregressive objective
A use case of infilling [2]

[1] Bavarian et al. (2022), “Efficient Training of Language Models to Fill in the Middle.”
22
[2] Fried et al. (2022), “InCoder: A Generative Model for Code Infilling and Synthesis.”
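A minimal sketch of the fill-in-the-middle data transformation described above, using a single "<IF>" sentinel string as on the slide; the actual recipe in [1] uses dedicated special tokens and mixes transformed with untransformed documents.

import random

IF = "<IF>"  # stand-in for the special sentinel token on the slide

def to_fim(document: str, rng: random.Random) -> str:
    """Split a document into (prefix, middle, suffix) at two random points and
    rearrange it as  prefix <IF> suffix <IF> middle  for infilling training."""
    i, j = sorted(rng.sample(range(len(document) + 1), 2))
    prefix, middle, suffix = document[:i], document[i:j], document[j:]
    # The model is still trained with the ordinary left-to-right objective;
    # generating the middle now conditions on both the prefix and the suffix.
    return f"{prefix}{IF}{suffix}{IF}{middle}"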
Encoder (BERT) Models for Code (1)

• Aka code representation learning


• Code is multi-modal, and the other modalities can usually be obtained automatically
• These other modalities may better capture the semantics of code

[1] Wang et al. (2022), “CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training.”
23
Encoder (BERT) Models for Code (2)

• Code is multi-modal
• Natural language;
• Surface form;
• Control flow graph;
• Abstract-syntax-tree (AST);
• Data flow graph;
• Dependency graph;
• Compiled machine code;
• …
Using Data Flow Graph

• General idea: jointly encode other modalities with surface form


[1] Guo et al. (2021), “GraphCodeBert: Pre-training Code Representations with Data FLow.”
24
Encoder-Decoder (BART/T5) Models for Code
• A mixture of classification and generation tasks for code is typically used during pretraining
• Researchers get very creative in proposing new pretraining tasks
• E.g., CodeT5 [1]

[1] Wang et al. (2021), “CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation.”
25
Reinforcement Learning (1)
• Code generation is a natural task for RL, as feedback can be obtained automatically from the computer (see the reward sketch below):
• Pass/fail a parser;
• Pass/fail compilation;
• With/without runtime error;
• Pass/fail test cases
Rewards used for CodeRL
• Examples:
• CodeRL [1] (offline actor-critic)
• RLTF [2] (online w/ feedback from compiler)

[1] Le et al. (2022), “CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning.”
26
[2] Liu et al. (2023), “RLTF: Reinforcement Learning from Unit Test Feedback.”
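A sketch of how execution feedback can be turned into a scalar reward, roughly following the tiers above; the exact values and the "python" subprocess setup are illustrative assumptions, not the reward design used by CodeRL or RLTF.

import ast
import subprocess
import tempfile

def execution_reward(program: str, test_code: str, timeout: float = 5.0) -> float:
    """Illustrative tiered reward: parse error < runtime error < failed test < pass."""
    try:
        ast.parse(program)                       # pass/fail the parser
    except SyntaxError:
        return -1.0
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(program + "\n" + test_code)      # tests assert on the program
        path = f.name
    try:
        result = subprocess.run(["python", path], capture_output=True, timeout=timeout)
    except subprocess.TimeoutExpired:
        return -0.6                              # treated like a runtime failure
    if result.returncode != 0:
        # crude distinction between a genuine runtime error and a failing test
        return -0.3 if b"AssertionError" in result.stderr else -0.6
    return 1.0                                   # all tests passed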
Reinforcement Learning (2)
• Benefits of using RL:
• Not limited to learning from a single solution from the dataset;
• Removes the dependency on annotated solutions;
• Able to directly incorporate fine-grained preferences as reward function;
• Limitations:
• Insufficient test cases may lead to false positives [1]
• Rewards are typically sparse and underspecified [2];
• Especially if we start with a weaker model
• It usually involves exploration (sampling) with LMs, which is expensive

[1] Smith et al. (2015), “Is the Cure Worse Than the Disease? Overfitting in Automated Program Repair.”
27
[2] Agarwal et al. (2019), “Learning to Generalize from Sparse and Underspecified Rewards.”
Post-Training Methods for Code LLMs

28
Neuro-Symbolic Approaches (1): Incorporating Code Execution

• In addition to providing an RL learning signal at training time, execution information can also help improve models at test time
• Methods:
• Sampling + filtering (Codex [1]); see the sketch below
• Sample solutions, then filter out those that fail a small subset of test cases

Codex-12B on APPS. Filtered pass@k is significantly better


[1] Chen et al. (2021), “Evaluating Large Language Models Trained on Code.”
29
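A self-contained sketch of the filtering step: given sampled candidate programs (hard-coded here instead of sampled from an LM), keep only those that pass every visible test.

def filter_by_tests(candidates: list[str], visible_tests: list[str]) -> list[str]:
    """Keep only sampled programs that pass all visible tests."""
    kept = []
    for program in candidates:
        namespace: dict = {}
        try:
            exec(program, namespace)            # define the candidate solution
            for test in visible_tests:
                exec(test, namespace)           # e.g. "assert add(1, 2) == 3"
            kept.append(program)
        except Exception:
            continue                            # syntax/runtime error or failed test
    return kept

# usage sketch
candidates = ["def add(a, b): return a + b", "def add(a, b): return a - b"]
print(filter_by_tests(candidates, ["assert add(1, 2) == 3"]))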
Neuro-Symbolic Approaches (1): Incorporating Code Execution

• Methods:
• Sampling + filtering (Codex [1])
• Sampling + filtering + clustering (AlphaCode [2])
• Sample a large number of diverse program candidates (e.g., up to 1M)
• Filter using open test cases
• Diversify the picked candidates by clustering and selecting representatives from different clusters

[1] Chen et al. (2021), “Evaluating Large Language Models Trained on Code.”
[2] Li et al. (2022), “Competition-Level Code Generation with AlphaCode.”
30
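A sketch of clustering candidates by execution behavior in the spirit of AlphaCode, assuming each candidate defines a hypothetical solve(x) function and that some test inputs are available; programs with identical outputs land in the same cluster.

from collections import defaultdict

def cluster_by_behavior(candidates: list[str], test_inputs: list[int]) -> list[list[str]]:
    """Group programs that produce identical outputs on the same test inputs."""
    clusters: dict[tuple, list[str]] = defaultdict(list)
    for program in candidates:
        namespace: dict = {}
        try:
            exec(program, namespace)
            signature = tuple(namespace["solve"](x) for x in test_inputs)
        except Exception:
            signature = ("<error>",)
        clusters[signature].append(program)
    # Largest clusters first; submit one representative per cluster.
    return sorted(clusters.values(), key=len, reverse=True)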
Neuro-Symbolic Approaches (1): Incorporating Code Execution

• Methods:
• Sampling + filtering (codex [1])
• Sampling + filtering + clustering (AlphaCode [2])
• Sampling + verification + voting (LEVER [3])
• Train a verifier to verify the program with its execution results
• Aggregate the probability from programs that reach the same execution results

[1] Chen et al. (2021), “Evaluating Large Language Models Trained on Code.”
[2] Li et al. (2022), “Competition-Level Code Generation with AlphaCode.”
31
[3] Ni et al. (2023), “LEVER: Learning to Verify Language-to-Code Generation using Execution.”
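A simplified sketch of execution-based voting: programs that reach the same execution result pool their scores, and a program from the top-scoring result is returned. The scores here stand in for (LM probability × verifier probability); this is not the full LEVER pipeline.

from collections import defaultdict

def rerank_by_execution(candidates: list[tuple[str, float, str]]) -> str:
    """candidates: (program, score, execution_result) triples."""
    pooled: dict[str, float] = defaultdict(float)
    best_program: dict[str, str] = {}
    for program, score, result in candidates:
        pooled[result] += score                  # aggregate over same execution result
        if result not in best_program:
            best_program[result] = program
    top_result = max(pooled, key=pooled.get)
    return best_program[top_result]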
Neuro-Symbolic Approaches (2): Constrained Decoding

• How did code completion work before LLMs?


• Remember: programs are written in formal languages, which means they are governed by a strict grammar;
• Completion Engine (CE): tells you the valid next tokens w/ static analysis 👇
• Sounds a lot like a language model, right?
• But it is a symbolic process
• Combining the LM with a CE [1] (see the sketch below):
• Filter out next tokens from the LM that are not approved by the CE
• Best of both worlds!

[1] Poesia et al. (2022), “Synchromesh: Reliable code generation from pre-trained language models.”
32
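A minimal sketch of the filtering step, assuming the completion engine has already produced the set of valid next-token ids for the current prefix (obtaining that set is the symbolic part and is not shown here).

import torch

def constrained_next_token(logits: torch.Tensor, valid_token_ids: set[int]) -> int:
    """Mask out next tokens the completion engine rejects, then pick the best.
    logits: 1-D tensor over the vocabulary for the current decoding step."""
    mask = torch.full_like(logits, float("-inf"))
    mask[torch.tensor(sorted(valid_token_ids))] = 0.0   # allowed tokens keep their score
    return int(torch.argmax(logits + mask))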
Neuro-Symbolic Approaches (3): Planning and Search

• Programs are compositional by design


• Human programmers typically decompose the problem into smaller parts and write functions to solve each of them → Planning + Implementation
• Given the components (e.g., individual functions), we can use a solver to find out whether they are sufficient to complete the task → Search
• Example 1: Parsel [1]

[1] Zelikman et al. (2022), “Parsel : Algorithmic Reasoning with Language Models by Composing Decomposition.”
33
Neuro-Symbolic Approaches (3): Planning and Search

• Programs are compositional by design


• Human programmers typically decompose the problem into smaller parts and write functions to solve each of them → Planning + Implementation
• Given the components (e.g., individual functions), we can use a solver to find out whether they are sufficient to complete the task → Search
• Example 2: SatLM [1]

[1] Ye et al. (2023), “SatLM: Satisfiability-Aided Language Models Using Declarative Prompting.”
34
Prompting Methods using Code for LLMs
• Chain-of-thought (CoT) prompting [1]
• Explicitly write the reasoning process as natural
language
• Program-of-thought (PoT) prompting [2] and
Program-aided LM (PAL) [3]
• Explicitly write the reasoning process as a program
• Use program execution to obtain the final answer
• Works well with math and other symbolic
reasoning tasks
• Also closely related to tool-use of LLMs
[1] Wei et al. (2022), “Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.”
[2] Chen et al. (2022), “Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks.”
35
[3] Gao et al. (2022), “PAL: Program-aided Language Models.”
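A minimal program-of-thought sketch: the "model output" is hard-coded for illustration (in practice it would be generated by a code LLM), and the final answer is read off a variable after execution.

# Program-of-thought: run the model's reasoning as a program instead of parsing text.
model_output = """
price = 12.50
quantity = 4
discount = 0.2
answer = price * quantity * (1 - discount)
"""

namespace: dict = {}
exec(model_output, namespace)        # execute the generated reasoning program
print(namespace["answer"])           # 40.0 -- the final answer is a variable, not free text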
Retrieval Augmented Generation for Code
• Retrieval-augmented generation (RAG)
• Retrieves relevant pieces of information from some knowledge base and includes them in the prompt
• When programmers code, they look at:
• The current file (e.g., defined variables, functions, classes)
• Documentation of external libraries → “DocPrompting” [1]
• Definitions of imported functions and classes → “Repo-level Prompt Generator” [2]
• GitHub, StackOverflow, GeeksforGeeks… → “REDCODER” [3]

• We should give such information to the LLMs as well! (See the retrieval sketch below.)

[1] Zhou et al. (2022), “DocPrompting: Generating Code by Retrieving the Docs.”
[2] Shrivastava et al. (2023), “Repository-Level Prompt Generation for Large Language Models of Code.”
[3] Parvez et al. (2021), “Retrieval Augmented Code Generation and Summarization.”
36
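A toy sketch of the retrieve-then-prompt idea, scoring a couple of made-up documentation strings by word overlap with the query; real systems retrieve from large corpora with BM25 or dense embeddings instead.

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Toy lexical retriever: rank docs by word overlap with the query."""
    q = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(q & set(d.lower().split())), reverse=True)
    return scored[:k]

def build_prompt(query: str, docs: list[str]) -> str:
    # Prepend the retrieved snippets as context, then state the task.
    context = "\n".join(f"# Doc: {d}" for d in retrieve(query, docs))
    return f"{context}\n# Task: {query}\n"

docs = [
    "pandas.DataFrame.groupby(by): group rows by one or more columns",
    "matplotlib.pyplot.plot(x, y): plot y versus x as lines",
]
print(build_prompt("group a pandas dataframe by the 'year' column", docs))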
Summary
• A brief history of code LMs
• Data collection, filtering and tokenization
• Training of code LLMs
• Decoder-only models and code infilling
• Encoder-only models;
• Encoder-decoder models;
• Reinforcement Learning
• Post-training methods for code LLMs
• Neuro-symbolic approaches
• Prompting methods for code
• Retrieval-augmented generation for code

37
Extended Readings
• Interdisciplinary applications
• Code as Policies: Language Model Programs for Embodied Control (2023)
• Large Language Models for Compiler Optimization (2023)
• Self-Improvement with code LLMs
• STaR: Bootstrapping Reasoning With Reasoning (2022)
• CodeT: Code Generation with Generated Tests (2022)
• Teaching Large Language Models to Self-Debug (2023)
• DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines (2023)
• More ways to learn a code LLM
• Show Your Work: Scratchpads for Intermediate Computation with Language Models (2021)
• Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions (2022)

38
Hope you enjoyed the lecture!

Questions?

39
