Lecture 20
Large Language Models
Roshan Sharma
Some slides borrowed from Danqi Chen, Chenyan Xiong and Graham Neubig – thanks!
Agenda
● Emergent Abilities and Scaling Effects
● What are LLMs?
● Modern LLM Architecture
● LLM Training Procedure
● LLM Inference – Prompting, In-Context Learning and Chain of Thought
● Evaluating LLMs
● Multimodal LLMs
Review: Language Models as Generalists
● Language models can perform not just a single task but many different tasks, simply by learning to predict the next token or sentence
Review: The LLM Era – Paradigm Shift in Machine Learning
● BERT (Oct 2018) – Representation
● GPT (Jun 2018) – Generation
GPT-2 – Generalizing to Unseen Tasks
● LMs can be used for different tasks by pre-training a “base” model and then fine-tuning for the task(s) of interest
● Practical Issues:
○ Too many copies of the model
○ Need for large-scale labeled data for fine-tuning
○ Each fine-tuned copy can do only a specific task
● Multi-task Training?
○ Data remains a challenge
○ Humans don’t need such large volumes of data to learn – can we do better?
GPT-2 – Task Specifications
● The primary shift is in the modeling assumption: from a single-task model to a general model
○ Single Task Model: P(output | input)
○ General Model: P(output | input, task)
Scaling (Kaplan et al., 2020)
● OpenAI study: Scaling Laws for Neural Language Models (Kaplan et al., 2020)
● Key Findings:
○ Performance depends strongly on scale, and weakly on the model shape
○ Larger models are more sample-efficient
○ Smooth power laws (y = ax^k) relate empirical performance (test loss) to N (number of parameters), D (dataset size), and C (compute)
Scaling Effects
● The effect of some hyperparameters and design choices on big LMs can be predicted before training – optimizer (Adam vs. SGD), model depth, LSTM vs. Transformer
● Idea:
○ Train a few smaller models
○ Establish a scaling law (e.g., an Adam vs. SGD scaling law)
○ Select the optimal hyperparameter based on the scaling-law prediction
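To make the “train a few smaller models, then extrapolate” recipe concrete, here is a minimal sketch (not from the lecture; the run results below are invented purely for illustration) of fitting a power law y = ax^k in log-log space and using it to predict the loss of a larger model:

```python
# A minimal sketch of the "train small, extrapolate" idea. All numbers are invented.
import numpy as np

# Hypothetical (parameter count N, validation loss L) pairs from a few small runs
N = np.array([1e7, 3e7, 1e8, 3e8])
L = np.array([4.20, 3.90, 3.62, 3.35])

# A power law L = a * N^k is a straight line in log-log space:
#   log L = log a + k * log N  ->  fit slope k and intercept log a.
k, log_a = np.polyfit(np.log(N), np.log(L), deg=1)

# Extrapolate the expected loss of a much larger (e.g. 10B-parameter) model
N_big = 1e10
L_pred = np.exp(log_a) * N_big ** k
print(f"fitted exponent k = {k:.3f}, predicted loss at 10B params = {L_pred:.2f}")
```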
Model Scaling: GPT-3
● 175B parameters! (GPT-2 was 1.5B)
● Source: https://bmk.sh/2020/05/29/GPT-3-A-Brief
Emergent Abilities with GPT-3 – Wei et al., 2022
● Emergent abilities:
○ Not present in smaller models but present in larger models
○ Do LLMs like GPT-3 have these?
● Findings:
○ GPT-3 trained on text can do arithmetic problems like addition and subtraction
○ Different abilities “emerge” at different scales
○ Model scale is not the only contributor to emergence – for 14 BIG-Bench tasks, LaMDA 137B and GPT-3 175B models perform at near-random, but PaLM 62B achieves above-random performance
○ Problems LLMs can’t solve today may be emergent for future LLMs
Large Language Models
● Language models that have many parameters (over 1B) and can perform
multiple tasks through prompting
● Decoder-only (GPT)
○ Pre-training: Auto-regressive Language Modeling
○ Stable training, faster convergence
○ Better generalization after pre-training
● Encoder-decoder (T0/T5)
○ Pre-training: Masked Span Prediction
○ Good for tasks like MT, summarization
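To ground the auto-regressive pre-training objective of decoder-only models, here is a minimal PyTorch sketch (my own illustration, not the lecture’s code; the function name and tensor shapes are assumptions): each position is trained to predict the next token under a causal decoder.

```python
# Minimal sketch of the auto-regressive next-token objective.
import torch
import torch.nn.functional as F

def next_token_loss(logits: torch.Tensor, tokens: torch.Tensor) -> torch.Tensor:
    """logits: (batch, seq_len, vocab) from a causal decoder; tokens: (batch, seq_len)."""
    pred = logits[:, :-1, :]      # predictions at positions 0..T-2
    target = tokens[:, 1:]        # the tokens they should predict (positions 1..T-1)
    return F.cross_entropy(pred.reshape(-1, pred.size(-1)), target.reshape(-1))
```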
T5/T0: Masked Span Prediction
● Masked span prediction involves:
○ Masking a continuous set of tokens (a span) in the input
○ Predicting the masked span from the decoder
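The sketch below (my own toy example; the sentinel naming follows T5’s convention, but the helper itself is hypothetical) shows how one masked span turns into an encoder input and a decoder target:

```python
# T5-style span corruption: a contiguous span is replaced by a sentinel token in
# the encoder input, and the decoder is trained to emit the span after that sentinel.
def corrupt_span(tokens, start, length, sentinel="<extra_id_0>"):
    """Mask tokens[start:start+length]; return (encoder_input, decoder_target)."""
    encoder_input = tokens[:start] + [sentinel] + tokens[start + length:]
    decoder_target = [sentinel] + tokens[start:start + length] + ["</s>"]
    return encoder_input, decoder_target

# Example: masking the span "sat on" in a toy sentence
enc, dec = corrupt_span(["The", "cat", "sat", "on", "the", "mat"], start=2, length=2)
# enc -> ['The', 'cat', '<extra_id_0>', 'the', 'mat']
# dec -> ['<extra_id_0>', 'sat', 'on', '</s>']
```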
Attention patterns (Wang et al.)
Poll 1
Which of the following is true about emergent abilities?
A. A language model with fewer parameters than 175B cannot have any
emergent abilities
B. They are found in large models but not in small models
C. Summarization is likely an emergent ability in a model pre-trained on a
summarization corpus
D. Emergent abilities arise only because of scaling.
Training of Decoder-only LLMs – Llama 2
1. Auto-regressive Pre-training - Train to predict the next token on very large-scale corpora (~3 trillion tokens)
2. Instruction Fine-tuning / Supervised Fine-tuning (SFT) - Fine-tune the pre-trained model on pairs of (instruction + input, output), first with a large dataset and then with a small, high-quality dataset
● Purpose
○ Pre-training makes good generalist auto-completers, but good SFT builds models that can do many unseen tasks
○ SFT can also guide the nature of outputs in terms of safety and helpfulness
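As a minimal sketch of SFT data preparation (the prompt template and helper below are my assumptions, not Llama 2’s actual pipeline): the whole sequence is fed to the decoder, but the loss is computed only on the response tokens, so the model learns to generate answers rather than prompts.

```python
IGNORE_INDEX = -100  # PyTorch's default ignore_index: these positions are skipped by cross-entropy

def build_sft_example(tokenizer, instruction: str, inp: str, output: str):
    # Format the (instruction + input) as a prompt, then append the target output
    prompt = f"### Instruction:\n{instruction}\n\n### Input:\n{inp}\n\n### Response:\n"
    prompt_ids = tokenizer.encode(prompt)
    response_ids = tokenizer.encode(output)
    input_ids = prompt_ids + response_ids
    labels = [IGNORE_INDEX] * len(prompt_ids) + response_ids  # mask out the prompt tokens
    return input_ids, labels
```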
Instruction Tuning (Wei et al., 2021)
Unsafe Outputs – Alignment Problem
● LLMs may produce
○ Harmful text – abusive or offensive language, bias and discrimination
○ Text that can cause direct harm – allowing easy access to dangerous information
● Can we simply enumerate and block harmful outputs? No! The set of possible harmful outputs is very large and cannot be exhaustively listed
Poll 2
Which of the following is a feature of Llama 2?
A. Swishy activations
B. Relativistic positional embeddings
C. Multi-query attention
D. Grouped-query attention
LLM Inference: Prompting
● Prompts
○ Tell the model what to do in natural language
○ For example: “Generate a textual summary of this paragraph:”
○ Can be as short or as long as required
● Prompt Engineering
○ The task of identifying the correct prompt needed to perform a task
○ General rule of thumb: be as specific and descriptive as possible
○ Can be manual or automatic (prefix-tuning, paraphrasing, etc.)
ChatGPT Prompt example
In-context learning / Few-shot prompting (Brown et al., 2020)
● Provide a few examples along with the instruction
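A small illustration of in-context learning (the format and examples are my own): a few labeled demonstrations are placed in the prompt before the query, and the model is asked to continue the pattern, with no gradient updates.

```python
examples = [
    ("The movie was fantastic!", "positive"),
    ("I want my money back.", "negative"),
]

def few_shot_prompt(examples, query):
    # Stack the demonstrations, then append the unanswered query
    shots = "\n\n".join(f"Review: {x}\nSentiment: {y}" for x, y in examples)
    return f"{shots}\n\nReview: {query}\nSentiment:"

print(few_shot_prompt(examples, "Two hours of my life I will never get back."))
```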
Chain of thought prompting (Wei et al., 2022)
● Get the model to work through the steps of the problem
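An illustrative chain-of-thought exemplar (wording adapted from a widely used example, not taken from the slides): the demonstration spells out intermediate reasoning steps so the model “shows its work” before giving the final answer to the new question.

```python
cot_prompt = """Q: Roger has 5 tennis balls. He buys 2 cans of 3 tennis balls each. \
How many tennis balls does he have now?
A: Roger started with 5 balls. 2 cans of 3 balls each is 6 balls. 5 + 6 = 11. \
The answer is 11.

Q: The cafeteria had 23 apples. They used 20 to make lunch and bought 6 more. \
How many apples do they have?
A:"""
```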
What to Pick?
(Trade-off: stronger task-specific performance vs. more convenient, more general, less data)
1. Full Fine-tuning (FT)
a. + Strongest performance
b. - Need curated and labeled dataset for each new task (typically 1k-100k+ examples)
c. - Poor generalization, spurious feature exploitation
2. Few-shot (FS)
a. + Much less task-specific data needed
b. + No spurious feature exploitation
c. - Challenging
3. One-shot (1S)
a. + “Most natural,” e.g. giving humans instructions
b. - Challenging
4. Zero-shot (0S)
a. + Most convenient
b. - Challenging, can be ambiguous
Note on Parameter Efficient Fine-tuning
● When we don’t have large enough data for SFT
○ Freeze the LM and keep some parameters trainable (which?)
○ Add an external adapter module to adapt model parameters to the task
○ Perform Low-rank Adaptation (LoRA)
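A minimal LoRA sketch (my own simplification, not a specific library’s API): the pre-trained weight W is frozen and only a low-rank update B·A is trained, so the effective weight becomes W + (alpha / r)·B·A with rank r much smaller than the layer width.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():          # freeze the pre-trained layer
            p.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
        self.scale = alpha / r

    def forward(self, x):
        # Frozen base output plus the scaled low-rank update
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)
```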
Poll 3
Which of the following describes in-context learning?
Modality Encoder
● A modality encoder (e.g., a speech encoder) maps non-text input into embeddings that are combined with the text embeddings produced by the text tokenizer.
Modeling data using Discrete Units
● Recently, discrete units have shown promising performance and benefits
Chang, Xuankai, et al. "Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study." arXiv preprint arXiv:2309.15800 (2023).
○ Storage
■ Audio features (HuBERT): 1024 dim * 32 bit (float)
■ Discrete unit (1000 / 2000-cluster): 12 bit
○ Sequence length (> 50% reduction)
■ De-duplication
■ Subword Modeling
○ Performance
■ Better than fbank features, close to (slightly below) continuous SSL features
○ Semantic features from SSL models were used for
■ ASR / ST / SLU
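A back-of-the-envelope check of the storage claim above, using the numbers as stated on the slide (1024-dim float32 HuBERT features vs. a 12-bit discrete unit per frame):

```python
continuous_bits = 1024 * 32      # 32,768 bits per frame of continuous SSL features
discrete_bits = 12               # one cluster index per frame, as stated on the slide
print(continuous_bits / discrete_bits)   # ~2731x less storage per frame
```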
Modeling data using Discrete Units
● SpeechLM: a single language model over a whole vocabulary that is the union of the text vocabulary (from the text tokenizer) and a speech vocabulary of discrete units (A1, A2, … from a quantizer), sharing one token embedding table
● Discrete representations
○ Extracted from self-supervised audio models like VQ-VAEs
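A rough sketch of the joint SpeechLM vocabulary (the token names here are hypothetical): the text tokenizer’s vocabulary is extended with one token per discrete speech unit, so a single embedding table covers both modalities.

```python
text_vocab = {"<pad>": 0, "<s>": 1, "</s>": 2, "hello": 3, "world": 4}  # toy text vocab
num_speech_units = 1000          # e.g. k-means clusters over SSL features

speech_vocab = {f"<unit_{i}>": len(text_vocab) + i for i in range(num_speech_units)}
whole_vocab = {**text_vocab, **speech_vocab}
# Quantized speech frames (A1, A2, ...) map through speech_vocab, text goes through
# the text tokenizer; both index the same token embedding matrix.
```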
Open Challenges - LLMs
● New Capabilities
○ Multimodal
○ Multi-lingual
○ More Complex Tasks
● Performance
○ Reduce Hallucinations
○ Improve Alignment with Human Preference
○ Increase Context Length Efficiently
○ Improve Data, Training Strategy, and Model Architecture
● Efficiency
○ Computational cost, time, and money
○ Compute architecture – GPU/ TPU/ HPU
Open Challenges - LLMs
● Safety
○ Reduce Harm
○ Improve Adversarial Robustness
○ Privacy Concerns
● Interpretability
○ Why do LLMs do what they do?
Summary
● LLMs are large-scale models that possess astounding abilities
● Scaling both data and model capacity is important for performance and
leads to the emergence of new abilities
● Decoder-only architectures are popular for convergence and performance
● LLMs are trained using pre-training, SFT, RLHF
● LLMs are evaluated using prompting strategies like ICL and CoT
● Multimodal LLMs can process audio, text, images and more.
Thank you!