
Guide to Top 7 LLM Generation Parameters

Dipanjan (DJ)
Max Tokens

The max_tokens parameter controls the length of the output generated by the model
A “token” can be as short as one character or as long as one word
By setting an appropriate max_tokens value, you can control whether the response is a quick snippet or an in-depth explanation
The max_tokens parameter is now deprecated in favor of max_completion_tokens in the OpenAI API

Source: 7 LLM Parameters to Enhance Model Performance - Analytics Vidhya
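
Below is a minimal sketch of capping output length, assuming the OpenAI Python SDK (v1+) with an API key in the environment; the model name is just an illustrative choice:

```python
# Minimal sketch: limit reply length via max_completion_tokens (assumed OpenAI SDK v1+ interface)
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model choice
    messages=[{"role": "user", "content": "Explain what a token is."}],
    max_completion_tokens=50,  # cap the reply at roughly 50 tokens (successor to max_tokens)
)
print(response.choices[0].message.content)
```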


Temperature

The temperature parameter influences how random and creative the model’s responses are; it is essentially a measure of how deterministic the output should be:
Low Temperature (e.g., 0.1): The model will produce more focused and predictable responses.
High Temperature (e.g., 0.9): The model will produce more creative, varied, or even “wild” responses.
Use low temperatures for tasks like generating technical answers, where precision matters, and higher temperatures for creative content generation tasks
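A minimal sketch contrasting low and high temperature, again assuming the OpenAI Python SDK and an illustrative model name:

```python
# Minimal sketch: same prompt at two temperature settings (assumed OpenAI SDK v1+ interface)
from openai import OpenAI

client = OpenAI()
prompt = [{"role": "user", "content": "Write a tagline for a coffee shop."}]

# Low temperature: focused, predictable wording
precise = client.chat.completions.create(model="gpt-4o-mini", messages=prompt, temperature=0.1)
# High temperature: more varied, creative wording
creative = client.chat.completions.create(model="gpt-4o-mini", messages=prompt, temperature=0.9)

print(precise.choices[0].message.content)
print(creative.choices[0].message.content)
```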
Top-p - Nucleus Sampling

The top_p parameter, also known as nucleus sampling, helps control the diversity of responses
It sets a threshold on the cumulative probability of the candidate next tokens:
Low Value (e.g., 0.1): The model only considers the smallest set of next tokens whose probabilities sum to 10%, limiting variation.
High Value (e.g., 0.9): The model considers a wider range of next tokens (whose probabilities sum to 90%), increasing variability

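A minimal sketch of nucleus sampling, under the same SDK and model-name assumptions:

```python
# Minimal sketch: restrict sampling to the top 90% probability mass (assumed OpenAI SDK v1+ interface)
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Suggest a name for a pet robot."}],
    top_p=0.9,  # sample only from the smallest set of tokens whose probabilities sum to 90%
)
print(response.choices[0].message.content)
```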


Top-k - Token Sampling

The top_k parameter limits the model to considering only the k most probable next tokens when generating the next word
Low Value (e.g., 10): Limits the model to more predictable and constrained responses
High Value (e.g., 100): Allows the model to consider a larger number of tokens, increasing the variety of responses
The top_k parameter isn’t directly available in the OpenAI API but is exposed in other frameworks like Hugging Face transformers

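Since top_k isn’t exposed in the OpenAI API, here is a minimal sketch using Hugging Face transformers with a small illustrative model (gpt2):

```python
# Minimal sketch: top-k sampling with Hugging Face transformers (assumes transformers is installed)
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # small illustrative model
output = generator(
    "Once upon a time",
    do_sample=True,   # enable sampling so top_k takes effect
    top_k=50,         # consider only the 50 most probable next tokens at each step
    max_new_tokens=40,
)
print(output[0]["generated_text"])
```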


Frequency Penalty

The frequency_penalty parameter discourages the model from repeating previously used words. It reduces the probability of tokens in proportion to how often they have already appeared in the output
Low Value (e.g., 0.0): The model won’t penalize repetition
High Value (e.g., 2.0): The model will heavily penalize repeated words, encouraging the generation of new content
This is useful when you want the model to avoid repetitive outputs, for example in creative writing, where redundancy can diminish quality

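A minimal sketch applying a frequency penalty, under the same SDK and model-name assumptions:

```python
# Minimal sketch: discourage repeated wording with frequency_penalty (assumed OpenAI SDK v1+ interface)
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Write a short poem about the sea."}],
    frequency_penalty=1.5,  # penalize tokens in proportion to how often they have already appeared
)
print(response.choices[0].message.content)
```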


Presence Penalty

The presence_penalty parameter is similar to the frequency penalty, but instead of penalizing based on how often a word is used, it penalizes based on whether a word has appeared at all in the response so far
Low Value (e.g., 0.0): The model won’t penalize reusing words
High Value (e.g., 2.0): The model will tend to avoid any word that has already appeared
Presence penalty helps encourage more diverse content generation

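A minimal sketch applying a presence penalty, under the same SDK and model-name assumptions:

```python
# Minimal sketch: nudge the model toward new words and topics with presence_penalty
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Brainstorm ideas for a weekend trip."}],
    presence_penalty=1.5,  # penalize any token that has appeared at least once, regardless of count
)
print(response.choices[0].message.content)
```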


Stop Sequence

The stop parameter lets you define one or more sequences of characters or words that signal the model to stop generating further content
This allows you to cleanly end the generation at a specific point.
Example Stop Sequences: periods (.), newlines (\n), or specific phrases like “The end”.
This is especially useful when fine-tuning, where you train the model to generate content up to a specific special token

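A minimal sketch using a stop sequence, under the same SDK and model-name assumptions:

```python
# Minimal sketch: halt generation at a chosen marker with the stop parameter
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Write a one-line story and finish with 'The end'."}],
    stop=["The end"],  # generation stops as soon as this sequence would be produced (it is not included)
)
print(response.choices[0].message.content)
```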


Hands-on Guide

Check out the HANDS-ON GUIDE here
