0% found this document useful (0 votes)

17 views32 pages

Session 1

The document outlines a session on Generative AI, focusing on Large Language Models (LLMs) and their applications, cost structures, and prompt engineering. It discusses the evolution of LLMs, their capabilities, and the challenges they face, including data privacy and explainability issues. Additionally, it provides insights into accessing LLMs through various platforms and compares features and costs among different providers.

Uploaded by

siva512reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views32 pages

Session 1

Uploaded by

siva512reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

bhumareddy@deloitte.

com
[email protected] Session 1

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
In this session, we will discuss:
• Introduction to Generative AI
• Overview of the LLM Ecosystem
[email protected] Agenda • Understanding LLMs and their Cost Structures
[email protected]
• A Workflow for Enterprise LLM Applications
• Prompt Engineering Fundamentals
• Hands-on Implementation of Prompt Engineering
Techniques

This file is meant for personal use by [email protected] only.

Ecosystem
[email protected]

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Generative AI - An Introduction
Design and implementation of computer programs that can reason,
Artificial Intelligence learn and act in complex environments.

Superset of

Design of algorithms that enable computer systems to

Machine Learning efficiently learn from data.

[email protected] Superset of
[email protected]
A subset of ML that uses neural network algorithms for
Deep Learning predictive applications.

Superset of

Algorithms that generate original content, such as images,

Generative AI text, voice, or music, by learning patterns from a dataset.

This file is meant for personal use by [email protected] only.

Generative AI

Large Language Large Multimodal

Models (LLMs) Models (LMMs)
[email protected]
[email protected]
Trained using language Trained using a mixture of
modeling, that is, predicting images, video and text.
the next word in a sentence. LMMs demonstrate nuanced
LLMs have excellent understanding of multimodal
reasoning capabilities and inputs and can be used for
can be used for several several mixed-input tasks
natural language tasks (e.g., (e.g., video summarization)
summarization).

This file is meant for personal use by [email protected] only.

Applications

Classification
Question-Answering
Promp
[email protected] t LLM Summarization
[email protected]
Code Generation
Generative AI Creatives Generation
Resource

This file is meant for personal use by [email protected] only.

Question

[email protected]
Document
[email protected] Dennis Walsh, Goldman Sachs Asset Management
Store

Answer

Generative AI is reducing the human effort

required in synthesizing information from
documents

Ernst & Young

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
LLM Applications - Shortcomings
● Vulnerability to injection attacks
● Data privacy concerns
● Intellectual property concerns on LLM outputs
● Lack of explainability

[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Accessing Large Language Models (LLMs)
LLMs (both paid and open source) can be accessed either through public cloud providers or LLM
vendors.

Google: Gemini
Open AI: GPT-3.5, GPT-4
Paid Anthropic: Claude
Cohere: Command

[email protected]
[email protected]
LLMs

Open Meta AI: Llama

Source Mistral AI: Mistral, Mixtral

This file is meant for personal use by [email protected] only.

Efficient
Hugging Face
implementations of
Access LLama Cpp
open models for
Unsloth
specific target devices.
[email protected]
[email protected]
LLMs
Open Source
Open Weight
Efficient inference
vLLM abstractions
Servers Ollama compatible with Open
AI APIs

This file is meant for personal use by [email protected] only.

Feature Azure AWS GCP

Exclusive model access GPT Claude Gemini Pro

(Open AI) (Anthropic) (Google)

Application ecosystem integration (e.g., High Medium Low

LangChain,
[email protected])
[email protected]

Stability & reliability of APIs High Medium High

Customizability for production systems High Medium High

This file is meant for personal use by [email protected] only.

Cost comparison per 1000 tokens

Model Type Azure AWS GCP

Input Output Input Output Input Output

Value-for-money,
production grade (e.g.,
[email protected]
[email protected] $0.0005 $0.0015 $0.0008 $0.0024 $0.000125 $0.000375
GPT3.5, Claude Instant,
Gemini 1.5 Flash)

Advanced Reasoning
Models (e.g., GPT4, $0.01 $0.03 $0.015 $0.075 $0.00125 $0.00375
Claude, Gemini 1.5 Pro)

Fine-tuned LLM variants $0.0005 $0.0015

Note* - The cost of OpenAI services is based on the number of tokens.

Prompt Tokens Completion Tokens

× Cost/1k Tokens × Cost/1k Tokens

[email protected]
[email protected]
Total Token
Cost
∕ Total Requests

Avg Cost/Prediction
Monthly
Deployment Cost
# Daily Predictions ×

30 Days
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
LLMs for Enterprise Applications
[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
What are Large Language Models (LLMs)?
LLMs are trained using language modeling, that is, predicting the next word in a sequence.
They do so by assigning probabilities to a fixed vocabulary.
Vocabulary

positive p = .03

[email protected]
negative p = .00001
The movie is a visually stunning, action-packed, and
[email protected]
emotionally resonant thrill ride that will leave you on the
edge of the seat from the beginning to end. Overall, the
experience was magical.
…

magical p = .83

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
What are Large Language Models (LLMs)?
LLMs are trained using language modeling, that is, predicting the next word in a sequence.
They do so by assigning probabilities to a fixed vocabulary.
Vocabulary

positive p = .03

magical p = .83

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
What are Large Language Models (LLMs)?
LLMs are trained using language modeling, that is, predicting the next word in a sequence.
They do so by assigning probabilities to a fixed vocabulary.
Vocabulary

positive p = .03

magical p = .83

This file is meant for personal use by [email protected] only.

Output, word-by-word
positive
The movie was awesome. Overall, the experience was positive.
[email protected]
negative
…
The movie was awesome. Overall, the experience was positive.
[email protected]

The movie was awesome. Overall, the experience was positive. movie

The movie was awesome. Overall, the experience was positive. magical
The movie was awesome. Overall, the experience was positive.
Vocabulary
The movie was awesome. Overall, the experience was positive.

⁞ This file is meant for personal use by [email protected] only.

GPT (117M parameters) InstructGPT

First model to be trained in Instruction-tuned models
a “generative” mode by understand human inputs as
masking portions of input instructions; path to ChatGPT
text from left-to-right is paved
[email protected]
[email protected]

2019 2020 2023

2018 2022
GPT-2 (1.5B parameters) GPT-3 (175B parameters) GPT-4 (1T parameters?)
The era of prompting begins. Large scale foundation models are Era of completely closed
Models are relatively small, born. Prompting is shown to models begins; API
open-source and fine-tuning induce
This file is meant for personal userobust performance on
by [email protected] only. access only
is possible natural in
Sharing or publishing the contents language
part or fulltasks
is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Business Problems Solved by LLMs - A Taxonomy

Text Sentence

Fixed set of
classes Label Positive
Context + Context +
[email protected] Classification Text Question
[email protected]
(Text) → (Label)

Conversatio Contextualized Text Answer

Text
n
Augmented Generation
(Context + Text) → (Text)
Open-ended Text Summary

Generation
This file is meant for personal use by [email protected] only.
(Text) → (Text)
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
A Workflow for Enterprise LLM Applications
Container Approvals,
Dev Workspace
Registry Guardrails

Data Sources

Package prompts for

Build and manage prompts deployment Deployment
Data Lake Environment

[email protected]
[email protected]
Staging Production
Database environment environment

Logging & Monitoring

Event Logs
Infrastructure &
resource metrics

Collect live data & annotations

Performance metrics
(Accuracy, F1 score,
ThisRatings)
BERTScore, LLM file is meant
for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Prompt Engineering Fundamentals
Designing specific instructions that enable LLMs to accomplish business tasks

[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

Copy of base model +

Model dedicated hosting + rate limit Deployment
(tokens per minute)
[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Understanding Tokens
A token refers to a segment or piece of text, such as a word, punctuation, or other
meaningful element, into which input text is divided for processing by the model.

The movie was awesome.

[The,movie,was,awesome,.,Overall,,,the
Overall, the experience
,experience,was,positive,.]
was positive.

[email protected]
[email protected]
Tokenize LLM Model
r

Parses sentences Predicts the next

as tokens token

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Accessing LLMs using Azure Open AI APIs
The Azure Open AI Playground enables iterative development for prompt engineering, that is,
designing specific instructions for LLMs to accomplish a task.

[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Accessing LLMs using Azure Open AI APIs
The Azure Open AI Playground enables iterative development for prompt engineering, that is,
designing specific instructions for LLMs to accomplish a task.

[email protected] Azure Open

[email protected] AI API

Prompt
API Key
format
Playground enables quick iterations on
prompts. Once an effective prompt is
discovered, it is translated to code for efficient
API access

This file is meant for personal use by [email protected] only.

System Message
Clear instructions explaining the task that the LLM should accomplish.
These instructions are agnostic to the user input and appended to the
user input with higher priority. Can also be used to prime LLM behavior

[email protected]
User Message
LLM API Prediction
[email protected]

Specific instructions from the user describing the task that needs to
be accomplished.

Assistant Message
Not needed for single-turn conversations. Can be used to showcase
expected completions in multi-turn conversations.

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Prompt Engineering Fundamentals
Prompt = Specific set of instructions sent to a LLM to accomplish a task
Engineering = Iteratively deriving a specific prompt for the task

Length of output Max output length

Prompt
More temperature = More
[email protected]
[email protected] Temperature Parameters Structure
randomness in response

More Top P = More

tokens selected for Top P
completion

More FP = Less chance of Frequency

tokens repeating Penalty (FP)

This file is meant for personal use by [email protected] only.

[Playground Demo]
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Prompt Engineering Fundamentals
Understanding temperature
Temperature = 0

The overall The overall experience was positive Repeated

experience was execution
The overall experience was positive
produces the
The overall experience was positive same results

[email protected] LLM
[email protected]
Temperature = 1

The overall experience was positive Repeated

positive negative magical execution can
The overall experience was magical
produce different
0.6 0.1 0.3
The overall experience was negative results

This file is meant for personal use by [email protected] only.

- Zero-shot, Few-shot
- Chain-of-Thought
- Rephrase & Respond
- Self-Consistency
- LLM-as-a-judge
- Tree-of-thought
[email protected]
[email protected]

This file is meant for personal use by [email protected] only.

Foundation
Models

Train

[email protected]
[email protected]
LLMs Templates System Message,
Few Shot Examples

Infer Prompts

Max Length,
Parameters
Temperature
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.

Planet, Code - PYTHON For LARGE LANGUAGE MODELS - A Beginners Handbook For Leveraging Llms Into Modern Development Workflows and Applications (2025)
No ratings yet
Planet, Code - PYTHON For LARGE LANGUAGE MODELS - A Beginners Handbook For Leveraging Llms Into Modern Development Workflows and Applications (2025)
254 pages
Generative AI For Dummies
67% (3)
Generative AI For Dummies
6 pages
OceanofPDF - Com LLMs in Enterprise - Ahmed Menshawy
No ratings yet
OceanofPDF - Com LLMs in Enterprise - Ahmed Menshawy
194 pages
White Topping Report
73% (11)
White Topping Report
21 pages
New Agentic AI
No ratings yet
New Agentic AI
16 pages
Techniques, Tricks & Frameworks
No ratings yet
Techniques, Tricks & Frameworks
143 pages
A Beginner's Guide To Large Language Models
No ratings yet
A Beginner's Guide To Large Language Models
25 pages
Mod 4
No ratings yet
Mod 4
69 pages
The Best LLMs Cheatsheet - Part 1
No ratings yet
The Best LLMs Cheatsheet - Part 1
16 pages
AI Week9
No ratings yet
AI Week9
37 pages
Fine Tuning Techniques For Large Language Models LLMs
No ratings yet
Fine Tuning Techniques For Large Language Models LLMs
15 pages
Aryan A. What Is LLMOps. Large Language Models in Production 2024
100% (1)
Aryan A. What Is LLMOps. Large Language Models in Production 2024
67 pages
21046
No ratings yet
21046
38 pages
A Development Approach To Generative AI and Llm-Based Software Applications' Deployment
No ratings yet
A Development Approach To Generative AI and Llm-Based Software Applications' Deployment
23 pages
Chatgpt: A Technical Perspective: Presented by Teamx
No ratings yet
Chatgpt: A Technical Perspective: Presented by Teamx
18 pages
Mastering LLMs and Generative AI
No ratings yet
Mastering LLMs and Generative AI
12 pages
PYQ DEMO COMBO PYQ BANK All Odisha Previous Year Subject Wise Topic Wise 20000 Questions Answer PDF
100% (1)
PYQ DEMO COMBO PYQ BANK All Odisha Previous Year Subject Wise Topic Wise 20000 Questions Answer PDF
51 pages
LLM Mastery Pathways
No ratings yet
LLM Mastery Pathways
8 pages
LLM and Gen AI
No ratings yet
LLM and Gen AI
4 pages
(English) Introduction To Large Language Models (DownSub - Com)
No ratings yet
(English) Introduction To Large Language Models (DownSub - Com)
9 pages
A Beginner's Guide To Large Language Mo-Ebook-Part1
No ratings yet
A Beginner's Guide To Large Language Mo-Ebook-Part1
25 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
NAIPDC 2025 Bootcamp Slides - National AI Prompt Design Challenge Philippines
No ratings yet
NAIPDC 2025 Bootcamp Slides - National AI Prompt Design Challenge Philippines
93 pages
Generative AI and LLMS
No ratings yet
Generative AI and LLMS
34 pages
E Book Unleashing AI Powered Search Pureinsights
No ratings yet
E Book Unleashing AI Powered Search Pureinsights
48 pages
Day 2 Module 2 - Understanding LLMs
No ratings yet
Day 2 Module 2 - Understanding LLMs
14 pages
Training Large Language Models
No ratings yet
Training Large Language Models
7 pages
《A Primer on Large Language Models and their Limitations
No ratings yet
《A Primer on Large Language Models and their Limitations
33 pages
(Coursera) GenAI
No ratings yet
(Coursera) GenAI
27 pages
Attention Is All You Need.
No ratings yet
Attention Is All You Need.
5 pages
A Beginner's Guide To Large Language Models Part 1
No ratings yet
A Beginner's Guide To Large Language Models Part 1
25 pages
Llm-Based Software Deployment
No ratings yet
Llm-Based Software Deployment
23 pages
Problem Set 1 - Simple Interest
50% (2)
Problem Set 1 - Simple Interest
2 pages
Exploring The Evolution of Large Language Models: Architectures, Applications, and Future Directions
No ratings yet
Exploring The Evolution of Large Language Models: Architectures, Applications, and Future Directions
11 pages
Understanding Large Language Models (LLMS) - A Mode
No ratings yet
Understanding Large Language Models (LLMS) - A Mode
3 pages
What Are LLMs
No ratings yet
What Are LLMs
3 pages
Large Language Models
No ratings yet
Large Language Models
17 pages
Presentation On Ai
No ratings yet
Presentation On Ai
10 pages
Responsible Design and Use of Large Language Models
No ratings yet
Responsible Design and Use of Large Language Models
12 pages
OSS Engine Parts Section
No ratings yet
OSS Engine Parts Section
28 pages
Data Seminar
No ratings yet
Data Seminar
10 pages
2024 NTU - Resaro - LLM - Security - Paper
No ratings yet
2024 NTU - Resaro - LLM - Security - Paper
19 pages
LLM Seminar PDF
No ratings yet
LLM Seminar PDF
10 pages
In Consulting Nasscom Deloitte Paper Large Language Models LLMs Noexp
No ratings yet
In Consulting Nasscom Deloitte Paper Large Language Models LLMs Noexp
13 pages
Data Sheet - Carrier Chiller
No ratings yet
Data Sheet - Carrier Chiller
4 pages
Using Chatgpt, Gpt-4, & Large Language Models in The Enterprise
No ratings yet
Using Chatgpt, Gpt-4, & Large Language Models in The Enterprise
20 pages
SW Post 1
No ratings yet
SW Post 1
5 pages
Large Language Models
No ratings yet
Large Language Models
3 pages
SSRN Id4655822
No ratings yet
SSRN Id4655822
9 pages
Introduction To Gen AI
No ratings yet
Introduction To Gen AI
7 pages
Tacn VD 1 4
No ratings yet
Tacn VD 1 4
6 pages
Global Logic Interview Questions and Answers
No ratings yet
Global Logic Interview Questions and Answers
6 pages
Pe 1
No ratings yet
Pe 1
5 pages
Large Language Models and Their Use Cases
No ratings yet
Large Language Models and Their Use Cases
3 pages
2 Notes
No ratings yet
2 Notes
3 pages
1st Note
No ratings yet
1st Note
3 pages
ICT5358 Himanshu Patel
No ratings yet
ICT5358 Himanshu Patel
5 pages
LLMS and Ai
No ratings yet
LLMS and Ai
7 pages
LLM
No ratings yet
LLM
3 pages
Python BAKMR010399001
No ratings yet
Python BAKMR010399001
3 pages
LLM Model
No ratings yet
LLM Model
3 pages
Ai 101
No ratings yet
Ai 101
3 pages
Kickstart Your Journey With LLM - A Comprehensive Guide
No ratings yet
Kickstart Your Journey With LLM - A Comprehensive Guide
2 pages
Rotax 912 Operator's Manual
No ratings yet
Rotax 912 Operator's Manual
85 pages
State Budget 2025-26
No ratings yet
State Budget 2025-26
13 pages
Grade 6 2nd Q Final
No ratings yet
Grade 6 2nd Q Final
5 pages
CRM Section Two
No ratings yet
CRM Section Two
4 pages
Cinema India
No ratings yet
Cinema India
31 pages
Chem m10
No ratings yet
Chem m10
24 pages
Table Showing Current Ratio: List of Tables
No ratings yet
Table Showing Current Ratio: List of Tables
37 pages
Don Mariano Marcos Memorial State University College of Graduate Studies
No ratings yet
Don Mariano Marcos Memorial State University College of Graduate Studies
4 pages
Prefinal-1 Model Paper (2024-25)
No ratings yet
Prefinal-1 Model Paper (2024-25)
4 pages
Chapter 12.2 - Financial Statements
No ratings yet
Chapter 12.2 - Financial Statements
10 pages
Lecture 1a
No ratings yet
Lecture 1a
22 pages
Pre-Schwarzian and Schwarzian Norm Estimates For Subclasses of Univalent Functions
No ratings yet
Pre-Schwarzian and Schwarzian Norm Estimates For Subclasses of Univalent Functions
19 pages
Origin of HAZOP Analysis
No ratings yet
Origin of HAZOP Analysis
5 pages
Xanthan Gum On Foam Concrete PDF
No ratings yet
Xanthan Gum On Foam Concrete PDF
8 pages
Xie 2021
No ratings yet
Xie 2021
8 pages
Synthetic
No ratings yet
Synthetic
6 pages
64482-International Price Index 23 24 v11
No ratings yet
64482-International Price Index 23 24 v11
30 pages
Action Reesearch Webinar CPD Certificate April 2025
No ratings yet
Action Reesearch Webinar CPD Certificate April 2025
5 pages
Internship Report
No ratings yet
Internship Report
10 pages
Werner 2018 Geographies of Production I Global Production and Uneven Development
No ratings yet
Werner 2018 Geographies of Production I Global Production and Uneven Development
11 pages
Aircraft Communication System AKD20603: Practical Assignment - Aircraft Hs-125
No ratings yet
Aircraft Communication System AKD20603: Practical Assignment - Aircraft Hs-125
16 pages
Gsu100 6648-0.0
No ratings yet
Gsu100 6648-0.0
16 pages
Surprise Test Solution
No ratings yet
Surprise Test Solution
1 page
After Class - AVTC6 - Unit 6 - Pie Charts - K26
No ratings yet
After Class - AVTC6 - Unit 6 - Pie Charts - K26
3 pages
Circuit Design Powerful Blad Tinkercad
No ratings yet
Circuit Design Powerful Blad Tinkercad
1 page
Prompt Perfect
From Everand
Prompt Perfect
Muni
No ratings yet
Comprehensive Beginner’s Guide to Google’s Generative AI Studio for Non-technical Executives
From Everand
Comprehensive Beginner’s Guide to Google’s Generative AI Studio for Non-technical Executives
CertSquad Professional Trainers
No ratings yet