100% found this document useful (1 vote)

295 views43 pages

GenAI POC - Training

Uploaded by

hoticgirl

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

295 views43 pages

GenAI POC - Training

Uploaded by

hoticgirl

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 43

GenAI POC to

Production
Lessons Learned Developing a GraphRAG Production App

March 21, 2024

Presenters

Alex Gilmore Alexander Fournier Daniel Bukowski

Consulting Engineer Sales Engineer Sales Engineer

2 Neo4j Inc. All rights reserved 2023

Key Topics
1. Notebooks → Streamlit

2. Streamlit → Self-Hosted

3. Back-End and Data Engineering

4. Data and Data Model

5. Logging

3 Neo4j Inc. All rights reserved 2023

Key Takeaways

Work toward building a viable app for today and tomorrow

Real-world usage is critical to inform how you evolve your app

Neo4j knowledge graphs are ﬂexible elements of the architecture:

● Accurate, complete grounding

● Logging usage
● Visibility into LLM responses

Build in a logging function to understand app behavior

4 Neo4j Inc. All rights reserved 2023

GenAI Project Overview

5 Neo4j Inc. All rights reserved 2023

What is Agent Neo?
● LLM Chat Application to assist
individuals using the Neo4j and the
Graph Data Science Library
● RAG Application grounded on a Neo4j
AuraDS Knowledge Graph containing
text from Neo4j Documentation and
blogs
● Built using LangChain
● POC built using Streamlit
● Production on cloud infrastructure
with a React front-end

6 Neo4j Inc. All rights reserved 2023

Notebooks → Streamlit

7 Neo4j Inc. All rights reserved 2023

Notebooks → Streamlit

The initial POC was built across

four Jupyter notebooks.

It worked, but…

Wrangling multiple notebooks

is error prone and time
consuming

Users had to load and run the

notebook to ask a question

8 Neo4j Inc. All rights reserved 2023

Notebooks → Streamlit
class LLM(BaseModel):
"""
What variables become Interface for interacting with different LLMs.
"""
changeable parameters?
llm_type: str
● Temperature temperature: float = 0.7
llm_instance: ChatVertexAI
● RAG on / off
def _init_llm(self, llm_type: str,
temperature: float):
What architecture makes sense for """
your application? This function initializes an LLM for conversation.

"""
● LLM handler . . .
● Database handler
def get_response(self, question: str,
context: pd.DataFrame | None) -> str:
"""
Get a response from the LLM.
"""
. . .
9 Neo4j Inc. All rights reserved 2023
Notebooks → Streamlit

Enables quick iteration and controlled

experimentation

Easy deployment to make your app

available to users and share demos

Expanding users helps gain an

understanding of how the app might be
used in production

10 Neo4j Inc. All rights reserved 2023

Streamlit → Self-Hosted

11 Neo4j Inc. All rights reserved 2023

Streamlit → Self-Hosted

What Streamlit managed: Streamlit has challenges:

● Application state management
● Deploy Streamlit “off Streamlit”
● Autoscaling
● Access to private / company repos
● Auto deployments from Github
● Custom / company styling
● UI styling / HTML
● Auth
● Frontend components

12 Neo4j Inc. All rights reserved 2023

Streamlit → Self-Hosted

13 Neo4j Inc. All rights reserved 2023

Cloud Infrastructure

14 Neo4j Inc. All rights reserved 2023

Cloud Infrastructure Recipe

Migrated to Google Cloud Managed Services:

● Google Cloud Run to host our backend API & frontend React app
● Google Cloud DNS
● Google Cloud Secrets Manager
● Google Cloud Artifact Registry / Container repository
● Google Cloud Storage

Github

● Github actions to handle auto-deployment

15 Neo4j Inc. All rights reserved 2023

Streamlit → React

● Migrating from Streamlit to React allowed us to to optimize performance through

the Virtual DOM and a higher degree of ﬁne grained support
● A greater degree of decoupling from the backend and frontend logic
● Migrated frontend code from python to typescript, moving from a dynamically
typed to static typed language helps us catch more 🪲
● Leaned heavily on Neo4j’s Needle design system - a fantastic asset, styling and
component library to make beautiful Neo4j-y UIs
● Packed in a docker image, pushed to google artifact repository and deployed via
Cloud Run

16 Neo4j Inc. All rights reserved 2023

Streamlit → Fastapi

● Validate requests
and responses
● Perform logging in
the background
● Automatic
documentation
● Stateless

17 Neo4j Inc. All rights reserved 2023

Fastapi & Pydantic

● Object
representation
● Advanced
validation

18 Neo4j Inc. All rights reserved 2023

Self-Hosted UI

19 Neo4j Inc. All rights reserved 2023

Back-End

20 Neo4j Inc. All rights reserved 2023

Notebooks →Data Engineering Pipelines

21 Neo4j Inc. All rights reserved 2023

Chunking Strategies

● Our data sources come in a variety of formats: unstructured, semi structured, and
unstructured.
● In anticipation of encoding our documents as embeddings, we chunk/partition the
documents using some logical pivot point

22 Neo4j Inc. All rights reserved 2023

Chunking Strategies

23 Neo4j Inc. All rights reserved 2023

Data and Data Model
Evolution

24 Neo4j Inc. All rights reserved 2023

Initial Grounding Data and Data Model
● 1,150 documents of ofﬁcial Neo4j documentation
● Developer blogs
● Support knowledge base
● Demo Github repos

25 Neo4j Inc. All rights reserved 2023

Expanded (then contracted) Grounding Data

As the project progressed, we discussed what additional data sources could further
improve results. We decided to include:
● More code from public Neo4j repos
● Transcripts of trainings from the Neo4j YouTube Page

But…
● More is not always better. Instead of ALL YouTube trainings we focused on the
Going Meta series, which focuses on ontologies and GenAI
● Quality over quantity. We also emphasized ensuring we properly ingested code
repos, rather than arbitrary text lengths.

26 Neo4j Inc. All rights reserved 2023

Logging and Visualizing Conversations
Logging conversations helped us see how users were interacting with Agent Neo and
which grounding text was being used.

Graph of an actual conversation between an Agent Neo user and the ChatGPT-4 LLM.
Context Documents are labeled with their GDS Community.
27 Neo4j Inc. All rights reserved 2023
Advanced RAG Strategies

Parent-Child: Subset larger text chunks into smaller chunks for initial matching, then
retrieve the larger chunk for response generation.

Topic Summaries: Identify semantically-similar clusters of text in the grounding

database.

Questions: Use our question logging to identify repeat questions and the grounding
data that was retrieved to produce answers. Also use an LLM to generate additional
questions from the grounding data.

28 Neo4j Inc. All rights reserved 2023

Basic Data Model

Parent - Child

Question Matching

Topic Summaries

Advanced Graph Model Emerges

Enhanced
Grounding Data

LLM Conversation

Logging and Evaluation

Importance of Logging and Visibility
Existing highly-regulated industries (i.e., ﬁnance, life sciences, healthcare, etc…)

AI-Speciﬁc Regulations like the EU AI Act and potential regulations in the US and
elsewhere

Critical for production environments where the GenAI app responses may have
real-world implications

Be able to understand how the LLM generated answers (good or bad)

Identify areas for performance improvements

LangSmith
● LangSmith is one of the available
tools to log and analyze GenAI App
performance.
● Developed by the team behind
LangChain, but is platform-agnostic.
● Provides extensive logs for each step
of a query.
● Can analyze using LangSmith UI or
query the logs into a Notebook (ref:
LangSmith Cookbook Repo).

Latency can be an issue
LLM 1 LLM 2 LLM 3

Distribution of Latency (seconds) by LLM

Token count varies widely
● Users can specify the count of
grounding documents (from 0 to 10)
so variation is expected
● While models now have much larger
context windows, more grounding
data is not always better
● Efﬁciency, cost, and latency all need
to be considered when deploying a
GenAI app

Potential LangSmith Data as a Graph

Conclusion

Key Takeaways

Work toward building a viable app for today and tomorrow

Real-world usage is critical to inform how you evolve your app

Neo4j knowledge graphs are ﬂexible elements of the architecture:

● Accurate, complete grounding

● Logging usage
● Visibility into LLM responses

Build in a logging function to understand app behavior

Resources
Neo4j GenAI Ecosystem Page:
https://neo4j.com/labs/genai-ecosystem/

Needle Starter Kit:

https://neo4j.com/labs/neo4j-needle-starterkit/

Neo4j Developer Blog:

https://neo4j.com/developer-blog/

Going Meta YouTube Series:

https://www.youtube.com/@neo4j

Thank you for attending!

Generative AI Interview Questions and Answers
No ratings yet
Generative AI Interview Questions and Answers
7 pages
Yugandar - Generative AI Architect
No ratings yet
Yugandar - Generative AI Architect
8 pages
Azure OpenAI Cookbook
No ratings yet
Azure OpenAI Cookbook
173 pages
Generative AI
100% (1)
Generative AI
107 pages
Building A Streamlit Chatbot With LangChain and Llama 3.1 - Exploring LLMs - 3 - by Abou Zuhayr - Sep, 2024 - GoPenAI
No ratings yet
Building A Streamlit Chatbot With LangChain and Llama 3.1 - Exploring LLMs - 3 - by Abou Zuhayr - Sep, 2024 - GoPenAI
15 pages
GenerativeAI Projects
100% (2)
GenerativeAI Projects
46 pages
Power Amazon Bedrock Applications With Neo4j Knowledge Graph
No ratings yet
Power Amazon Bedrock Applications With Neo4j Knowledge Graph
19 pages
Evolving LLOMPS For RAG
No ratings yet
Evolving LLOMPS For RAG
6 pages
RAG Notes
No ratings yet
RAG Notes
4 pages
Generative Ai Terminology
67% (3)
Generative Ai Terminology
26 pages
Generative AI With Large Language Models
100% (3)
Generative AI With Large Language Models
31 pages
PythonAI LLMs ForSharing
No ratings yet
PythonAI LLMs ForSharing
47 pages
Building LLM Applications For Production
100% (3)
Building LLM Applications For Production
28 pages
Generative AI LLM Tutorial
No ratings yet
Generative AI LLM Tutorial
25 pages
GenAI Interview Questions-Draft
No ratings yet
GenAI Interview Questions-Draft
27 pages
Kubernetes
No ratings yet
Kubernetes
42 pages
Application of Large Language
No ratings yet
Application of Large Language
75 pages
Rag 1708257109
100% (1)
Rag 1708257109
5 pages
KAG Graph + Multimodal RAG + LLM Agents = Powerful AI Reasoning - by Gao Dalie (高達烈) - in Towards AI - Freedium
No ratings yet
KAG Graph + Multimodal RAG + LLM Agents = Powerful AI Reasoning - by Gao Dalie (高達烈) - in Towards AI - Freedium
13 pages
Knowledge Graphs V Vector Databases and When Not To Use Them!
No ratings yet
Knowledge Graphs V Vector Databases and When Not To Use Them!
3 pages
Introduction To Generative AI LLM
100% (1)
Introduction To Generative AI LLM
9 pages
26 RAG Concepts in Alphabetical Order
No ratings yet
26 RAG Concepts in Alphabetical Order
15 pages
Kubernetes For MLOps Engineers
No ratings yet
Kubernetes For MLOps Engineers
7 pages
MasterClass Agentic AI & RAG Flyer-1
No ratings yet
MasterClass Agentic AI & RAG Flyer-1
4 pages
7 Agentic RAG System Architectures To Build AI Agents
100% (1)
7 Agentic RAG System Architectures To Build AI Agents
12 pages
GraphRAG + GPT-4o-Mini Is The RAG Heaven - by Vatsal Saglani - Jul, 2024 - Towards AI
No ratings yet
GraphRAG + GPT-4o-Mini Is The RAG Heaven - by Vatsal Saglani - Jul, 2024 - Towards AI
34 pages
TensorFlow Cheatsheet Zero To Mastery V1.01
No ratings yet
TensorFlow Cheatsheet Zero To Mastery V1.01
26 pages
5 Pretraining On Unlabeled Data - Build A Large Language Model (From Scratch)
No ratings yet
5 Pretraining On Unlabeled Data - Build A Large Language Model (From Scratch)
61 pages
Aryan A. What Is LLMOps. Large Language Models in Production 2024
100% (1)
Aryan A. What Is LLMOps. Large Language Models in Production 2024
67 pages
Hands-On Lab With LLMs and Gen AI Within IDC
No ratings yet
Hands-On Lab With LLMs and Gen AI Within IDC
57 pages
Vector Database Essentials
No ratings yet
Vector Database Essentials
26 pages
GenAI Pinnacle Roadmap
100% (1)
GenAI Pinnacle Roadmap
8 pages
Oracle Generative AI Services
No ratings yet
Oracle Generative AI Services
17 pages
RAG Notes
No ratings yet
RAG Notes
19 pages
GenAI Unit1 3
No ratings yet
GenAI Unit1 3
31 pages
User Guide Technical Reference Manual - 2015
No ratings yet
User Guide Technical Reference Manual - 2015
110 pages
Hands-On Guide To Agentic Corrective RAG-1
No ratings yet
Hands-On Guide To Agentic Corrective RAG-1
5 pages
Little Guide To Building Large Language Models in 2024
100% (1)
Little Guide To Building Large Language Models in 2024
65 pages
LLM Mesh: A Practical Guide To Using Generative AI in The Enterprise
100% (1)
LLM Mesh: A Practical Guide To Using Generative AI in The Enterprise
27 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
LangGraph: Multi-Agent Systems
No ratings yet
LangGraph: Multi-Agent Systems
9 pages
A Practical Primer To AI Agents 1736197641
No ratings yet
A Practical Primer To AI Agents 1736197641
23 pages
RAG Slide ENG
No ratings yet
RAG Slide ENG
41 pages
Multi-Agent Agentic RAG Systems - Prashant Sahu
No ratings yet
Multi-Agent Agentic RAG Systems - Prashant Sahu
10 pages
LLM Applications
100% (1)
LLM Applications
1 page
Embeddings
No ratings yet
Embeddings
13 pages
1GitHub - Modelcontextprotocol - Python-Sdk - The Official Python SDK For Model Context Protocol Servers and Clients
No ratings yet
1GitHub - Modelcontextprotocol - Python-Sdk - The Official Python SDK For Model Context Protocol Servers and Clients
9 pages
RAG and LangChain Loading Documents Round1
No ratings yet
RAG and LangChain Loading Documents Round1
8 pages
Software Architecture in An AI World
100% (1)
Software Architecture in An AI World
25 pages
RAG - The Future of LLMs - LinkedIn
No ratings yet
RAG - The Future of LLMs - LinkedIn
7 pages
Building Finetuning Aimodels
No ratings yet
Building Finetuning Aimodels
41 pages
00 Course Introduction
100% (1)
00 Course Introduction
17 pages
What Are Vector Databases
No ratings yet
What Are Vector Databases
5 pages
API Test Cases
No ratings yet
API Test Cases
9 pages
Types of RAG: @bhavishya Pandit
No ratings yet
Types of RAG: @bhavishya Pandit
15 pages
GraphRAG + GPT-4o Mini - Building An AI Knowledge Graph at Low Cost - by Shuyi Wang - Jul, 2024 - Cubed
No ratings yet
GraphRAG + GPT-4o Mini - Building An AI Knowledge Graph at Low Cost - by Shuyi Wang - Jul, 2024 - Cubed
31 pages
Vector Databases
No ratings yet
Vector Databases
35 pages
Enhancing AI Systems With Agentic Workflows Patterns in Large Language Model
No ratings yet
Enhancing AI Systems With Agentic Workflows Patterns in Large Language Model
6 pages
LLMs in Production-MLC - GRC
No ratings yet
LLMs in Production-MLC - GRC
39 pages
Langchain PDF Reader
100% (1)
Langchain PDF Reader
15 pages
Presentation 1
No ratings yet
Presentation 1
5 pages
Satellite Interception Solutions
No ratings yet
Satellite Interception Solutions
8 pages
Web UNIT-1
No ratings yet
Web UNIT-1
27 pages
2) Introduction To MySQL Database
No ratings yet
2) Introduction To MySQL Database
41 pages
RLM License Administration
No ratings yet
RLM License Administration
137 pages
CH2-MCQ-12 Comp
No ratings yet
CH2-MCQ-12 Comp
4 pages
TIBCO BusinessConnect Scripting Deployment User S Guide
No ratings yet
TIBCO BusinessConnect Scripting Deployment User S Guide
76 pages
Em Tech Quiz (Advanced Presentation Skills)
No ratings yet
Em Tech Quiz (Advanced Presentation Skills)
2 pages
Cambridge IGCSE (9-1) : Information and Communication Technology 0983/31
No ratings yet
Cambridge IGCSE (9-1) : Information and Communication Technology 0983/31
8 pages
CSS Questions
No ratings yet
CSS Questions
15 pages
Frontend Security
No ratings yet
Frontend Security
12 pages
Chapter 2-Part 1
No ratings yet
Chapter 2-Part 1
18 pages
Source Code Management Using Git & GitHub
No ratings yet
Source Code Management Using Git & GitHub
3 pages
EM2 - Server IM ENG 150914-R2
No ratings yet
EM2 - Server IM ENG 150914-R2
71 pages
Answer Key Sample Paper 3 AI Class 10
No ratings yet
Answer Key Sample Paper 3 AI Class 10
12 pages
Tutorials List - Javatpoint
No ratings yet
Tutorials List - Javatpoint
30 pages
Dictionary App Builder 02 Building Apps
No ratings yet
Dictionary App Builder 02 Building Apps
27 pages
Partizan Access Control Management User Manual: Version 2.0.0, 14 August 2015
No ratings yet
Partizan Access Control Management User Manual: Version 2.0.0, 14 August 2015
53 pages
User Manual - Accops Hysecure Client VPN Installation
No ratings yet
User Manual - Accops Hysecure Client VPN Installation
13 pages
Web Testing Checklist & Guidelines: 1. Functionality
No ratings yet
Web Testing Checklist & Guidelines: 1. Functionality
4 pages
Class 7 - ICT Definitions Term I
No ratings yet
Class 7 - ICT Definitions Term I
2 pages
Resume Chetan Jadhav
No ratings yet
Resume Chetan Jadhav
1 page
Ankit Resume
No ratings yet
Ankit Resume
1 page
Simple, Beautiful Notes For Ableton Live: Getting Started Guide
No ratings yet
Simple, Beautiful Notes For Ableton Live: Getting Started Guide
10 pages
IBM® DB2® Web Query For I™ 5733WQX Install Instructions - Version 2.1.0
No ratings yet
IBM® DB2® Web Query For I™ 5733WQX Install Instructions - Version 2.1.0
9 pages
Pradeep's Resume
No ratings yet
Pradeep's Resume
1 page
Curiculum Vitae - Ananda Jauhar F-1
No ratings yet
Curiculum Vitae - Ananda Jauhar F-1
2 pages
Pipeline With XSLT Soap Output, How To Submit To Web Service - Designing Pipelines - SnapLogic Community
No ratings yet
Pipeline With XSLT Soap Output, How To Submit To Web Service - Designing Pipelines - SnapLogic Community
1 page
Guia Instalacion Sakai
No ratings yet
Guia Instalacion Sakai
2 pages

GenAI POC - Training

Uploaded by

GenAI POC - Training

Uploaded by

GenAI POC to

March 21, 2024

Alex Gilmore Alexander Fournier Daniel Bukowski

2 Neo4j Inc. All rights reserved 2023

3. Back-End and Data Engineering

4. Data and Data Model

3 Neo4j Inc. All rights reserved 2023

Work toward building a viable app for today and tomorrow

Real-world usage is critical to inform how you evolve your app

Neo4j knowledge graphs are ﬂexible elements of the architecture:

● Accurate, complete grounding

Build in a logging function to understand app behavior

4 Neo4j Inc. All rights reserved 2023

5 Neo4j Inc. All rights reserved 2023

6 Neo4j Inc. All rights reserved 2023

7 Neo4j Inc. All rights reserved 2023

The initial POC was built across

Wrangling multiple notebooks

Users had to load and run the

8 Neo4j Inc. All rights reserved 2023

Enables quick iteration and controlled

Easy deployment to make your app

Expanding users helps gain an

10 Neo4j Inc. All rights reserved 2023

11 Neo4j Inc. All rights reserved 2023

What Streamlit managed: Streamlit has challenges:

12 Neo4j Inc. All rights reserved 2023

13 Neo4j Inc. All rights reserved 2023

14 Neo4j Inc. All rights reserved 2023

Migrated to Google Cloud Managed Services:

● Github actions to handle auto-deployment

15 Neo4j Inc. All rights reserved 2023

● Migrating from Streamlit to React allowed us to to optimize performance through

16 Neo4j Inc. All rights reserved 2023

17 Neo4j Inc. All rights reserved 2023

18 Neo4j Inc. All rights reserved 2023

19 Neo4j Inc. All rights reserved 2023

20 Neo4j Inc. All rights reserved 2023

21 Neo4j Inc. All rights reserved 2023

22 Neo4j Inc. All rights reserved 2023

23 Neo4j Inc. All rights reserved 2023

24 Neo4j Inc. All rights reserved 2023

25 Neo4j Inc. All rights reserved 2023

26 Neo4j Inc. All rights reserved 2023

Topic Summaries: Identify semantically-similar clusters of text in the grounding

28 Neo4j Inc. All rights reserved 2023

29 Neo4j Inc. All rights reserved 2023

30 Neo4j Inc. All rights reserved 2023

31 Neo4j Inc. All rights reserved 2023

32 Neo4j Inc. All rights reserved 2023

33 Neo4j Inc. All rights reserved 2023

34 Neo4j Inc. All rights reserved 2023

Be able to understand how the LLM generated answers (good or bad)

Identify areas for performance improvements

35 Neo4j Inc. All rights reserved 2023

36 Neo4j Inc. All rights reserved 2023

Distribution of Latency (seconds) by LLM

37 Neo4j Inc. All rights reserved 2023

38 Neo4j Inc. All rights reserved 2023

39 Neo4j Inc. All rights reserved 2023

40 Neo4j Inc. All rights reserved 2023

Work toward building a viable app for today and tomorrow

Real-world usage is critical to inform how you evolve your app

Neo4j knowledge graphs are ﬂexible elements of the architecture:

● Accurate, complete grounding

Build in a logging function to understand app behavior

41 Neo4j Inc. All rights reserved 2023

Needle Starter Kit:

Neo4j Developer Blog:

Going Meta YouTube Series:

42 Neo4j Inc. All rights reserved 2023

43 Neo4j Inc. All rights reserved 2023

You might also like