
Wizer Training Module Configuration

Introduction
This document outlines the rationale for replacing the Vanna training module with a
new, more efficient system, describes the proposed approach for creating a semantic
layer, and details the methodology for embedding, performance analysis, and next
steps.

Current Implementation Limitations

Why Replace the Vanna Training Module?

● Reliability: Vanna is prone to sporadic downtime and performance inconsistencies.
● Scalability: Vanna is difficult to scale to enterprise-level needs because of its excessive abstraction.
● Performance: Vanna's response generation is slow, primarily due to the Large Language Model's lengthy processing times.
● Size: The Vanna package is overly large (>300 MB), making it unsuitable for serverless API deployment.
● Accuracy and Code Complexity: More complex query generation requires specialized models.
● Inference from Training Data: The current model requires extensive key-value pairs for training, which hinders rapid deployment.
Proposed Approach

Semantic Layer Creation

Classes

● WizerStore: Handles CRUD operations against the vector database. Methods implemented for Pinecone, Chroma, and Qdrant include:
○ Adding DDL, documentation, and question-SQL data.
○ Retrieving related DDL, documentation, and similar question-SQL pairs.
○ Managing training data.
import pandas as pd


class WizerStore:
    def add_ddl(self, ddl: str, **kwargs) -> str:
        # Implement here
        raise NotImplementedError

    def add_documentation(self, doc: str, **kwargs) -> str:
        # Implement here
        raise NotImplementedError

    def add_question_sql(self, question: str, sql: str, **kwargs) -> str:
        # Implement here
        raise NotImplementedError

    def get_related_ddl(self, question: str, **kwargs) -> list:
        # Implement here
        raise NotImplementedError

    def get_related_documentation(self, question: str, **kwargs) -> list:
        # Implement here
        raise NotImplementedError

    def get_similar_question_sql(self, question: str, **kwargs) -> list:
        # Implement here
        raise NotImplementedError

    def get_training_data(self, **kwargs) -> pd.DataFrame:
        # Implement here
        raise NotImplementedError

    def remove_training_data(self, id: str, **kwargs) -> bool:
        # Implement here
        raise NotImplementedError
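For illustration, a Chroma-backed implementation of two of these methods might look roughly like the following sketch; the in-memory client, collection name, and n_results value are assumptions for demonstration, not part of the design. The Pinecone and Qdrant stores would implement the same interface against their respective clients.

import uuid

import chromadb


class ChromaWizerStore(WizerStore):
    def __init__(self):
        # In-memory client for demonstration; a persistent client would be
        # used in production.
        self.client = chromadb.Client()
        self.ddl_collection = self.client.get_or_create_collection("ddl")

    def add_ddl(self, ddl: str, **kwargs) -> str:
        # Store the DDL statement and return its generated id.
        doc_id = str(uuid.uuid4())
        self.ddl_collection.add(documents=[ddl], ids=[doc_id])
        return doc_id

    def get_related_ddl(self, question: str, **kwargs) -> list:
        # Return the stored DDL statements most similar to the question.
        results = self.ddl_collection.query(query_texts=[question], n_results=3)
        return results["documents"][0]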

● WizerLLM: Interfaces with the LLM; submit_prompt is implemented with Groq (see the sketch below).

class WizerLLM:
    def submit_prompt(self, prompt, **kwargs) -> str:
        # Implement here
        raise NotImplementedError
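As a minimal sketch, submit_prompt could be implemented with the Groq Python SDK roughly as follows; the default model name and the environment variable used for the API key are assumptions.

import os

from groq import Groq


class GroqWizerLLM(WizerLLM):
    def __init__(self):
        self.client = Groq(api_key=os.environ["GROQ_API_KEY"])

    def submit_prompt(self, prompt, **kwargs) -> str:
        # Send the prompt as a single-turn chat completion and return the text.
        response = self.client.chat.completions.create(
            model=kwargs.get("model", "llama3-8b-8192"),  # assumed default
            messages=[{"role": "user", "content": prompt}],
        )
        return response.choices[0].message.content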
End-to-End Flow
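The following sketch shows one way the two classes might compose into a question-to-SQL flow; the prompt wording is an assumption, not the final design.

def generate_sql(question: str, store: WizerStore, llm: WizerLLM) -> str:
    # Gather context for the question from the semantic layer.
    ddl = "\n".join(store.get_related_ddl(question))
    docs = "\n".join(store.get_related_documentation(question))
    examples = "\n".join(str(pair) for pair in store.get_similar_question_sql(question))

    # Assemble a context-rich prompt and hand it to the LLM.
    prompt = (
        "Generate a SQL query answering the question below.\n\n"
        f"Schema:\n{ddl}\n\n"
        f"Documentation:\n{docs}\n\n"
        f"Similar question-SQL pairs:\n{examples}\n\n"
        f"Question: {question}"
    )
    return llm.submit_prompt(prompt)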
Embeddings

● Selection of the optimal embedding model for performance and accuracy.
● Local storage of embedding models to avoid downloading large files for each call (see the sketch below).
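One way to realize the local-storage point, sketched here with sentence-transformers; the model choice and cache path are assumptions.

from sentence_transformers import SentenceTransformer

# Load the model once at startup; cache_folder keeps the weights on local
# disk so subsequent calls do not re-download them.
model = SentenceTransformer("all-MiniLM-L6-v2", cache_folder="./models")


def embed(texts: list[str]) -> list[list[float]]:
    # Encode a batch of texts into dense vectors for the vector database.
    return model.encode(texts).tolist()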

Performance Analysis

Key Parameters:

● Speed: Benchmark against Vanna to identify bottlenecks (a timing harness is sketched after this list).
● Accuracy: Research and testing with different embedding models.
● Complexity: Evaluate the complexity of generated code and model capacity.
● Inference Ability: Assess models' ability to understand context with minimal training.
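As a sketch, a simple harness for the speed benchmark could time the generate_sql flow from above over a fixed question set and report summary statistics; the question set itself would come from real workloads.

import statistics
import time


def benchmark(questions: list[str], store: WizerStore, llm: WizerLLM) -> None:
    # Time end-to-end SQL generation per question and report summary stats.
    latencies = []
    for question in questions:
        start = time.perf_counter()
        generate_sql(question, store, llm)
        latencies.append(time.perf_counter() - start)
    print(f"mean: {statistics.mean(latencies):.2f}s  "
          f"median: {statistics.median(latencies):.2f}s  "
          f"max: {max(latencies):.2f}s")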

Key Goals (importance: 5 = very high, 1 = very low)

1. Identify the best model for our task (Importance: 5).
2. Determine the point of diminishing returns for model upgrades (Importance: 3).
3. Identify the best training techniques for each LLM (Importance: 5).
4. Continuously test and review results (Importance: 5).
5. Enhance LLM accuracy over time (Importance: 4).
6. Manage training data at scale (Importance: 4).

Next Steps
● Finalize the selection of embedding and LLM models.
● Develop and implement the embedding function for documentation and
question-SQL data.
● Conduct extensive testing against the performance metrics outlined above.
● Plan for scaling the training data management process.
