Discord Taz
Uploaded by Seth Thunder

1) To make it a chatbot that multiple users can ask at the same time, would I have to
make changes to my code beyond switching it to async? You know ChatGPT has millions
of users who each ask it questions from their own account. I want something like
that for mine
2) When I want to load PDF files, for example, should I use PyPDFLoader or
PDFMinerLoader or some other PDF loader? I tried asking LLMs which one is
better but couldn't find an answer, and on YouTube people just use PyPDFLoader
3) When splitting my doc into chunks, what do people split them on? If you
consider any chatbot website where you can upload your doc, there are definitely
several different structures: a PDF that's purely text, one that has sections
with subheaders then text, one that has tables and images with it, etc. How would
I use my separators exactly? I couldn't find anything about that, and I'm not sure
what the optimum chunk size and chunk overlap even are (based on the structure, of course)
4) If I am chatting with a CSV file or MySQL, is it better to use an agent to
answer my questions, or should I use the same idea of chains and all that?
5) If you look at my code from earlier, I did use Chroma, but if I want to deploy it, I
can't have my knowledge base stored and persisted locally, right? How can I
store and persist it on a cloud service, and which one would you recommend?
SethThunder (OP) · Yesterday at 6:23 PM
6) Regarding the loaders, which of these two is better for loading a directory:
DirectoryLoader (with the parameters set to load only PDF files) or
PyPDFDirectoryLoader? The reason I am asking is that DirectoryLoader
has a parameter that lets me use multithreading
7) How can I evaluate my RAG performance as a percentage? I've tried reading on LinkedIn
and watching some YouTube videos, but for some reason there isn't one definitive way. One
source says method X is good while another says it's bad and Y is better
8) For the chain types like stuff, refine, map reduce, and map rerank, how can I
know which one is better for my use case? For my earlier code, since it's just one PDF,
I assume stuff is fine, but when does the input get too big for stuff, so that I should
switch to map rerank?
9) For retrieving from my knowledge base, what's the best retriever type, or what do I
base the choice on? Similarity search, MMR, etc.?
SethThunder (OP) · Yesterday at 6:32 PM
10) Does Chroma have a concept similar to Deeplake's Deep Memory? Deep Memory can
increase my RAG accuracy by 20-30%
@taz I think those are my questions for now. Sorry for asking so many, but I've
been reading literally everywhere and couldn't find answers. I'm not sure if that's
because LC is a new framework or because I'm a beginner in the entire
programming world. I definitely want to make my career in LLMs, as I've been very
hooked on it.
taz · Yesterday at 7:12 PM
1) Other than async, you'll have to think about how you want to isolate user
sessions, e.g. each user should not be able to see another user's chat data. Then
think about the docs they can operate on: are they all going to have access to the
same docs? If not, you need to decide how each user gets their own set
of documents (maybe you can model that as one collection per user, or with metadata, in Chroma)
2) Depends what you want to do with the doc; some libs let you extract images
and tables
I think LC has functionality around that
but my advice is: start simple, use whatever works, and create an abstraction (this can
be as simple as wrapping the PyPDFLoader code in a function) so you can easily
swap out libraries
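The wrapper taz suggests could look like this. The loader class is injected as a parameter purely to keep the example self-contained; with LangChain you would pass `PyPDFLoader` (import path varies by version, so check yours):

```python
# Abstraction sketch: hide the loader behind one function, so swapping
# PyPDFLoader for PDFMinerLoader (or anything else) touches a single place.
def load_documents(path, loader_cls):
    loader = loader_cls(path)
    return loader.load()

# e.g. with LangChain (assumed import path, check your version):
#   from langchain_community.document_loaders import PyPDFLoader
#   docs = load_documents("report.pdf", PyPDFLoader)
```

Because callers only ever see `load_documents`, benchmarking a different PDF library later is a one-line change.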
3) Chunking is an art form. There is no one best solution, but there are a couple of
things you need to keep in mind. First, chunks should be smaller than the maximum input
sequence of your embedding model; if you are using OpenAI, that's large (8,000-ish
tokens), with others it is considerably less. The second thing to keep in mind is the
LLM's context window. For example, if you take the top 10 results and
feed those to an LLM where each chunk (result from the search) is 2,000 tokens, then
you'll end up with 20k tokens, which not many LLMs can handle. To that effect, I
think LC might also have some functionality to help
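taz's 20k-token example is just budget arithmetic, which is worth making explicit. The numbers below are placeholders you would replace with your model's real limits:

```python
# Back-of-the-envelope check: how many retrieved chunks fit in the LLM's
# context window? reserved_tokens covers the prompt template, the question,
# and room for the answer (all numbers here are illustrative assumptions).
def max_chunks_that_fit(context_window, chunk_tokens, reserved_tokens=0):
    usable = context_window - reserved_tokens
    return max(usable // chunk_tokens, 0)

# 10 results x 2,000 tokens = 20,000 tokens; a 16k-context model can only
# take 8 such chunks even with nothing reserved, so top-10 would overflow.
```

Running this before picking `k` (the number of retrieved chunks) and your chunk size avoids silent truncation at query time.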
taz · Yesterday at 7:19 PM
4) Agents, I think, might have an edge here; LC has functionality around that too
5) No specific recommendation on the cloud provider, all of them work fine and
prices are comparable. We have a few cloud deployment blueprints -
https://github.com/chroma-core/chroma/tree/main/examples/deployments
6) If you have a large set of files, then multi-threaded loading might help
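For reference, restricting a directory walk to PDFs is the same thing `DirectoryLoader` does with its glob parameter. A plain-Python equivalent, with the LangChain calls hedged in comments (parameter names assumed, check your version):

```python
from pathlib import Path

# Collect every PDF under a directory tree, the way a directory loader
# with glob="**/*.pdf" would. With LangChain this corresponds roughly to
#   DirectoryLoader(path, glob="**/*.pdf", use_multithreading=True)
# or PyPDFDirectoryLoader(path).
def list_pdfs(root):
    return sorted(str(p) for p in Path(root).rglob("*.pdf"))
```

Multithreading only pays off when per-file parsing is the bottleneck, i.e. many or large files, which matches taz's caveat.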
7) I usually suggest Ragas to people - https://github.com/explodinggradients/ragas
also have a look at this, though you will have to adapt it to LC -
https://docs.ragas.io/en/latest/howtos/integrations/llamaindex.html
taz · Yesterday at 7:27 PM
LC also has its own eval framework, LangSmith I think
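On the "as a %" part of the question: Ragas-style metrics (faithfulness, answer relevancy, etc.) each score between 0 and 1 per example, so one simple way to get a single percentage is to average them. This aggregation is my own illustration, not part of the Ragas API:

```python
# Collapse a dict of metric scores in [0, 1] into one percentage.
# Illustrative only: which metrics to include, and whether to weight
# them, depends entirely on your use case.
def rag_score_percent(metric_scores):
    if not metric_scores:
        raise ValueError("no metrics given")
    return 100 * sum(metric_scores.values()) / len(metric_scores)

# rag_score_percent({"faithfulness": 0.9, "answer_relevancy": 0.7})
# averages to 80%
```

This also explains why there is "no one definitive way": the percentage depends on which metrics you choose to average.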
8) Stuff is fine for a small number of docs.
For complex or numerous docs, use refine or map reduce. Refine works iteratively by
passing each doc to the LLM along with the intermediate answer from the previous doc,
whereas map reduce passes each doc to the LLM to get an answer, then combines the answers
iteratively to arrive at a single answer.
If you need scoring or ranking of the relevancy of results, then map rerank
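The map reduce flow taz describes can be sketched in pure Python with a stub in place of a real LLM call; with LangChain this corresponds to `chain_type="map_reduce"` (exact import path varies by version):

```python
# "map": ask the model about each doc separately;
# "reduce": fold the partial answers pairwise into one final answer.
# llm is any callable taking a prompt string and returning a string.
def map_reduce_qa(docs, question, llm):
    # map step: one partial answer per doc
    partials = [llm(f"Answer '{question}' using: {d}") for d in docs]
    # reduce step: combine partial answers one at a time
    combined = partials[0]
    for p in partials[1:]:
        combined = llm(f"Combine answers: {combined} | {p}")
    return combined
```

Counting the calls makes the cost model clear: n docs means n map calls plus n-1 reduce calls, versus a single call for stuff, which is why stuff wins while everything still fits in context.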
taz · Yesterday at 7:37 PM
9) Use similarity if you want the closest matches; use MMR if you want diversity of
results
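The difference is easiest to see in the MMR selection rule itself. A minimal sketch over plain similarity scores (function name and signature are my own; with LangChain/Chroma this is roughly `vectorstore.as_retriever(search_type="mmr")`):

```python
# Maximal marginal relevance: at each step pick the doc that is relevant
# to the query but least similar to the docs already selected.
# lambda_mult=1.0 reduces to plain similarity search; lower values
# trade relevance for diversity.
def mmr_select(query_sims, doc_sims, k, lambda_mult=0.5):
    """query_sims[i]: similarity of doc i to the query.
    doc_sims[i][j]: similarity between docs i and j."""
    selected = []
    candidates = list(range(len(query_sims)))
    while candidates and len(selected) < k:
        def score(i):
            redundancy = max((doc_sims[i][j] for j in selected), default=0.0)
            return lambda_mult * query_sims[i] - (1 - lambda_mult) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

With two near-duplicate top hits, plain similarity returns both, while MMR swaps the second for a less redundant doc, which is exactly the "diversity" taz means.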
10) Not familiar with Deep Memory. There are, however, hyperparameters of the HNSW
lib, which Chroma uses, that can increase the accuracy of results (at a memory tradeoff)
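For concreteness, Chroma exposes those HNSW knobs through collection metadata. The key names below are per Chroma's docs; the values are illustrative starting points to tune, not recommendations:

```python
# HNSW tuning knobs, passed as collection metadata in Chroma.
hnsw_metadata = {
    "hnsw:space": "cosine",        # distance function
    "hnsw:construction_ef": 200,   # build-time quality (more time/memory)
    "hnsw:search_ef": 100,         # query-time quality (slower, more accurate)
    "hnsw:M": 16,                  # graph connectivity (more memory)
}
# collection = client.create_collection("docs", metadata=hnsw_metadata)
# (requires a chromadb client; raising search_ef is the usual first step
# when recall is too low, which is the accuracy/memory tradeoff taz means)
```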
SethThunder (OP) · Yesterday at 8:29 PM
1) Do you have any reference on how I can isolate sessions? I understood the
concept of what you wrote, but the implementation is where I'm sort of lost. In my
case, the users will be able to operate on the same docs, and yes, I am interested
in using Chroma more than any other DB

5) Based on what do people normally pick a cloud provider? What should I look into
before deciding?

8) How small is small for stuff?

9) MMR would work well if my use case involves setting temperature = 1 or a high
number right?

10) Will I not be able to use memory at all, or does it reduce the number of messages it
can remember?
taz · Yesterday at 11:45 PM
10) No, but Chroma will consume more memory in order to give you better results
taz · Today at 12:05 PM
9) If you're looking for diversity in search results, then MMR is a good choice
5) The easiest I find to start with is AWS
SethThunder (OP) · Today at 7:31 PM
Thanks a lot @taz
