PostgreSQL As A Vector Database: Create, Store, and Query OpenAI Embeddings With Pgvector
21 Jun 2023
Contributors: Avthar Sewrathan, Samuel Gichohi, Matvey Arye
Looking for a “Hello world” tutorial for pgvector and OpenAI embeddings that gives you the basics of using
PostgreSQL as a vector database? You’ve found it!
Vector databases enable efficient storage and search of vector data and are essential to developing and
maintaining AI applications using Large Language Models (LLMs).
With a little help from the pgvector extension, you can leverage PostgreSQL, the flexible and robust SQL
database, as a vector database to store and query OpenAI embeddings. OpenAI embeddings are a data
representation (in the shape of vectors, i.e., lists of numbers) produced by OpenAI’s models and used to
measure the similarity of text strings. Much more on OpenAI embeddings, pgvector, and vector databases later in this post.
We’ll use the example of creating a chatbot to answer questions about Timescale use cases, referencing
content from the Timescale Developer Q&A blog posts, to illustrate the key concepts for creating, storing, and
querying OpenAI embeddings with PostgreSQL and pgvector.
Part 1: How to create embeddings from content using the OpenAI API.
Part 2: How to use PostgreSQL as a vector database and store OpenAI embedding vectors using
pgvector.
Part 3: How to use embeddings retrieved from a vector database to augment LLM generation.
One could think of this tutorial as a first step to building a chatbot that can reference a company knowledge
base or developer docs.
Jupyter Notebook and Code: You can find all the code used in this tutorial in a Jupyter
Notebook, as well as sample content and embeddings on the Timescale GitHub:
timescale/vector-cookbook. We recommend cloning the repo and following along by
executing the code cells as you read through the tutorial.
The idea behind Retrieval Augmented Generation (RAG) is dead simple: provide additional context to the foundational model in the prompt. For
example, if someone asks a baking chatbot, “What is a cronut?” and the foundational model has never heard
of cronuts, you can transform the prompt into context: “A cronut resembles a doughnut and is made from
croissant-like dough filled with flavored cream and fried in grapeseed oil. What is a cronut?”
The foundational model can then use its knowledge of doughnuts and croissants to wax eloquent about cronuts.
This technique is insanely powerful—it allows you to “teach” foundational models about things only you know
about and use that to create a ChatGPT++ experience for your users!
But what context do you provide to the model? If you have a library of information, how do you know what’s
relevant to a given question? Enter embeddings. As mentioned above, OpenAI embeddings are a
mathematical representation of the semantic meaning of a piece of text that allows for similarity search.
This means that if you get a user question and calculate its embedding, you can use similarity search against
data embeddings in your library to find the most relevant information. But that requires having an embedding
representation of your library.
This post is a guide to creating, storing, and querying OpenAI vector embeddings using pgvector, the
extension that turns PostgreSQL into a vector database.
What is pgvector?
Pgvector is an open-source extension for PostgreSQL that enables storing and searching over machine
learning-generated embeddings. It provides different capabilities that let users identify both exact and
approximate nearest neighbors. It is designed to work seamlessly with other PostgreSQL features, including
indexing and querying.
First, install the requirements for this notebook.
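A minimal install sketch, assuming the only third-party packages needed are the libraries imported in the next cell (the repo may pin exact versions in its own requirements file; psycopg2-binary provides the psycopg2 module):

!pip install openai pandas numpy tiktoken psycopg2-binary pgvector

Then import the libraries we’ll use: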
import openai
import os
import pandas as pd
import numpy as np
import json
import tiktoken
import psycopg2
import ast
import pgvector
import math
from psycopg2.extras import execute_values
from pgvector.psycopg2 import register_vector
You’ll need to sign up for an OpenAI Developer Account and create an OpenAI API Key – we recommend
getting a paid account to avoid rate limiting and setting a spending cap so that you avoid any surprise bills.
Once you have an OpenAI API key, it’s a best practice to store it as an environment variable and then have
your Python program read it.
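Here’s a minimal sketch of reading the key, assuming you’ve exported it as an environment variable named OPENAI_API_KEY:

# Read the OpenAI API key from an environment variable rather than hard-coding it
openai.api_key = os.environ['OPENAI_API_KEY']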
In this example, we'll use content from the Timescale blog, specifically from the Developer Q&A section, which
features posts by Timescale users talking about their real-world use cases.
You can replace this blog data with any text you want to embed, such as your own company blog, developer
documentation, internal knowledge base, or any other information you’d like to have a “ChatGPT-like”
experience over.
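Before estimating costs, we load the blog content into a pandas dataframe. A sketch, assuming a CSV of blog posts with title, content, and url columns (the file name here is illustrative; the repo provides its own sample data):

# Load the blog posts to embed into a dataframe (hypothetical file name)
df = pd.read_csv('blog_posts_data.csv')
df.head()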
It's usually a good idea to calculate how much creating embeddings for your selected content will cost. We
provide a number of helper functions to calculate a cost estimate before creating the embeddings to help us
avoid surprises.
For OpenAI, you are charged on a per-token basis for embeddings created. The total cost will be less than
$0.01 for the blog posts we want to embed, thanks to OpenAI’s recent announcement of a 75 % cost reduction
in their most popular embedding model, text-embedding-ada-002.
What is a token? Tokens are common sequences of characters found in text. Roughly speaking, a token is
three-quarters (¾) of a word. Large language models, like GPT-3 and GPT-4 made by OpenAI, are trained to
understand the statistical relationships between tokens and predict the next token in a sequence. Learn more
about tokens with OpenAI’s Tokenizer tool.
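The cost estimate below relies on two helpers: one that counts tokens with tiktoken and one that converts a token count into dollars. A minimal sketch, assuming the cl100k_base encoding used by text-embedding-ada-002 and its list price of $0.0001 per 1,000 tokens at the time of writing:

# Helper function: count the number of tokens in a string using tiktoken
def num_tokens_from_string(string, encoding_name='cl100k_base'):
    encoding = tiktoken.get_encoding(encoding_name)
    return len(encoding.encode(string))

# Helper function: estimate the dollar cost of embedding a given number of tokens
# (assumes text-embedding-ada-002 pricing of $0.0001 per 1,000 tokens)
def get_embedding_cost(num_tokens):
    return num_tokens / 1000 * 0.0001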
# Helper function: calculate total cost of embedding all content in the dataframe
def get_total_embeddings_cost():
    total_tokens = 0
    for i in range(len(df.index)):
        text = df['content'][i]
        token_len = num_tokens_from_string(text)
        total_tokens = total_tokens + token_len
    total_cost = get_embedding_cost(total_tokens)
    return total_cost
The OpenAI API has a limit to the maximum number of tokens it can create an embedding for in a single
request: 8,191 to be specific.
To get around this limit, we'll break up our text into smaller chunks. Generally, it's a best practice to “chunk”
the documents you want to embed into groups of a fixed token size.
The precise number of tokens to include in a chunk depends on your use case and your model’s context
window—the number of input tokens it can handle in a prompt.
For our purposes, we'll aim for chunks of around 512 tokens each. Chunking text up is a complex topic worthy
of its own blog post. We’ll illustrate a simple method we found to work well below. If you want to read about
other approaches, we recommend this blog post and this section of the LangChain docs.
Note: If you prefer to skip this step, you can use the provided file: blog_data_and_embeddings.csv, which
contains the data and embeddings that you'll generate in this step.
The code below creates a new list of our blog content while retaining the metadata associated with the text,
such as the blog title and URL that the text is associated with.
# Create new list with small content chunks to not hit max token limits
# Note: the maximum number of tokens for a single request is 8191
# https://openai.com/docs/api-reference/requests
ideal_token_size = 512
# aim for ~512 tokens per chunk; at roughly 3/4 of a word per token, that's about 384 words
ideal_size = int(ideal_token_size // (4 / 3))

new_list = []
for i in range(len(df.index)):
    # split the blog content into words
    words = df['content'][i].split()
    total_words = len(words)

    # calculate how many chunks this post needs
    chunks = total_words // ideal_size
    if total_words % ideal_size != 0:
        chunks += 1

    start = 0
    end = ideal_size
    for j in range(chunks):
        if end > total_words:
            end = total_words
        new_content = words[start:end]
        new_content_string = ' '.join(new_content)
        new_content_token_len = num_tokens_from_string(new_content_string)
        if new_content_token_len > 0:
            # keep the blog title and URL as metadata alongside each chunk
            new_list.append([df['title'][i], new_content_string, df['url'][i], new_content_token_len])
        start += ideal_size
        end += ideal_size
Now that our text is chunked into appropriately sized pieces, we can create embeddings for each chunk of text using the OpenAI API.
We’ll use this helper function to create embeddings for a piece of text:
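A minimal sketch of such a helper, assuming the text-embedding-ada-002 model and the openai library’s Embedding endpoint:

# Helper function: get an embedding vector for a piece of text from the OpenAI API
def get_embedding(text, model='text-embedding-ada-002'):
    response = openai.Embedding.create(input=text.replace('\n', ' '), model=model)
    return response['data'][0]['embedding']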
As an optional but recommended step, you can save the original blog content along with the associated
embeddings to a CSV file for reference later on, so that you don't have to recreate the embeddings if you want to
reference them in another project.
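For example, assuming the chunked content has been collected into a dataframe (here a hypothetical df_new built from the new_list created above) and each chunk has been embedded with the helper defined earlier, the following sketch writes everything out:

# Build a dataframe from the chunked content and attach the embeddings (illustrative)
df_new = pd.DataFrame(new_list, columns=['title', 'content', 'url', 'tokens'])
df_new['embeddings'] = df_new['content'].apply(get_embedding)
# Save so the data and embeddings can be reloaded later without re-calling the API
df_new.to_csv('blog_data_and_embeddings.csv', index=False)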
A vector database is a database that can handle vector data. Vector databases are useful for:
Semantic search: Vector databases facilitate semantic search, which considers the context or meaning
of search terms rather than just exact matches. They are useful for recommendation systems, content
discovery, and question-answering systems.
Efficient similarity search: Vector databases are designed for efficient high-dimensional nearest
neighbor search, a task where traditional relational databases struggle.
Machine learning: Vector databases store and search embeddings created by machine-learning models.
This feature aids in finding items semantically similar to a given item.
Multimedia data handling: Vector databases also excel in working with multimedia data (images, audio,
video) by converting them into high-dimensional vectors for efficient similarity search.
NLP and data combination: In Natural Language Processing (NLP), vector databases store high-
dimensional vectors representing words, sentences, or documents. They also allow a combination of
traditional SQL queries with similarity searches, accommodating both structured and unstructured data.
We’ll use PostgreSQL with the pgvector extension installed as our vector database. Pgvector extends
PostgreSQL to handle vector data types and vector similarity search, like nearest neighbor search, which we’ll
use to find the k most related embeddings in our database for a given user prompt.
Here are five reasons why PostgreSQL is a good choice for storing and handling vector data:
Integrated solution: By using PostgreSQL as a vector database, you keep your data in one place. This
can simplify your architecture by reducing the need for multiple databases or additional services.
Enterprise-level robustness and operations: With a 30-year pedigree, PostgreSQL provides world-class
data integrity, operations, and robustness. This includes backups, streaming replication, role-based and
row-level security, and ACID compliance.
Full-featured SQL: PostgreSQL supports a rich set of SQL features, including joins, subqueries, window
functions, and more. This allows for powerful and complex queries that can include both traditional
relational data and vector data. It also integrates with a plethora of existing data science and data
analysis tools.
Scalability and performance: PostgreSQL is known for its robustness and ability to handle large
datasets. Using it as a vector database allows you to leverage these characteristics for vector data as
well.
Open source: PostgreSQL is open source, which means it's free to download and use, and you can
modify it to suit your needs. It also means that it benefits from the collective input of developers all over
the world, which often results in high-quality, secure, and up-to-date software. PostgreSQL has a large
and active community, so help is readily available. There are many resources, such as documentation,
tutorials, forums, and more, to help you troubleshoot and optimize your PostgreSQL database.
First, we’ll create a PostgreSQL database. You can create a cloud PostgreSQL database in minutes for free on
Timescale or use a local PostgreSQL database for this step.
Once you’ve created your PostgreSQL database, export your connection string as an environment variable,
and just like the OpenAI API key, we’ll read it into our Python program from the environment file:
connection_string = os.environ['TIMESCALE_CONNECTION_STRING']
We then connect to our database using the popular psycopg2 Python library and install the pgvector
extension as follows:

# Connect to the database and install the pgvector extension
conn = psycopg2.connect(connection_string)
cur = conn.cursor()
cur.execute("CREATE EXTENSION IF NOT EXISTS vector")
conn.commit()
Once we’ve installed pgvector, we use the register_vector() command to register the vector type with our
connection:
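A minimal example, assuming conn is the psycopg2 connection created above:

# Register the vector type with psycopg2 so NumPy arrays map to pgvector's vector type
register_vector(conn)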
Once we’ve connected to the database, let’s create a table that we’ll use to store embeddings along with
metadata. Our table will look as follows:
title is the blog title from which the content associated with the embedding is taken.
url is the blog URL from which the content associated with the embedding is taken.
One advantage of using PostgreSQL as a vector database is that you can easily store metadata and
embedding vectors in the same database, which is helpful for supplying the user with relevant information related
to the response they receive, like links to read more or specific parts of a blog post that are relevant to them.
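Here’s a sketch of the table definition, assuming the 1,536-dimension vectors produced by text-embedding-ada-002 and the column names used in the batch insert below:

table_create_command = """
CREATE TABLE IF NOT EXISTS embeddings (
    id bigserial PRIMARY KEY,
    title text,
    url text,
    content text,
    tokens integer,
    embedding vector(1536)
);
"""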
cur.execute(table_create_command)
cur.close()
conn.commit()
2.3 Ingest and store vector data into PostgreSQL using pgvector
Now that we’ve created the database and created the table to house the embeddings and metadata, the final
step is to insert the embedding vectors into the database.
For this step, it’s a best practice to batch insert the embeddings rather than insert them one by one.
#Batch insert embeddings and metadata from dataframe into PostgreSQL database
register_vector(conn)
cur = conn.cursor()
# Prepare the list of tuples to insert (df_new holds the chunked content, metadata, and embeddings)
data_list = [(row['title'], row['url'], row['content'], int(row['tokens']), np.array(row['embeddings']))
             for _, row in df_new.iterrows()]
# Use execute_values to perform batch insertion
execute_values(cur, "INSERT INTO embeddings (title, url, content, tokens, embedding) VALUES %s", data_list)
# Commit after we insert all embeddings
conn.commit()
Let’s sanity check by running some simple queries against our newly inserted data:
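For instance, a quick count of the rows we just inserted:

# Sanity check: how many embedding records did we store?
cur.execute('SELECT COUNT(*) FROM embeddings')
num_records = cur.fetchone()[0]
print('Number of vector records in table:', num_records)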
In this example, we only have 129 embedding vectors, so searching through all of them is blazingly fast. But
for larger datasets, you need to create indexes to speed up searching for similar embeddings, so we include
the code to build the index for illustrative purposes.
Pgvector supports the ivfflat index type to speed up approximate nearest neighbor (ANN) searches
(similarity search indexes for high-dimensional data are very often approximate).
You always want to build this index after you have inserted the data, as the index needs to discover clusters
in your data to be effective, and it does this only when first building the index.
The index has a tunable parameter of the number of lists to use, and the code below shows the best practice
for tuning this parameter. You also need to specify the distance measure used for indexing and ensure it
matches the measure you use in your queries. In our case, we use cosine distance for querying below,
and so we create our index with vector_cosine_ops.
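Here’s a sketch of one way to compute num_lists, following pgvector’s guidance of using roughly rows/1000 lists for smaller tables and sqrt(rows) once you pass about one million rows (the floor of 10 lists is an illustrative safeguard for very small tables):

# Calculate the number of ivfflat lists based on the row count of the embeddings table
cur.execute('SELECT COUNT(*) FROM embeddings')
num_records = cur.fetchone()[0]
num_lists = max(num_records // 1000, 10)
if num_records > 1000000:
    num_lists = round(math.sqrt(num_records))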
#use the cosine distance measure, which is what we'll later use for querying
cur.execute(f'CREATE INDEX ON embeddings USING ivfflat (embedding vector_cosine_ops) WITH (lists = {num_lists});')
conn.commit()
To answer a user question with the help of our stored embeddings, we'll do two things:
Use pgvector to perform a vector similarity search and retrieve the k nearest neighbors to the question
embedding from our embedding vectors representing the blog content. In our example, we’ll use k=3,
finding the three most similar embedding vectors and associated content.
Supply the content retrieved from the database as additional context to the model and ask it to perform
a completion task to answer the user question.
First, we’ll define a sample question that a user might want to answer about the blog posts stored in the
database.
Since Timescale is popular for IoT sensor data, a user might want to learn specifics about how they can
leverage it for that use case.
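For illustration, we'll use a hypothetical question along these lines (any question about your own content works the same way):

# Example user question about Timescale's IoT use cases (wording is illustrative)
input_1 = 'How is Timescale used in IoT?'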
Here’s the function we use to find the three nearest neighbors to the user question. Note that it uses pgvector’s
<=> operator, which computes the cosine distance between two embedding vectors (the smaller the distance,
the more semantically similar the vectors).
# Helper function: Get top 3 most similar documents from the database
def get_top3_similar_docs(query_embedding, conn):
    embedding_array = np.array(query_embedding)
    # Register pgvector extension
    register_vector(conn)
    cur = conn.cursor()
    # Get the top 3 most similar documents using the KNN <=> operator
    cur.execute("SELECT content FROM embeddings ORDER BY embedding <=> %s LIMIT 3", (embedding_array,))
    top3_docs = cur.fetchall()
    return top3_docs
We supply helper functions to create an embedding for the user question and to get a completion response
from an OpenAI model. We use GPT-3.5, but you can use GPT-4 or any other model from OpenAI.
We also specify a number of parameters, such as a limit on the maximum number of tokens in the model
response and the model temperature, which controls the randomness of the output; you can modify these to your
liking:
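A sketch of the completion helper, assuming the gpt-3.5-turbo chat model and the openai library’s ChatCompletion endpoint; the temperature and max_tokens defaults here are illustrative:

# Helper function: get a chat completion from an OpenAI model
def get_completion_from_messages(messages, model='gpt-3.5-turbo', temperature=0, max_tokens=1000):
    response = openai.ChatCompletion.create(
        model=model,
        messages=messages,
        temperature=temperature,   # controls randomness of the output
        max_tokens=max_tokens,     # caps the length of the model's response
    )
    return response['choices'][0]['message']['content']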
We’ll define a function to process the user input by retrieving the most similar documents from our database
and passing the user input, along with the relevant retrieved context, to the OpenAI model to generate a
completion response.
Note that we modify the system prompt as well in order to influence the tone of the model’s response.
We pass to the model the content associated with the three most similar embeddings to the user input using
the assistant role. You can also append the additional context to the user message.
# Function to process input with retrieval of most similar documents from the database
def process_input_with_retrieval(user_input):
    delimiter = "```"
    # Retrieve the three most relevant content chunks and pass them to the model via the assistant role
    related_docs = get_top3_similar_docs(get_embedding(user_input), conn)
    messages = [
        {"role": "system", "content": "You are a friendly chatbot that answers questions about Timescale use cases in a concise, technically credible tone."},  # system prompt wording is illustrative
        {"role": "user", "content": f"{delimiter}{user_input}{delimiter}"},
        {"role": "assistant", "content": "Relevant Timescale information: " + " ".join(doc[0] for doc in related_docs)}]
    return get_completion_from_messages(messages)
Let’s see an example of the model’s output to our original input question:
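Assuming the example question defined earlier, the call mirrors the input_2 example shown later in the post:

# Get and print the model's answer to the example question
response_1 = process_input_with_retrieval(input_1)
print(input_1)
print(response_1)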
Model response:
TimescaleDB is commonly used in IoT (Internet of Things) applications for storing and analyzing time-series
data generated by IoT devices. IoT devices generate a large volume of data over time, such as sensor
readings, device status updates, and alarm information. TimescaleDB provides a scalable and efficient
solution for storing and querying this time-series data.
With TimescaleDB, IoT applications can benefit from features such as automatic data partitioning,
compression, and retention policies. These features allow for efficient storage and retrieval of time-series
data, even as the data volume grows. TimescaleDB also supports SQL queries, making it easy to perform
complex analytics and aggregations on the time-series data.
In IoT applications, TimescaleDB can be used to monitor and analyze various aspects, such as environmental
conditions (temperature, humidity), energy consumption, equipment performance, and predictive
maintenance. It enables real-time monitoring, anomaly detection, and trend analysis, helping businesses
make data-driven decisions and optimize their operations.
Overall, TimescaleDB provides a reliable and scalable database solution for handling the high-volume time-
series data generated by IoT devices, enabling businesses to unlock valuable insights and improve their IoT
applications.
We can also ask the model questions about specific documents in the database, in this case about specific
Timescale users who have spoken about their experience in a blog post:
input_2 = "Tell me about Edeva and Hopara. How do they use Timescale?"
response_2 = process_input_with_retrieval(input_2)
print(input_2)
print(response_2)
Here’s the model output for the input question above; notice how it uses specific details from the blog posts.
User input: Tell me about Edeva and Hopara. How do they use Timescale?
Model response:
Edeva and Hopara are two companies that utilize TimescaleDB for their data storage and visualization needs.
Edeva uses TimescaleDB as the main database in their smart city system. Their clients can control their IoT
devices and view captured data, gaining insights from trends and historical data. They leverage
TimescaleDB's continuous aggregations feature to speed up queries and improve dashboard performance.
Hopara, on the other hand, provides a sophisticated visualization system that allows users to derive insights
from various types of data. They use TimescaleDB to store real-time vibration data from sensor-tagged
machines. Hopara's visualization system, powered by TimescaleDB, enables users to drill down into the data
and identify vibration issues.
Both Edeva and Hopara benefit from TimescaleDB's time-series functionality and its ability to handle large
amounts of data efficiently.
Conclusion
Retrieval Augmented Generation (RAG) is a powerful method of building applications with LLMs that enables
you to teach foundation models about things they were not originally trained on, like private documents or
recently published information.
We covered the basics of creating a chatbot to answer questions about a blog. We used the content from the
Timescale Developer Q&A blog posts as an example to show how to create, store, and perform similarity
search on OpenAI embeddings. We used PostgreSQL and pgvector as our vector database to store and query
the embeddings.
Jupyter Notebook and Code: You can find all the code used in this tutorial in a Jupyter
Notebook, as well as sample content and embeddings on the Timescale GitHub:
timescale/vector-cookbook.
And if you’re looking for a production PostgreSQL database for your vector workloads, try Timescale. It’s free
for 30 days, no credit card required.