Build on Redis with one-click using Augment Code’s Easy MCP
If you’re a developer or vibe coder you’re probably already familiar with AI-powered coding assistants like Augment Code. These tools have a number of features, but they are all working ...
Read now
Blog
JUL.28.2025
Use Redis natively on Windows with Memurai
Redis is now available to use natively on Windows. We’ve partnered with Memurai to develop Memurai Enterprise Edition, a fully compatible and officially supported Windows port of Redis t...
Read now
Blog
JUL.28.2025
Get faster LLM inference and cheaper responses with LMCache and Redis
As GenAI applications evolve—think agents, copilots, and chatbots—developers need more than just a connection to an LLM. These applications demand fast, cost-efficient inference. By defa...
Read now
Blog
JUL.23.2025
LFU vs. LRU: How to choose the right cache eviction policy
Least Frequently Used (LFU) and Least Recently Used (LRU) are two of the most common cache eviction policies for determining which data to evict when a cache fills up. Without a policy i...
Read now
Blog
JUL.23.2025
10 techniques to improve RAG accuracy
Retrieval-augmented generation (RAG) has rapidly become the go-to pattern for combining large language models (LLMs) with up-to-date, domain-specific knowledge. By grounding generative o...
Read now
Blog
JUL.22.2025
Redis 8 brings vector sets and is now in preview on Redis Cloud Essentials
Redis 8 is now available for preview on Redis Cloud Essentials. Starting today, you can create a Redis 8 database and experience the power of our most performant Redis version yet. Whet...
Read now
Blog
JUL.21.2025
Retrieval optimizer: Custom data
In the previous blog posts on grid search and bayesian optimization with the retrieval optimizer we made things easy by pulling pre-defined and formatted data. However, let’s say you hav...
Read now
Blog
JUL.21.2025
Retrieval optimizer: Bayesian optimization
In the last article, we introduced the retrieval optimizer, why it matters for eval driven development (EDD) and how to get started with a basic grid search. Often selecting the right se...
Read now
Blog
JUL.21.2025
Retrieval optimizer: Grid search
One of the most valuable pieces of feedback I’ve ever received as an engineer came as an intern. I was debugging some data issues and, truthfully, didn’t really know what was going on. W...
Read now
Blog
JUL.18.2025
Outsmarting fraud in real-time: How Redis powers intelligent fraud detection
Fraudsters don’t wait—and neither can your systems. Financial crime is faster, smarter, and more coordinated than ever. Every delayed response is a potential breach, a lost customer, or...
Read now
Blog
JUL.14.2025
Agentic RAG: How enterprises are surmounting the limits of traditional RAG
ChatGPT launched in 2022, triggering the current AI wave, and in 2023, enterprises invested $2.3 billion in AI – an amount easily cleared by the $13.8 billion spent in 2024. But while c...
Read now
Blog
JUL.10.2025
Speed is accuracy: Why Redis Query Engine leads in vector search
In today's AI-driven landscape, the difference between retrieving relevant information and irrelevant noise can make or break your application's value proposition. Whether you're buildin...
Read now
Blog
JUL.8.2025
Build faster AI memory with Cognee & Redis
Large language models are powerful, but they forget quickly. For AI agents and assistants to be truly useful, they need memory: the ability to retain instructions, recall facts, and carr...
Read now
Blog
JUL.7.2025
Why your cache hit ratio strategy needs an update
A cache hit ratio is a simple concept. It’s the proportion of data requests served from a cache, measured against total requests received by the cache. The former includes the “hits” (wh...
Read now
Blog
JUL.2.2025
What’s new in two: June 2025 Edition
Welcome to “What’s new in two,” your quick hit of Redis releases you might have missed in the past month. We’re covering the latest developments from June and expanding on what I covered...
Read now
Blog
JUN.23.2025
Why vector embeddings are here to stay
Every few weeks, a new GenAI model arrives—smarter, faster, cheaper, and poised to outshine the rest. The claims stay the same, even as the names change. In just the past few weeks alone...
Read now
Blog
JUN.20.2025
LLM chunking
The key to better retrieval, lower latency, and higher accuracy LLM chunking is the process of breaking large amounts of data into smaller, self-contained pieces of data (chunks) befor...
Read now
Blog
JUN.20.2025
From tap to transaction: power real-time mobile banking with Redis
From branches to smartphones Today, a bank’s most important branch fits in the palm of your hand. Mobile apps have revolutionized banking, turning time-consuming tasks—like depositing c...
Read now
Blog
JUN.13.2025
Why your caching strategies might be holding you back (and what to consider next)
Caching is the process of storing copies of data in temporary storage, allowing new requests for that data to be served more quickly. Without that temporary storage – the cache – request...
Read now
Blog
JUN.12.2025
Scale your LLM gateway with LiteLLM & Redis
As developers build increasingly advanced GenAI apps like chatbots, agents, and copilots, the infrastructure behind them needs to keep up. Connecting to a large language model is just th...
Read now
Blog
JUN.10.2025
How hierarchical navigable small world (HNSW) algorithms can improve search
Do you know Kevin Bacon? The six degrees of separation rule says that if you map the right social connections, all people are six or fewer handshakes away from each other—including Kevin...