Question 1

Do I need a dedicated vector database or can I use pgvector?

Accepted Answer

For most agent applications, pgvector or a similar extension in your existing database is sufficient and simpler to manage. Dedicated vector databases become worthwhile when you have millions of vectors, need sub-millisecond search latency, or require advanced features like automatic reindexing. Start simple and migrate if you hit limits.

Question 2

How much storage do vector embeddings require?

Accepted Answer

A single 1,536-dimensional embedding (common for OpenAI models) takes about 6 KB. One million embeddings would require roughly 6 GB of vector storage, plus metadata and index overhead. For email agents processing thousands of messages per day, storage is rarely the bottleneck — search speed and index management matter more.

Question 3

Can vector databases replace traditional search?

Accepted Answer

Not entirely. Vector search excels at finding semantically similar content but can miss exact matches. Traditional keyword search is better for finding specific terms, IDs, or exact phrases. Most production systems use hybrid search — combining vector similarity with keyword matching — to get the best of both approaches.

Question 4

What is the difference between a vector database and an embedding?

Accepted Answer

An embedding is a numerical representation (vector) of a piece of text, generated by an AI model. A vector database is the storage and search system that holds millions of these embeddings and lets you efficiently find the most similar ones to a query. The database indexes and queries embeddings; it does not create them.

Question 5

How do email agents use vector databases?

Accepted Answer

Email agents store embeddings of processed emails in a vector database, creating a searchable semantic index of all past communications. When a new email arrives about a topic discussed months ago, the agent embeds the query, searches for similar past emails, and retrieves relevant context to inform its response.

Question 6

What is hybrid search in vector databases?

Accepted Answer

Hybrid search combines vector similarity search with traditional keyword search in a single query. This catches both semantically similar results (different words, same meaning) and exact keyword matches. Most production RAG systems use hybrid search to improve retrieval accuracy over either method alone.

Question 7

How fast is vector database search?

Accepted Answer

Most vector databases return results in single-digit milliseconds for collections of millions of vectors, using approximate nearest neighbor algorithms like HNSW. Search speed depends on the index type, vector dimensionality, and collection size. For email agents, latency is rarely an issue since email processing is not sub-millisecond sensitive.

Question 8

What is metadata filtering in vector databases?

Accepted Answer

Metadata filtering lets you narrow vector search results by structured attributes before or during the similarity comparison. For email agents, this means searching for "emails similar to this complaint, from this specific sender, in the last 30 days" by combining vector similarity with metadata constraints in a single query.

Question 9

How often should email agents update their vector database?

Accepted Answer

Agents should embed and store new emails as they are processed, keeping the vector database current. Batch updates work for non-time-sensitive applications, but real-time indexing is better for agents that need immediate access to recent context. Stale indexes mean the agent cannot reference recent conversations.

Question 10

What are the main vector database indexing algorithms?

Accepted Answer

The most common is HNSW (Hierarchical Navigable Small World), which offers fast search with high recall. IVF (Inverted File Index) is another option that trades some accuracy for lower memory usage. Product quantization compresses vectors to reduce storage. The choice depends on your balance of speed, accuracy, and memory constraints.

Vector Database

What is a vector database?#

Why it matters for AI agents#

Frequently asked questions

Related terms

Embeddings

RAG (Retrieval-Augmented Generation)