Question 1

Do I need a dedicated vector database for small projects?

Accepted Answer

Below 10,000 chunks you can run vector search in memory with FAISS or as a Postgres extension with pgvector. Above 100,000 chunks a dedicated database starts paying off in query latency and operational simplicity. The line moves with query volume and the cost of hosting your own infrastructure versus paying a managed service.

Question 2

Which vector database should I pick?

Accepted Answer

Pinecone is the easiest managed option for teams that want zero ops. Weaviate and Qdrant give you more control and run self-hosted. pgvector sits inside an existing Postgres deployment, the right call when you already run Postgres and want one fewer system to operate. Milvus scales to billions of vectors for the rare cases that need it.

Question 3

How does a vector database compare to Elasticsearch?

Accepted Answer

Elasticsearch indexes text by keywords using BM25 scoring. A vector database indexes by semantic meaning using embedding similarity. Production systems often combine both in a hybrid search pattern, with keyword search catching exact matches like product SKUs and vector search catching paraphrases and conceptual queries.

Question 4

What does it cost to run one in production?

Accepted Answer

Pinecone serverless starts around $50 a month for small workloads and scales with stored vectors and query volume. Self-hosted Qdrant or pgvector costs you the underlying compute, typically $100 to $400 a month for mid-sized RAG deployments. Embedding generation itself usually costs more than database hosting once volume picks up.