Question 1

How is semantic search different from keyword search?

Accepted Answer

Keyword search matches the literal words in the query against the words in the index. Semantic search matches the meaning of the query against the meaning of the index, using vector embeddings. Semantic search handles paraphrasing and synonyms naturally. Keyword search handles exact terms (product codes, names) more reliably. Most production systems use both in a hybrid configuration.

Question 2

Do I need a vector database for semantic search?

Accepted Answer

For anything beyond a small prototype, yes. Dedicated vector databases (Pinecone, Weaviate, Qdrant) provide the indexing, filtering, and concurrent query performance needed for production. Below 10,000 documents you can run in memory with FAISS or pgvector on Postgres. Above that, the dedicated databases pay for themselves in operational simplicity.

Question 3

Which embedding model should I use?

Accepted Answer

Depends on use case and data sensitivity. OpenAI text-embedding-3 models are the default high-quality choice for most teams. Cohere and Voyage offer competitive alternatives. For sensitive data, open-source models like BGE or E5 run on infrastructure you control. Match the embedding model used at index time and query time, or the math breaks.

Question 4

What is hybrid search and why does it usually win?

Accepted Answer

Hybrid search combines semantic vector results with keyword BM25 results, then reranks the combined set with a cross-encoder model. Semantic search alone misses exact terms like product codes. Keyword search alone misses paraphrasing. The hybrid combination handles both failure modes and typically improves retrieval quality 15 to 30 percent over either alone.