
Semantic Search

Semantic search finds sessions based on meaning rather than exact keywords. It uses vector embeddings to match your query against session content, returning results that are conceptually similar even when they use different words.

How it works

  1. When a session is synced, OpenSync generates a 1536-dimensional vector embedding from the session’s searchable text using OpenAI’s text-embedding-3-small model.
  2. The embedding is stored in the sessionEmbeddings table alongside a text hash for change detection.
  3. When you search, your query is also converted to an embedding using the same model.
  4. Convex’s vector search compares the query embedding against all stored embeddings using cosine similarity.
  5. Results are ranked by similarity score (0.0 to 1.0, where 1.0 is identical).
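The ranking step above boils down to cosine similarity between the query embedding and each stored embedding. Convex's vector index performs this comparison internally and at scale; this small sketch only illustrates the metric itself:

```typescript
// Cosine similarity between two embedding vectors.
// Convex's vector search computes this internally; this is
// only an illustration of the ranking metric.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Identical direction scores 1.0; orthogonal vectors score 0.0.
console.log(cosineSimilarity([1, 0], [1, 0])); // 1
console.log(cosineSimilarity([1, 0], [0, 1])); // 0
```

In practice the vectors are the 1536-dimensional embeddings described above, and scores for relevant matches typically land between 0.5 and 1.0.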

Embedding model

| Model | Dimensions | Cost | Notes |
| --- | --- | --- | --- |
| text-embedding-3-small | 1536 | $0.02 per 1M tokens | Default model used by OpenSync |
The cost is negligible. A typical session produces 2K-10K tokens of searchable text, so at $0.02 per million tokens a single session costs a fraction of a cent, and embedding 1,000 sessions costs roughly 4-20 cents.
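A quick sanity check on the pricing figures above:

```typescript
// Estimated embedding cost for text-embedding-3-small,
// using the $0.02 per 1M tokens rate quoted above.
const COST_PER_MILLION_TOKENS = 0.02; // USD

function embeddingCostUSD(tokens: number): number {
  return (tokens / 1_000_000) * COST_PER_MILLION_TOKENS;
}

// A single large (10K-token) session: about $0.0002.
console.log(embeddingCostUSD(10_000));

// 1,000 such sessions: about $0.20.
console.log(embeddingCostUSD(10_000 * 1000));
```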

What gets embedded

Session embeddings are generated from sessions.searchableText, which concatenates:
  • Session title
  • All user message text
  • All assistant message text
Tool call names and results are not included in the embedding. Only human-readable text is vectorized.
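The concatenation rule can be sketched as follows. Note that the `buildSearchableText` helper and the message shape here are illustrative assumptions, not OpenSync's actual implementation:

```typescript
// Illustrative message shape; field names are assumptions.
interface Message {
  role: "user" | "assistant" | "tool";
  text: string;
}

// Hypothetical helper mirroring the rules above: title plus user
// and assistant text, with tool calls/results deliberately excluded.
function buildSearchableText(title: string, messages: Message[]): string {
  const parts = [title];
  for (const m of messages) {
    if (m.role === "user" || m.role === "assistant") {
      parts.push(m.text);
    }
  }
  return parts.join("\n");
}

const text = buildSearchableText("Fix CORS errors", [
  { role: "user", text: "Why is my fetch blocked?" },
  { role: "tool", text: '{"status": 403}' }, // not embedded
  { role: "assistant", text: "Add the Access-Control-Allow-Origin header." },
]);
// text contains the title and both human-readable messages, not tool output
```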

Embedding storage

Each embedding is stored with these fields:
| Field | Type | Description |
| --- | --- | --- |
| sessionId | Id | Reference to the parent session |
| embedding | float[1536] | The vector embedding |
| textHash | string | SHA-256 hash of the source text |
| createdAt | number | Timestamp of generation |
The textHash field enables idempotency. If a session is re-synced with the same text content, the embedding is not regenerated.

Message-level embeddings

In addition to session-level embeddings, OpenSync generates embeddings for individual messages. These are stored in the messageEmbeddings table and enable finer-grained search within specific conversations.

Examples

Natural language question

> How do I fix CORS errors in a Convex HTTP endpoint?

Returns sessions discussing CORS configuration, HTTP endpoint setup, and related error handling, even if they never use the exact phrase “CORS errors.”

> patterns for handling database migrations

Returns sessions about schema changes, data backfilling, and migration strategies across different tools and frameworks.

> my React component re-renders too often

Returns sessions about performance optimization, memoization, and state management, even if the conversations used terms like “useCallback” or “useMemo” rather than “re-renders.”

Requirements

Semantic search requires an OpenAI API key:
  • Hosted version: Already configured. No action needed.
  • Self-hosted: Set the OPENAI_API_KEY environment variable on your Convex deployment.
```shell
npx convex env set OPENAI_API_KEY sk-...
```
Without an OpenAI key, semantic search is disabled and sessions will not have embeddings generated.

Using in the dashboard

  1. Go to the Context tab in the sidebar.
  2. Select Semantic as the search type.
  3. Type a natural language query.
  4. Results appear ranked by similarity score.
Each result shows the session title, a content snippet, and the similarity score (higher is better).

Using via API

```shell
curl -X POST "https://polished-penguin-622.convex.site/search" \
  -H "Authorization: Bearer osk_your_key" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "how do I implement authentication with OAuth",
    "type": "semantic",
    "limit": 10
  }'
```
Response:
```json
{
  "results": [
    {
      "sessionId": "j57a...",
      "title": "OAuth Setup with WorkOS",
      "snippet": "Setting up the OAuth callback handler...",
      "score": 0.89
    },
    {
      "sessionId": "k82b...",
      "title": "Fix login redirect loop",
      "snippet": "The auth middleware was redirecting before the token was set...",
      "score": 0.82
    }
  ]
}
```
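The same request can be made from TypeScript. This is a minimal sketch built on the endpoint, headers, and request body shown in the curl example; the `SearchResult` interface mirrors the response fields above, and the URL and key are the same placeholders:

```typescript
// Result shape, mirroring the response fields shown above.
interface SearchResult {
  sessionId: string;
  title: string;
  snippet: string;
  score: number;
}

// Build the request separately so it can be inspected or tested
// without hitting the network. URL and key are placeholders.
function buildSearchRequest(query: string, limit = 10) {
  return {
    url: "https://polished-penguin-622.convex.site/search",
    method: "POST" as const,
    headers: {
      Authorization: "Bearer osk_your_key",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ query, type: "semantic", limit }),
  };
}

async function semanticSearch(query: string): Promise<SearchResult[]> {
  const { url, ...init } = buildSearchRequest(query);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`Search failed: ${res.status}`);
  const data = (await res.json()) as { results: SearchResult[] };
  return data.results;
}
```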

Tips

  • Ask questions naturally. Semantic search works best with complete questions or descriptions, not individual keywords.
  • Be specific about the domain. “How to handle auth in a Next.js app” performs better than “auth.”
  • Embedding latency. Newly synced sessions may take a few seconds to appear in semantic search results while embeddings are generated asynchronously.
  • Score threshold. Results with scores below 0.5 are typically not relevant. The dashboard hides very low-scoring results automatically.
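The score-threshold tip is easy to apply client-side when consuming the API directly. The 0.5 cutoff below is the rule of thumb stated above; the dashboard's exact internal cutoff may differ:

```typescript
interface ScoredResult {
  sessionId: string;
  score: number; // cosine similarity, 0.0 to 1.0
}

// Drop results below a relevance threshold and keep them
// ordered best-first.
function filterRelevant(
  results: ScoredResult[],
  minScore = 0.5
): ScoredResult[] {
  return results
    .filter((r) => r.score >= minScore)
    .sort((a, b) => b.score - a.score);
}

const filtered = filterRelevant([
  { sessionId: "a", score: 0.89 },
  { sessionId: "b", score: 0.31 }, // below threshold, dropped
  { sessionId: "c", score: 0.62 },
]);
// filtered keeps "a" and "c", highest score first
```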

Comparison with full-text

| Aspect | Full-text | Semantic |
| --- | --- | --- |
| Best for | Exact terms, function names, error codes | Conceptual questions, problem descriptions |
| Speed | Faster (index lookup) | Slightly slower (embedding + vector search) |
| Cost | No additional cost | OpenAI embedding cost ($0.02/1M tokens) |
| Requires | Nothing extra | OpenAI API key |
For combining both approaches, see Hybrid Search.