Back to blog
ArticleBy Team Valor AINovember 6, 2025

What We Learned from Building Semantic Search for Veterans

Read the full note here, then bring the context into chat if you want help applying it to your own claim-prep workflow.

Ask ValorAI
Listen to this article

Audio playback is available only to paid users of the platform.

Checking audio access…

If you've spent any time on r/VeteransBenefits, you know the value of peer support. Over 500,000 veterans gather there to share claims advice, decode VA letters, and celebrate rating increases together. Those discussions prove something essential: veterans help veterans better than anyone else.

We originally explored a dedicated discussion product because that peer-to-peer knowledge deserves tools that make it easier to find, organize, and apply. That surface is gone now, but the retrieval work behind it still shaped how ValorAI thinks about search.


The Problem with Generic Search

Reddit works. r/VeteransBenefits has become the go-to resource for real-world VA claims guidance. But it's also a general-purpose platform not designed for veterans' specific needs:

  • Search limitations – Finding that post about TDIU eligibility from six months ago means scrolling or hoping your keywords match exactly
  • No benefit categories – Posts about healthcare, disability, education, and home loans all mix together
  • Generic features – Built for cat videos and memes, not VA decision letters and nexus letters
  • Scattered knowledge – Critical information buried in 500+ comment threads
  • Unclear terminology – Platform doesn't understand the difference between "total disability" and "TDIU," between "100% schedular" and "TDIU with SMC-S"

We love Reddit. But veterans deserve better retrieval than keyword roulette.


What We Were Trying to Build

🔍 AI-Powered Semantic Search

This is the game-changer. Our three-mode search system doesn't just match keywords. It understands meaning.

Search Modes

Keyword Mode – Traditional search
Type "tinnitus rating" and find posts with those exact words.

Semantic Mode – AI-powered understanding
Search for "ear ringing compensation" and find posts about tinnitus ratings, even if they never use those exact words. Search "unemployability benefits" and discover TDIU discussions. It connects concepts, not keywords.

Hybrid Mode – The best of both
Combines keyword matching with semantic relevance. Like having a VSO who remembers every conversation.

Real Example

Search "TDIU requirements" and get:

  • "Can I work part-time with Total Disability based on Individual Unemployability?"
  • "100% P&T vs unemployability: which is better?"
  • "Does TDIU count for Chapter 35 dependent benefits?"

All perfectly relevant. None with your exact search terms.


Built for Veterans. Built Right.

We didn't just copy Reddit. We rebuilt it from the ground up for veterans:

Product ideas we explored

Reddit/StackOverflow-style posts
Text and link posts with full markdown formatting

Nested comment threads
Deep discussions, just like Reddit

Upvote/downvote system
Surface the best answers to the top

Post editing
Correct information with "edited X minutes ago" badges

User profiles
Separate post and comment karma tracking

Save/bookmark
Keep posts for later reference

VA-Specific Benefit Tags

Not generic tags like "question" or "discussion," but actual benefit categories:

Healthcare & Education

  • Disability Compensation
  • Healthcare & Medical
  • GI Bill & Education

Financial Benefits

  • Home Loans (VA-backed)
  • Pension & Survivors Benefits
  • Life Insurance (SGLI/VGLI)

Services & Support

  • Employment & VR&E
  • Mental Health Support
  • Special Monthly Compensation (SMC)
  • Records & Documents
  • Appeals & Claims Process
  • Burial & Memorial

Authors can tag posts with up to 5 categories for maximum discoverability. Filter by tags to see only posts about education benefits, home loans, or any other category.

Advanced Search Capabilities

  • Keyboard shortcuts – Press forward slash or Cmd+K to search instantly
  • Real-time search – 300ms debouncing for smooth search-as-you-type
  • Shareable searches – URL parameters let you share searches with others
  • Rate limiting – 15 searches per minute prevents abuse
  • Infinite scroll – Browse results naturally

Reputation & Trust

Earn karma by posting helpful content and getting upvotes:

  • Post karma – Total upvotes on your posts
  • Comment karma – Total upvotes on your comments
  • Activity tracking – See posts created, comments written
  • User profiles – Browse everything someone has contributed

How Semantic Search Actually Works

Under the hood, every post gets converted into a 1,536-dimensional vector embedding using OpenAI's text-embedding-3-small model. These embeddings capture the semantic meaning: what it's actually about.

The process:

  1. Convert your search query into vector format
  2. Calculate cosine similarity between your query and every post
  3. Rank by semantic relevance (minimum 60% match threshold)
  4. Merge with keyword results for hybrid search
  5. Return the most relevant posts, regardless of exact wording

It's the same technology powering ChatGPT, now optimized for VA benefits.

Technical Stack:

  • OpenAI text-embedding-3-small for semantic understanding
  • PostgreSQL with JSONB storage (pgvector-ready for scale)
  • Three search strategies: keyword, semantic, hybrid
  • Relevance scoring with title match boosting
  • Configurable similarity thresholds (default 0.60)

Getting Started

This post documents the search and retrieval direction behind that experiment. The dedicated discussion product is not active today, so the practical path now is to use ValorAI Chat and the knowledge base while we focus on the core product.

For Current Users

  1. Sign in with your ValorAI account
  2. Ask your question in ValorAI Chat
  3. Upload relevant forms or decision letters for more context
  4. Use the knowledge base when you need reference material alongside chat

Why keep this post around?

  • It explains the search and retrieval ideas we explored
  • It preserves the product rationale behind that work
  • It preserves the retrieval patterns that still matter for the core product

What's Coming Next

This is the MVP (v4.0.0), but we're already planning:

  • Expert badges – Recognize VSOs, attorneys, and high-karma contributors
  • Best answer selection – Let authors mark the most helpful response
  • Cross-posting to chat – Ask ValorAI to analyze posts and provide personalized guidance
  • Trending topics – See the most-discussed benefits questions each week
  • Email digests – Weekly roundup of top posts in your categories
  • Better retrieval UX – Tighter ways to move between search, chat, and evidence review

Why This Matters

Veterans already share incredible knowledge online. Our job is to make that knowledge findable, organized, and actionable.

Semantic search means the answer you need is never more than a few keystrokes away, even if you don't know the exact VA terminology.

This was never about replacing r/VeteransBenefits. It was about understanding how veterans search for answers, how semantic retrieval should work for VA topics, and how to carry those lessons back into the product we are actually shipping.

Need help today? Head to /chat and start working through your claim questions with ValorAI.

  • Team ValorAI

Continue here

Use the product directly after reading the article.

Chat with ValorAI →