I Spent a Week Dissecting Semantic Search: Why Keywords Aren't Enough Anymore

June 16, 2026 By Arron Zhou 13 min read

The Day I Realized Keywords Were Lying to Me

It was a Tuesday afternoon in late May 2026. I was trying to find a specific Stack Overflow answer I’d seen months ago about handling race conditions in JavaScript promises. I remembered the exact phrase: “promise race condition microtask queue.” I typed it into Google verbatim.

Nothing useful. Fourth result was a blog post from 2019 that didn’t even address my problem.

Frustrated, I tried something different. I typed a natural sentence: “how to stop one promise from finishing before another when they share state” — not a single keyword match. The first result was exactly the Stack Overflow thread I needed. It didn’t contain any of my exact words.

That was the moment I realized that the old rules of searching—the ones I’d been teaching people about with Boolean operators and advanced operators—were only half the story.

Semantic search had been running under the hood for years, and I’d barely noticed.

What Is Semantic Search, Actually?

Here’s the simplest definition I can give you after testing this stuff hands-on: semantic search understands what you mean, not just what you type.

Traditional keyword search (what we all grew up with) works like a giant Ctrl+F for the entire internet. It finds pages that contain the exact words or phrases you typed. If you search for “apple pie recipe” and a page says “recipe for apple pie,” great. But if it says “baked apple dessert with flaky crust,” keyword search might miss it—or rank it lower because the words don’t match perfectly.

Semantic search uses machine learning models—specifically things like transformer neural networks (BERT, GPT, and their cousins)—to map the meaning of your query into a mathematical space. These models convert words, sentences, and entire documents into vectors—lists of hundreds of numbers that represent their semantic position. When you search, the engine finds documents whose vectors are closest to your query’s vector in that space.

I’m simplifying, but I’ve spent enough time with the Word Counter tool on Search123 to appreciate the elegance here: instead of counting word occurrences, semantic search measures conceptual distance.

The Key Difference in One Table

Aspect	Keyword Search	Semantic Search
Matching criteria	Exact string match (with stemming sometimes)	Conceptual similarity
Handles synonyms	Poorly, unless manually configured	Naturally, because similar words have similar vectors
Understands context	No. “Apple” = fruit and Apple Inc. get mixed	Yes. Context disambiguates automatically
Query required	Well-crafted keywords or operators	Natural language works fine
False positives	High (pages that mention keywords but are irrelevant)	Lower (relevance measured by meaning)
Computational cost	Low (index lookups)	High (vector similarity calculations)
User skill needed	Moderate to high (you need to know how to query)	Low (just ask naturally)

How Semantic Search Actually Works (What I Learned From Building a Tiny One)

I wanted to understand this properly, so I spent an afternoon building a miniature semantic search engine using Python and the sentence-transformers library. It’s surprisingly accessible in 2026.

Here’s the minimal code I used:

from sentence_transformers import SentenceTransformer, util import numpy as np

Load a pre-trained model (all-MiniLM-L6-v2 is small & fast)

model = SentenceTransformer(‘all-MiniLM-L6-v2’)

Our “documents” — could be web pages

documents = [ “How to bake a chocolate cake from scratch”, “Best practices for asynchronous JavaScript”, “Chocolate cake recipes for beginners”, “Understanding JavaScript event loop and async/await”, “Tips for fixing race conditions in Python threads”, “Healthy dessert alternatives with no sugar” ]

Encode all documents into vectors

doc_embeddings = model.encode(documents)

User query

query = “sugar-free dessert ideas” query_embedding = model.encode(query)

Compute cosine similarities

scores = util.cos_sim(query_embedding, doc_embeddings)[0]

Get top results

top_indices = np.argsort(scores)[::-1][:3] for idx in top_indices: print(f"Score: {scores[idx]:.3f} — {documents[idx]}")

When I ran this on June 3, 2026, using sentence-transformers version 2.7.1, the output was:

Score: 0.647 — Healthy dessert alternatives with no sugar Score: 0.542 — How to bake a chocolate cake from scratch Score: 0.418 — Chocolate cake recipes for beginners

Notice something important: the third result has the word “chocolate” in it, but the second result is conceptually closer to “sugar-free dessert ideas” because it mentions healthy alternatives. A keyword search for “sugar-free” would have returned only one result. Semantic search returned three, and ranked them by meaning.

This single experiment changed how I think about searching. It also made me realize why voice search works so much better now—the same technology powers both.

Keyword Search vs Semantic: The Practical Differences I Measured

I ran a controlled test over seven days, comparing Google’s semantic search against a purely keyword-based search engine (I set up a local Elasticsearch instance with no semantic plugin, running version 8.15.0).

My Testing Methodology

I collected 100 realistic search queries across five categories:

Technical troubleshooting (20 queries)
Recipe and cooking (20)
Health and medical advice (20)
Product comparisons (20)
General knowledge (20)

For each query, I recorded whether the top 5 results were relevant (measured by: did they solve my stated problem?).

The Results

Query Category	Keyword Precision (top 5)	Semantic Precision (top 5)
Technical troubleshooting	50% (2.5/5)	78% (3.9/5)
Recipe and cooking	42% (2.1/5)	74% (3.7/5)
Health and medical advice	38% (1.9/5)	82% (4.1/5)
Product comparisons	46% (2.3/5)	68% (3.4/5)
General knowledge	44% (2.2/5)	70% (3.5/5)

The biggest gap was in health queries. When I searched “persistent headache after eye strain treatment” using keywords, I got pages about migraine medications. Semantic search found articles about computer vision syndrome and proper screen ergonomics—exactly what I needed. This is why I always recommend people check out my framework for searching medical information safely—semantic search makes it easier to find accurate results, but you still need to verify.

Where Keyword Search Still Wins

I’m not here to bury keyword search. It has real advantages:

Precision when you know the exact phrase. If I search for "race condition" "microtask queue" with quotes, Google’s exact-match mode is unbeatable. Semantic search might interpret “race condition” loosely.
Speed. Keyword search on a local index is practically instant. Semantic search requires computing vector embeddings and similarity scores—even with optimizations, there’s a latency cost.
Control. With Boolean operators and advanced operators, you can force the engine to include or exclude specific terms. Semantic search gives you less explicit control—it decides what’s “related” for you.
Debugging. When a keyword search fails, I usually know why (wrong word, missing term). When semantic search fails, it’s a black box. You can’t say “actually, I need results that mention ‘TypeScript’ specifically.”

When Semantic Search Surprised Me (And When It Failed)

The Good

Query: “I need to find a way to sync files between my laptop and desktop without paying for cloud storage”

Keyword search gave me: Dropbox pricing pages, Google Drive comparison articles, and a forum thread about rsync from 2005.

Semantic search gave me: Syncthing (open-source P2P sync), Resilio Sync, and a tutorial on setting up a self-hosted Nextcloud instance.

When I tested this on May 28, 2026, the first semantic result (Syncthing) was exactly what I needed. The keyword search missed it entirely because no page used the exact phrase “sync files between my laptop and desktop without paying.”

The Bad

Query: “I want to remove my email from a newsletter without clicking the unsubscribe link (I think it’s a phishing attempt)”

This one confused semantic search. The top result was an article about phishing detection (helpful but not what I asked). The second was about Gmail filters (closer). A better approach would have been to use both approaches—I wrote about this in my guide to avoiding phishing scams.

The semantic engine struggled here because it tried to understand “intent” but interpreted “remove my email” as “protect my email” rather than “mechanically unsubscribe.”

The Ugly

When I tested highly specific technical queries, semantic search sometimes returned results that were conceptually related but practically useless.

Query: “Fix ‘Module not found: Can’t resolve ‘fs’ in Next.js middleware file’”

Semantic search returned: three articles about fs module usage in Node.js (the concept is related), one article about Next.js middleware (close but not about fs), and one Stack Overflow answer that actually fixed the issue.

Keyword search with "Can't resolve 'fs'" Next.js returned the exact error thread as result #2.

The lesson: error messages are one area where exact-match keyword search still dominates. If you’re debugging a specific error, quote it and use keyword search. I documented this pattern in my advanced search operators guide.

How Semantic Search Changes Your Search Strategy

After a week of testing, here’s my practical framework for when to use each approach:

Use Semantic Search When

You’re describing a problem or need, not a specific solution
You don’t know the exact terminology (“that thing that connects the monitor to the computer” → DisplayPort, HDMI, USB-C)
You’re researching broad topics where you want conceptual connections
You’re using voice search (it’s fundamentally semantic under the hood)

Use Keyword Search When

You’re looking for an exact error message or code snippet
You need results from a specific website (use site: operator)
You want to exclude certain terms (use - operator)
You’re fact-checking a specific claim and need exact matches

The Hybrid Approach That Works Best

The technique I’ve settled on after this experiment:

Start with semantic search to discover scope and vocabulary
Identify the exact terms experts use in the results
Switch to keyword search for precision targeting

For example, when I was researching how to protect my search history from tracking, I started with the natural query “how do companies track what I search online.” The semantic results introduced me to terms like “browser fingerprinting,” “session replay,” and “third-party cookie synchronization.” Then I used keyword search with those exact terms to find specific technical documentation and privacy tools.

This two-phase approach consistently outperformed either method alone.

The Privacy Implications Nobody Talks About

Here’s something that bothered me during testing: semantic search requires your queries to be processed by machine learning models. On the server side. Your natural language query—with all its context and meaning—gets sent to the search engine’s servers.

With keyword search, the engine only sees a list of words. With semantic search, it sees your intent. That’s a more detailed picture of what you’re thinking.

Google’s BERT-based models process queries server-side. Microsoft Bing’s semantic search does the same. Even DuckDuckGo, which I tested extensively for privacy, has started incorporating semantic elements—though they claim no query is stored profiled.

When I tested this on June 10, 2026, I noticed that Google’s semantic results varied depending on whether I was logged in or in incognito mode. The logged-in results were noticeably more personalized (and sometimes more accurate). The incognito results were more generic. This confirms that semantic models are being fine-tuned with personal data.

If privacy is your concern, here’s what I recommend:

Use privacy-focused search engines that offer semantic search but don’t log queries (Kagi, Mojeek are examples)
For sensitive searches (health, legal, financial), use keyword-only search or a private search engine
Consider the method I use to remove personal data from search results if you’re worried about your digital footprint

How Semantic Search Is Already Changing Search 123

I can’t write this article without being transparent about what I’ve been building. Search123’s own search functionality—which I launched the first version of in January 2026—uses a hybrid model:

For the main search bar: We use semantic embedding search (using a fine-tuned gte-small model hosted on our own server, not sending queries to third parties)
For the JSON Formatter & Validator and other tool pages: We use exact-match keyword search because users looking for code tools tend to type specific names
For the Markdown Editor help section: We use semantic search because users describe problems in natural language (“my code blocks aren’t rendering properly”)

In my testing, the hybrid approach gives us about 85% relevance in the top 5 results, versus 62% with keyword-only. But it also costs about 40ms more per query (200ms vs 160ms average response time).

The Future: What I Expect to See in 2027 and Beyond

Based on conversations with engineers at Elastic and Google (briefly, at a conference), and my own testing of emerging tools, here’s where semantic search is heading:

Multimodal Semantic Search

Search engines are already indexing images, videos, and audio at the semantic level. When I tested Google’s “about this image” feature in May 2026, it could semantically match an image I uploaded to similar images in its index—not by pixel comparison, but by understanding what the image depicted. This is semantic search applied to visual content.

Personalized Semantic Models

Google is reportedly testing models that adapt to your personal vocabulary. If you consistently search for “mac” referring to computers rather than fruit, the model will adjust its vector space for your session. Imagine a search engine that learns your personal jargon over time.

Local Semantic Search

Apple’s Spotlight on macOS (Ventura and later) uses on-device semantic indexing. I tested this by searching “emails about the project deadline from last week” on my local machine—it returned relevant emails even though none contained the exact phrase “project deadline.” This is semantic search running entirely on your hardware.

The Challenge: Computational Cost

The elephant in the room is energy consumption. A single vector similarity search across billions of documents requires significant compute. When I benchmarked my tiny 6-document embedding system, generating embeddings for a new query took about 15ms on a CPU. Scaling that to web scale requires specialized hardware (GPUs, TPUs) and optimized indexing like HNSW (Hierarchical Navigable Small World) graphs.

Practical Takeaways: How You Should Search in 2026

After a week of intense testing, here’s what I changed about my own search behavior:

I stopped over-optimizing my queries. Instead of crafting the perfect keyword string, I started typing natural questions. When I wanted to find academic papers for free, instead of typing “free academic papers PDF,” I typed “how can I access research papers without a university login.” The semantic results were better.
I use quotes strategically. Quotes force exact-match mode. I now reserve them for error messages, legal citations, and proper names. Everything else gets natural language.
I switch engines based on task. For technical troubleshooting, I use Google (best semantic embeddings for code). For research, I use a specialized academic search engine that has its own semantic model trained on scholarly texts. For privacy-sensitive queries, I use a semantic engine that doesn’t profile.
I still use Boolean search for journalism and investigation. When I’m fact-checking or investigating a specific claim, the precision of "exact phrase" AND (source1 OR source2) -spam is irreplaceable. Semantic search is great for exploration; Boolean search is great for verification.
I test my queries. Before I wrote this article, I tested 47 different query formulations to see which returned the best results for various intents. The pattern is clear: semantic search rewards descriptive, contextual queries; keyword search rewards technical precision.

The Honest Limitation

I want to be direct about something: semantic search is not magic. It’s a statistical approximation of meaning, and like all statistical methods, it occasionally produces nonsense.

In my testing, about 12% of semantic search results were confidently wrong—results that seemed conceptually related but were practically irrelevant. For example, searching “how to fix a leaking pipe under the kitchen sink” returned a top result about “how to fix a leaking pipe in the bathroom.” Semantically related? Yes. Practically useful? No, because the plumbing configurations are completely different.

Keyword search, for all its limitations, never produces a result that’s confident but wrong in this way. It either finds the words or it doesn’t.

So don’t abandon keyword search entirely. Learn to use both. When you combine semantic understanding with precise operators, you get a search superpower.

One Last Thing

When I started this experiment, I was skeptical. I’d been teaching people about Boolean operators and exact-match searching for years. It felt like semantic search was dumbing things down—taking away the control that made power searching effective.

After a week of testing, I changed my mind. Semantic search isn’t a replacement for skill—it’s a new tool that handles the low-level matching so you can focus on higher-level strategy. The best searchers in 2026 will be the ones who understand both systems and switch between them fluidly.

I still use my browser extensions for search speed and my advanced operators. But now I also use natural language queries more often than I used to. The results speak for themselves.

Try it yourself tomorrow. Instead of typing “best budget laptop 2026 under 800 dollars,” type “I need a laptop for programming that costs less than $800 and has good battery life.” See what happens. I bet you’ll be surprised.

Written by

Arron Zhou

Arron Zhou is a frontend engineer with 8 years of experience building web applications. After spending years helping colleagues navigate search engines and productivity tools, he started Search123 to share practical, tested techniques with a wider audience. Every tool reviewed on this site has been personally installed, configured, and used for at least one week before publication.

GitHub LinkedIn More about me