Keyword relevance testing is a popular way for ecommerce teams to audit the quality of the search experience on their site. While well-intentioned, this approach is outdated in a modern era of search and product discovery where personalization and Generative AI (GenAI) are becoming increasingly prevalent. It’s not just about matching keywords anymore — it’s about understanding what products will resonate most with your customers.
If you’re still using keyword relevance testing to evaluate your search algorithm, it’s time to rethink your approach. Find out more below.
Once a popular way to audit the on-site search experience, keyword relevance testing focuses on ensuring products appear in the search results based on relevant keywords.
Essentially, this type of testing helps retailers verify that search results match customer intent and expectations, and that product listings use relevant keywords that align with how customers search.
Often, the process relies heavily on manual assessments, where you review your ecommerce site’s search data to identify common terms your customers are using. Then, you assess how well your search function is performing by following these key steps:
While these steps cover a range of query types, they remain focused on keywords alone. This fails to account for the more dynamic aspects of user search behavior, such as personalization or evolving customer intents.
At face value, keyword relevance testing appears useful. But as ecommerce evolves, so do customer expectations, making this traditional approach increasingly ineffective for various reasons:
Keyword relevance testing lacks personalization, as it doesn’t consider user-specific factors like search history, preferences, or behavior patterns that advanced solutions like Constructor take into account.
Additionally, keyword testing faces several limitations:
The manual method of keyword relevance testing, often referred to as "The Eye Test," can lead to inconsistent results. It’s highly subjective, relying on individual interpretations of what’s relevant. What one team member considers relevant may differ from another’s perspective, leading to unreliable outcomes.
Beyond the inherent flaws of subjective human scoring, there are several additional issues:
In short, one person’s definition of relevance is no longer enough to meet modern customer expectations.
Many ecommerce sites still rely too heavily on simple keyword-based search functionality.
This keyword testing overlooks deeper patterns in customer behavior, such as customers’ evolving search habits or contextual needs. Even with attempts to test a range of queries and keywords, it also typically focuses on individual words or short phrases, which may not capture the nuances of natural language queries or complex search intents.
These more complex queries will become more common as LLMs and AI capabilities are adopted by more people over time, such as ChatGPT, Gemini, Apple Intelligence, and more advanced Siri capabilities. As more mainstream audiences become familiar with these technologies, there will be an expectation that more complex queries will work on ecommerce sites.
Despite the initial step of reviewing search terms, many ecommerce sites fail to fully utilize customer search data.
They often overlook deeper insights such as search patterns, user intent, and trends over time. This means they miss valuable opportunities to understand and respond to customer needs beyond surface-level keyword matching.
A clunky navigation can quickly turn potential customers away, especially when faced with confusing category structures, limited filtering options, and non-intuitive menus or layouts.
Additionally, when evaluating faceted search, keyword testing often falls short, as it fails to capture the true impact of filtered search results on the customer experience.
Many ecommerce sites make the mistake of using thin or non-existent product descriptions, often relying on manufacturer descriptions verbatim and failing to optimize for relevant keywords. This not only limits the ecommerce site's SEO potential, but also renders keyword relevance testing ineffective.
Without well-developed, unique descriptions, there’s a lack of keyword depth and variety needed to match diverse customer queries. This leads to poor search results and missed opportunities for discoverability.
Here’s how you can modernize your search auditing process:
Search relevance alone isn’t sufficient for ecommerce success, as it often fails to deliver the most attractive results for customers. Rather than clinging to outdated methods, it’s time to shift your focus from relevance to attractiveness.
Attractiveness takes into account more than just keywords. AI-native search technology built around attractiveness leverages clickstream data and user interactions to drive search results. This means user-entered search terms and subsequent clicks and conversions are pivotal in dictating what should be returned for a given search query.
This allows for a dynamic product discovery experience that adapts to user behavior and preferences over time. It also drives key business metrics that matter most for large ecommerce companies, like revenue, conversion rates, and customer satisfaction over time.
Dive deeper into your internal search data beyond just a list of common search terms. Identify trends such as which search queries consistently lead to conversions and which fail to engage users. Look for patterns in user behavior — like queries that frequently result in ‘no results’ or those that trigger high bounce rates.
Merchandisers of this apparel brand are able to see top searches in the back-end, and then searchandise on top of those for optimal business results.
By analyzing not just what customers search for but how they interact with the results, you can pinpoint gaps in your search experience and prioritize areas for improvement.
Audit product titles, descriptions, and metadata to ensure they contain relevant keywords. Also, check for consistency in terminology across similar products.
Have a large, complex catalog? Leverage GenAI-powered solutions, which automatically enrich product information at scale.
Attribute Enrichment is a GenAI-powered tool that automatically enriches product attributes and categories, eliminating the headaches caused by incomplete, inaccurate, or non-existent product data and giving shoppers the best possible experience.
Experiment with different search ranking algorithms to find out which provides the best results for your customers. Regularly update your site's search algorithm based on user behavior and new product additions, and don’t forget to implement features like autocomplete, synonym matching, and spell-check to improve search accuracy.
Look at metrics like CTR, conversions, and other engagement metrics to measure success.
Try searches with common misspellings and typos to see if the algorithm can still return relevant results.
On Sephora’s website, the mixed language query of “dry champu” (champu being Spanish for shampoo) produces attractive results for dry shampoo products.
You can also check how the search handles plural vs. singular terms.
Pick n Pay returns principally spice options for the search query “pepper” and vegetables for the search query “peppers” for a new shopper with no previous search history.
Test searches using natural language queries (e.g., "blue pajamas for women"). Verify that the algorithm understands context and intent.
Target AU’s search solution slows contextually relevant results for natural language queries, like “blue pajamas for women.”
If your algorithm includes personalization features, test how search results vary for different user profiles or browsing histories.
See how Sephora's search engine personalizes the shopper's online experience based on subtle brand affinity cues.
Engage with customers through surveys, interviews, or social media to understand the language they use when describing or searching for products
Incorporate relevant keywords naturally into product titles, descriptions, and metadata. Use long-tail keywords and related terms to improve specificity and match varied search queries.
If you’re feeling concerned that your search hasn't been as effective as it could be, you’re not alone. But the good news is that there’s a path forward. By focusing on attractiveness over relevance and embracing modern search capabilities, you can meet and exceed customer expectations — and ultimately drive more revenue.
Ready to take the next step? Download our free audit checklist (no email required) or request a complimentary audit from an ecommerce expert today.
See how your site measures up and receive actionable advice on how to improve it for optimal KPIs this holiday season.