
Introduction
Search marketing is being redefined by generative AI. Instead of returning a page of blue links and inviting the user to click, modern assistants and AI search features synthesise answers directly from a mix of trained knowledge and retrieved sources. This evolution means that visibility is no longer measured by how high you rank or how many impressions you receive; what matters is whether the engine even mentions you inside its response. Conventional metrics—impressions, click‑through rate (CTR) and average position—were designed for a world where searchers scanned results pages and chose which links to explore. In the generative era, the answer is delivered instantly. A brand can influence the conversation only if it is named or cited within the AI’s output. Early studies on AI overviews and conversational search suggest that when AI summaries appear, organic clicks drop significantly, shifting attention from the results page to the answer itself. This shift creates a blind spot for marketers. To understand and improve their presence inside generative answers, brands need new key performance indicators (KPIs) that reflect how frequently and how prominently they appear in AI‑generated content.
This article introduces two complementary metrics—Generative Appearance Score (GAS) and Share of AI Voice (SAIV)—which together capture visibility in generative environments. GAS measures how often and how prominently a brand is mentioned or cited across a curated set of AI prompts, while SAIV compares your presence to competitors within the same answer landscape. Both metrics require different thinking than traditional SEO; rather than counting clicks or ranking positions, you are examining the raw output of language models and quantifying your position within it. Building and tracking these metrics will help you understand the competitive dynamics of AI search and highlight where to improve your content and authority signals.
Why Traditional SEO Metrics No Longer Capture AI Visibility
The rise of AI summaries, conversational answers and assistant recommendations fundamentally changes where and how information is consumed. In classic search, visibility was a function of ranking well for relevant keywords and earning clicks. Marketers could examine impressions, click‑through rates and positions to gauge performance. But generative systems present a different challenge. When a user asks a question in a conversational interface—whether in Google’s Search Generative Experience (SGE), Perplexity’s search, ChatGPT’s browsing mode or Bing Copilot—the model does not show a full results page by default. Instead, it constructs an answer from a handful of retrieved passages and its own knowledge base. These answers often cite a small number of sources, and sometimes they do not display links at all. Users may not see the underlying pages unless they explicitly ask for citations. Studies of AI Overviews have found significant declines in traditional SEO metrics when generative summaries appear, with click‑through rates dropping by double‑digit percentages on informational queries. This demonstrates that visibility in AI is not reflected in your traffic reports.
Generative engines also evaluate content differently. They rely on entity recognition, knowledge graphs and semantic context rather than keyword density alone. They extract facts, relationships and tone to determine which sources to trust and cite. A website that ranks highly in organic search may still be absent from AI answers if its pages do not clearly state facts, align with user intent or offer consistent entity signals. In addition, generative engines operate across languages and platforms; your content may appear in ChatGPT but not in Perplexity, or vice versa. A single set of numbers like impressions or ranking cannot capture this multi‑platform reality. Marketers require metrics that assess presence inside the answer layer itself, across multiple engines, languages and query types.
The Rise of Generative Metrics
As AI‑driven answers become more prevalent, digital marketers have begun to adopt new KPIs tailored to generative search. The concept of Generative Appearance Score and Share of AI Voice stems from the generative engine optimisation (GEO) framework. Unlike SEO, which focuses on ranking on search engine results pages, GEO aims to ensure that your content is included or cited within AI‑generated answers. Wikipedia’s overview of GEO notes that new metrics such as generative appearance score and share of AI voice are replacing traditional measures like click‑through rate and ranking position. These metrics emphasise frequency and prominence of mention within AI answers rather than raw traffic or rank.
Industry articles expand on this concept. A piece by digital agency Sotavento Medios highlights that GEO success is no longer measured by top‑10 rankings but by a Generative Appearance Score, tracking how often and how prominently your brand appears in AI‑generated summaries. Geneo’s comparison of GEO metrics and traditional SEO KPIs further elaborates on appearance rate, citation inclusion and prominence share as components of a generative appearance score, illustrating that appearances and citations must be weighted and normalised to understand overall visibilitygeneo.app. These sources underline the shift from clicks to citations and from ranking to presence.
What Is the Generative Appearance Score (GAS)?
Generative Appearance Score (GAS) is a metric that quantifies how frequently and prominently a brand or domain is mentioned within AI‑generated responses across a defined set of prompts. At its core, GAS measures visibility, not engagement. A high GAS indicates that the AI models consistently include your brand or content when answering questions in your domain. This does not guarantee traffic; generative answers may or may not link out. However, frequent inclusion elevates brand authority and increases the chance that users will see and trust your information.
The scope of GAS spans several appearance types:
- Direct mentions: The brand name or product is explicitly stated in the answer. For example, a health assistant might say, “According to Example Health, iron is essential for transporting oxygen in the blood.” Direct mentions indicate strong recognition of your brand as a source.
- Implicit mentions: The product or service is described without naming the brand. A model might describe features or outcomes that clearly refer to your offering, such as “a smartwatch that monitors heart rate and sleep patterns,” when your product is the only one with those features. Implicit mentions suggest that your content influenced the answer, even if credit is omitted.
- Citations: The AI includes your URL as a source in the answer or a citation list. Citations demonstrate explicit attribution and give users a way to visit your site.
- Attribution mentions: The AI introduces information with phrasing like “according to YourBrand” or “a study by YourBrand suggests…”. Attribution mentions emphasise credibility and create an association between the information and your brand.
These appearance types can be weighted to compute an overall score. For instance, direct mentions may receive a higher weight than implicit mentions, reflecting the greater prominence of explicit naming. Citations and attribution may be weighted differently depending on how many AI engines display clickable links.
GAS Formula Example
To calculate GAS, follow these steps:
- Build a list of prompts relevant to your category. Identify questions that cover informational, comparative and transactional intents. Exposure Ninja’s guide suggests starting with prompts that reflect real customer queries, such as “What’s the best [product] for [pain point]?” or “Top [industry] companies in the UK”. This ensures that your test set mirrors actual searches.
- Run the prompts across multiple engines. Test your list on Google’s AI Overviews, Bing Copilot, Perplexity.ai, ChatGPT with browsing, Anthropic Claude, and any other relevant platform. Because AI engines have different citation behaviours and update frequencies, cross‑engine testing provides a complete picture.
- Score each appearance. For each prompt, record whether your brand appears and in what capacity. Assign values to each type: a direct mention might equal 3 points, a citation 2 points, an implicit mention 1 point and an attribution mention 2 points. Use consistent weights across your analysis to facilitate comparison.
- Compute GAS using a weighted formula. One simple formula is:
GAS = (Direct Mentions × 3) + (Citations × 2) + (Implicit Mentions)
-----------------------------------------------------------
Total Prompts
This formula rewards explicit naming and attribution, while still capturing implicit influence. The denominator normalises the score by the size of your test set.
- Analyse frequency and prominence. Beyond raw counts, consider how prominently your brand appears. Geneo’s methodology introduces a prominence share, which assigns ordinal weights to positions within an AI answer (e.g., 1.0 for first mention, 0.7 for second, 0.5 for third). Adding a prominence component allows you to distinguish between being the primary answer versus an afterthought.
- Normalise and benchmark. When comparing across engines or time periods, normalise your scores to a 0–1 scale. You can then benchmark against competitors, compute averages by category and track changes over time. Geneo suggests combining appearance rate (the percentage of prompts where you appear), citation inclusion rate, prominence share, cross‑engine coverage and sentiment index into a composite score. While not standardised, such composites help summarise complex data into an actionable metric.
Why GAS Matters
Unlike CTR or ranking, GAS reflects the AI’s judgment of relevance and authority. A low GAS indicates that your brand is rarely mentioned, even if you rank well in organic search. A high GAS means that AI models have internalised your brand or content as a go‑to source. Since generative systems can produce answers without showing any search results, being present in those answers becomes critical to maintaining brand awareness. Frequent appearance also builds trust; as Birdeye notes, brands that repeatedly appear in AI answers are perceived as credible and gain market share. Tracking GAS reveals which topics you already dominate and where competitors are winning visibility.
What Is Share of AI Voice (SAIV)?
Share of AI Voice (SAIV) measures your proportion of all mentions within AI‑generated answers for a given query set, compared to competitors. The concept builds on the traditional marketing metric Share of Voice (SoV), which quantifies how much of the conversation your brand commands across media channels. Exposure Ninja defines AI SoV as the percentage of AI‑generated responses that mention your brand relative to all brands in your category. It is the “how much” metric: how dominant your voice is within the AI dialogue.
SAIV is particularly useful for competitive analysis. A high SAIV means that AI models disproportionately cite your brand compared to peers, implying that your information is more likely to influence users. Conversely, a low SAIV signals that competitors are more visible and trusted. By benchmarking SAIV across topics and engines, you can identify where to focus your optimisation efforts.
SAIV Calculation
To compute SAIV:
- Identify key prompts that represent commercial and informational queries in your niche. Include questions at different stages of the customer journey (awareness, consideration and decision). This ensures that you capture a broad range of user intents.
- For each prompt, detect all brands mentioned. Record both explicit and implicit mentions, as well as citations and attribution. Exposure Ninja recommends tracking tone and sentiment as well.
- Count the occurrences of each brand across all answers. This includes the number of prompts where your brand appears and how many times it is mentioned within a single answer.
- Calculate SAIV using the formula:
SAIV = (Brand Mentions / Total Mentions of All Brands) × 100%
If your brand is mentioned 10 times across 20 answers, and all brands combined are mentioned 40 times, your SAIV is 25%. Repeat this calculation for each competitor. The sum of all SAIV percentages across brands equals 100%.
Interpreting SAIV
SAIV reflects category leadership within AI answers. A brand with a high SAIV dominates the conversation, occupying a large share of the answer space. Lower SAIV values indicate that competitors are more visible or that AI models favour other sources. Because generative answers often cite only a handful of sources, small shifts in SAIV can signal significant changes in market perception. Birdeye notes that AI share of voice is the new battleground for digital visibility; it measures how often—and how authoritatively—your brand appears as an answer in AI-generated results. Tracking SAIV alongside GAS provides a comprehensive view of both absolute and relative presence.
Understanding Appearance Types
When measuring GAS and SAIV, it is important to differentiate among appearance types because each conveys a different level of recognition and credit:
- Direct mentions are the gold standard. The AI explicitly names your brand, product or spokesperson. Direct mentions typically have the greatest impact on user recall and trust.
- Implicit mentions occur when the AI uses descriptive phrases that clearly refer to your product or service without naming it. These still contribute to brand visibility because they are based on your content, but they lack explicit credit.
- Citations are links or references to your URL. Citations are valuable because they allow users to click through to your site. However, some AI engines may cite you without naming your brand in the answer text, so tracking citations separately helps you understand where your domain is being used but not credited.
- Attribution mentions describe the source of information using phrases like “according to…” or “as reported by…”. These emphasise authority and are often accompanied by a citation. They bridge the gap between direct mention and citation.
When building your scoring system, consider weighting direct mentions more heavily than citations or implicit mentions. Exposure Ninja recommends tracking all appearance types to understand where your brand is credited versus where your content is used without recognition. This distinction helps you identify opportunities to improve citation and attribution.
Measuring the Generative Appearance Score
Step 1: Build a Bank of Relevant Prompts
Start by generating a list of questions that potential customers or stakeholders might ask about your product, service or industry. Use existing keyword research, forums and customer support logs to inform your list. Exposure Ninja suggests including prompts covering awareness, consideration and decision stages. For example:
- Awareness: “What is generative engine optimisation?”
- Consideration: “Best project management tools for remote teams.”
- Decision: “Is [YourBrand] the right CRM for small businesses?”
Be sure to adapt the wording for each language and region if you operate globally.
Step 2: Run Prompts Across Multiple Engines
Test each prompt on different generative platforms. Because engines like Google’s AI Overviews, Bing Copilot, Perplexity, ChatGPT and Claude have distinct models and data sources, your brand may appear in one but not another. By running prompts across multiple systems, you capture a comprehensive picture of your generative visibility. Record the date, engine and any notable differences in answers, such as the presence or absence of citations.
Step 3: Score Visibility Based on Presence, Prominence and Frequency
For each prompt and engine, log:
- Presence: Was your brand present at all? Score 1 for presence and 0 for absence.
- Prominence: If present, how prominently were you mentioned? Assign ordinal weights (e.g., 1.0 for first mention, 0.7 for second). Birdeye suggests that direct answers should be weighted higher than secondary mentions.
- Frequency: How many times did your brand appear across all prompts? Frequency captures the spread of your presence across the query set. You can calculate frequency as the count of prompts where your brand appeared divided by total prompts.
Combine these elements into your GAS formula. You may choose to adjust the weights depending on strategic priorities. For example, a B2B software company might value citations more than direct mentions if their goal is to drive trials, while a consumer brand may prioritise direct mentions for awareness.
Step 4: Compute GAS Using Weighted Components
After scoring each prompt, sum the weighted values and normalise by the total number of prompts. Optionally, include additional factors such as cross‑engine coverage and sentiment. Geneo’s transparent formula suggests weights of 0.30 for appearance rate, 0.25 for citation inclusion, 0.20 for prominence share, 0.15 for cross‑engine coverage and 0.10 for sentiment. Adjust these weights based on your specific goals; there is no universally agreed standard yet.
Step 5: Benchmark and Analyse Trends
Calculate your GAS for each time period (e.g., monthly or quarterly) and track changes. Benchmark against competitors by calculating their GAS using the same prompts and weights. Identify topics where your GAS is low or dropping; these may be areas where competitors have published new content or where your pages lack clarity. Birdeye advises conducting regular audits to find where your brand is cited, omitted or used without credit.
Measuring Share of AI Voice
Step 1: Identify Key Commercial and Informational Prompts
Like GAS, SAIV begins with building a list of prompts. Focus on questions that matter for your market, including both high‑purchase intent queries (“best [product] for [use case]”) and educational queries (“how to choose [product]”). Exposure Ninja emphasises that these prompts should mirror real customer questions.
Step 2: Detect All Brands Mentioned in Each AI Answer
After running the prompts, parse the AI response for brand names and synonyms. Record both direct and implicit mentions. Note whether a mention is positive, neutral or negative to incorporate sentiment if desired. This step ensures that you capture the full conversation, not just mentions of your brand.
Step 3: Count Occurrences Per Brand
Aggregate the number of times each brand appears across all prompts and engines. Consider weighting direct mentions more heavily than implicit mentions if you want to emphasise explicit recognition. You can also track the number of citations and attribution mentions per brand.
Step 4: Calculate SAIV
Divide the number of times your brand is mentioned by the total number of brand mentions across all competitors. Multiply by 100 to express as a percentage:
SAIV = (Brand Mentions ÷ Total Mentions of All Brands) × 100%
If you appear 15 times out of 60 total brand mentions, your SAIV is 25%. Compute this figure for each competitor to produce a distribution of the answer space. Exposure Ninja’s formula uses a similar approach to compute AI share of voice.
Using SAIV to Inform Strategy
SAIV shines a light on category leadership. If your SAIV is low across high‑intent prompts, it signals that competitors dominate the narrative. You may need to publish more comprehensive guides, improve entity clarity or secure third‑party citations to win more mentions. Conversely, a high SAIV indicates strong authority; maintain this by keeping content fresh and consistent. Birdeye notes that AI share of voice measures how often and how authoritatively your brand appears in answers, making it a clear indicator of market dominance.
Tools and Data Sources for Calculation
Calculating GAS and SAIV can be labour‑intensive if done manually. However, several techniques and tools simplify the process:
- Manual prompt testing logs: Begin by manually running prompts across engines and recording results in a spreadsheet. This approach provides flexibility and direct insight. Exposure Ninja provides an AI Visibility Tracker template for logging prompts, platforms, mentions and sentiment.
- Browser automation: Tools like Puppeteer and Playwright can automate the submission of prompts and capture responses. By scripting your test set, you can run hundreds of prompts across multiple engines and gather data at scale.
- Custom dashboards: A combination of BigQuery, Looker or Data Studio and Python scripts can ingest results from prompt tests, compute GAS and SAIV, and visualise trends. Use structured schema and consistent naming to ensure accurate aggregation.
- AI SERP monitoring tools: Emerging platforms such as Profound and Semrush’s AI Toolkit track brand mentions across generative search engines and provide share of voice reports. These tools often include sentiment analysis and competitor benchmarks, making them suitable for ongoing monitoring.
Using GAS and SAIV to Improve GEO Strategy
- Identify gaps: Analyse your GAS and SAIV results to see which topics and prompts you fail to dominate. For low‑performing areas, investigate why. Are competitors publishing more current or comprehensive content? Are you missing key definitions or comparison pages? Use this insight to prioritise new content or content updates.
- Strengthen entity clarity: Ensure that each page clearly names the entities it covers. Use descriptive headings, definitions and structured data to help AI models recognise your brand and products. Geneo emphasises that cross‑engine coverage is part of appearance scoring, so consistent entity signals help models map your content across platforms.
- Improve product documentation and comparisons: Many AI prompts are geared toward making decisions. Provide detailed product pages, comparison tables and buyer’s guides that include facts, features and use cases. Structured, data‑rich content increases the likelihood of being cited.
- Update stale content: AI engines prioritise freshness. Birdeye notes that engines evaluate freshness and consistency when selecting trusted brands. Regularly audit and refresh your content to ensure dates, statistics and references remain current. Use the
datePublishedanddateModifiedproperties in your structured data to signal recency. - Secure third‑party citations: External validations such as news articles, guest posts, expert interviews and industry awards build authority. Birdeye lists citations from reputable sources as a key signal for AI selection. A strong off‑site presence reinforces your brand’s credibility and improves both GAS and SAIV.
Benchmarks and Thresholds
Because generative metrics are new, there is no universal “good” GAS or SAIV. Healthy ranges vary by industry, query intent and competition. Early data suggest that in competitive consumer sectors (e.g., consumer electronics or health supplements), a GAS above 0.5 and a SAIV above 20% may indicate strong visibility. In niche B2B markets, even a GAS of 0.2 could signify leadership if competitors are barely present. Geneo stresses that GEO metrics should be treated as directional rather than absolute, given the evolving nature of AI enginesgeneo.app. Regular measurement across time periods provides more insight than a single snapshot; look for upward trends and improvements relative to peers.
Pitfalls to Avoid When Measuring AI Visibility
- Using too few prompts: A small prompt set will skew your results. Ensure your test set covers a range of intents, keywords, and question forms. A robust dataset leads to more reliable GAS and SAIV.
- Over‑focusing on a single engine: Generative engines behave differently. Evaluating only one (e.g., ChatGPT) may give a false impression of your visibility. Cross‑engine measurement is essential.
- Ignoring implicit mentions and paraphrased references: Some answers may draw from your content without naming you. Failure to account for implicit mentions understates your influence. Logging both explicit and implicit references provides a fuller picture.
- Overweighting any single appearance type: Balance is key. A high number of direct mentions might lead you to overlook a lack of citations, which still influences traffic. Use weighted scoring and adjust as needed.
- Failing to document your methodology: GAS and SAIV calculations depend on the prompts, engines and weights you choose. Without documentation, results cannot be compared over time or across teams. Geneo advises documenting query sets, engines, geographies and devices used in your analysis.
Future of AI Visibility Metrics
The measurement of generative visibility is in its infancy. Several trends will shape the future:
- Predictive AI visibility modelling: As more data is collected on which factors influence AI answers, models could predict the likelihood that a given page or brand will appear in a generative response. These predictions will help prioritise optimisation efforts before content is even published.
- Automated competitive alerting: Dashboards will evolve to provide real‑time alerts when competitors gain or lose share of voice. This will support agile content updates and PR responses.
- Standardisation of metrics: Industry groups and analytics vendors may develop standard definitions and weightings for GAS and SAIV, making it easier to benchmark across verticals and share insights. For now, however, custom scoring systems tailored to your goals remain essential.
- Integration with traditional KPIs: GAS and SAIV will not replace SEO metrics but complement them. The next generation of dashboards will correlate generative visibility with website traffic, lead generation and revenue, enabling marketers to see how AI presence drives business outcomes.
Conclusion
Generative Appearance Score and Share of AI Voice represent the new north‑star metrics of AI‑era visibility. Traditional SEO metrics alone cannot capture the ways in which large language models shape user journeys. By measuring how often—and how prominently—your brand is present inside AI‑generated answers, GAS reveals your absolute visibility, while SAIV shows how you fare against competitors. Building these metrics requires a disciplined approach: crafting relevant prompts, testing across multiple engines, recording appearance types, applying weighted formulas and benchmarking results. When used together, GAS and SAIV illuminate gaps in your generative strategy and provide a feedback loop for improvement. In the emerging landscape where AI decides what users see, brands that monitor and optimise their generative presence will command the conversation and convert attention into trust.
Want to know whether ChatGPT, Perplexity, or Google AI Overviews mention your firm? Run a free first-party visibility audit on your domain in under a minute and see exactly which queries cite you and which do not.
