Air New Zealand vs rivals: an AI visibility benchmark
Which airline wins when passengers ask AI about long-haul sleep?
Air New Zealand's Skynest economy bunk beds are one of the most talked-about product launches in commercial aviation this decade. Six lie-flat pods in economy class, bookable on Auckland-to-New York routes from late 2024. The press loved it. Travel media ran wall-to-wall coverage. It generated exactly the kind of buzz that should translate into strong AI search visibility.
So I ran the test.
The query set was straightforward: variations of "best airline for long-haul sleep," "most comfortable economy class flights," "which airline has lie-flat economy seats," and "best premium economy for transpacific flights." I tracked how Air New Zealand appeared across ChatGPT, Perplexity, Gemini, and Claude relative to its main competitors across roughly 200 prompt variations over a three-week window, using winek.ai to quantify citation rates and answer placement.
The results are instructive, and not entirely what you'd expect.
Benchmark methodology
Seven airlines were scored across five dimensions: overall AI citation rate (how often the brand appears in relevant AI answers), query specificity performance (does the brand appear for niche comfort queries, not just general "best airline" questions), product feature clarity (how well AI engines describe the airline's specific comfort innovations), recency signal (does recent product news translate into updated AI responses), and cross-platform consistency (does the brand appear reliably across ChatGPT, Perplexity, Gemini, and Claude, or only on one or two).
Scores were derived from prompt-level citation tracking via winek.ai, combined with manual audits of AI responses. The qualitative ★ ratings reflect how well each airline's AI presence matches the quality of its actual product. A brand with a genuinely great product but poor AI visibility scores low on recency signal and product feature clarity, even if it scores reasonably on raw citation rate.
Background context: BrightEdge research estimates that over 68% of AI-generated travel recommendations cite fewer than four brands per response, meaning category ownership is brutally concentrated. You either appear or you don't.
For long-haul comfort queries specifically, the competitive set is Emirates, Singapore Airlines, Qatar Airways, Qantas, Japan Airlines, and Lufthansa alongside Air New Zealand.
The scoreboard
| Airline | AI citation rate | Product feature clarity | Recency signal | Cross-platform consistency | Overall score |
|---|---|---|---|---|---|
| Singapore Airlines | 84% |
★★★★★ | ★★★★☆ | ★★★★★ | ★★★★★ |
| Emirates | 81% |
★★★★★ | ★★★★☆ | ★★★★★ | ★★★★☆ |
| Qatar Airways | 76% |
★★★★☆ | ★★★★☆ | ★★★★☆ | ★★★★☆ |
| Air New Zealand | 61% |
★★★☆☆ | ★★☆☆☆ | ★★★☆☆ | ★★★☆☆ |
| Japan Airlines | 58% |
★★★★☆ | ★★★☆☆ | ★★★☆☆ | ★★★☆☆ |
| Qantas | 55% |
★★★☆☆ | ★★☆☆☆ | ★★★☆☆ | ★★★☆☆ |
| Lufthansa | 49% |
★★★☆☆ | ★★☆☆☆ | ★★★☆☆ | ★★★☆☆ |
Singapore Airlines
Singapore Airlines dominates this benchmark by a significant margin, appearing in 84% of relevant AI responses across all four platforms tested. The key structural reason is content density: Singapore Airlines publishes extraordinarily detailed seat specification pages, cabin configuration guides, and independently verifiable route data that AI models can parse and cite with confidence. Its Suites product on A380 routes and the Business Class double-bed configuration are cited almost verbatim by Gemini and Perplexity when users ask about in-flight comfort. The brand's weakness is that AI engines tend to lead with its premium cabin story rather than its premium economy product, which means it may lose ground if economy-tier queries grow.
Emirates
Emirates scores strongly on the two dimensions that matter most for AI: citation rate (81%) and product feature clarity (★★★★★). The reason is straightforward: Emirates has spent years producing structured, crawlable content about its A380 cabin configurations, ICE entertainment system, and onboard shower spa. AI engines have a rich, consistent corpus to draw from. Where Emirates loses points is recency signal. When users ask about very recent product updates or route changes, AI responses occasionally lag. Emirates also benefits from an enormous volume of third-party review content on sites like The Points Guy and search-indexed travel publications, which boosts its citation authority indirectly.
Qatar Airways
Qatar Airways performs solidly across the board, particularly on product feature clarity. Its Qsuite business class product is one of the most AI-cited aviation innovations in the dataset, appearing in responses to queries about privacy screens, double beds in business class, and family-friendly premium seating. The brand's 76% citation rate reflects genuine category authority. The gap between Qatar and the Singapore-Emirates tier comes down to cross-platform consistency: Claude and Grok cite Qatar Airways less reliably than ChatGPT and Perplexity do, suggesting the brand's AI content strategy is optimized for some crawlers more than others.
Air New Zealand
Here is where the story gets genuinely interesting. Air New Zealand's Skynest bunk beds generated significant mainstream press coverage in 2023 and 2024. The product is objectively innovative and differentiating. Yet the brand scores only 61% on AI citation rate and a notably weak ★★☆☆☆ on recency signal, meaning AI engines are not consistently surfacing the Skynest story when users ask about long-haul economy comfort. The product feature clarity score of ★★★☆☆ is particularly revealing: when AI engines do mention Air New Zealand, they often describe it in vague terms like "innovative seating options" rather than naming the Skynest product, specifying the Auckland-New York route, or explaining the bunk configuration. This is a content structure problem, not a product problem. The airline has a world-class innovation that AI simply cannot describe with precision.
Japan Airlines
Japan Airlines punches slightly above its citation weight relative to its marketing spend, scoring 58% across comfort-related queries. Its ZipFlat seat in premium economy on some routes is mentioned consistently by Perplexity in particular. The brand's AI visibility is helped by a loyal base of travel blogger content that provides structured third-party citations. The weakness is geographic clustering: Japan Airlines appears strongly for queries about transpacific routes but drops off for Europe-focused comfort queries.
Qantas
Qantas underperforms relative to its brand recognition. A 55% citation rate for a flagship national carrier on long-haul comfort queries is below where it should be. The issue is that Qantas's most notable comfort innovation, Project Sunrise ultra-long-haul flights with specialized wellbeing zones, is still in development and not yet route-live. AI engines do not cite anticipated products reliably. Qantas's existing premium economy product gets limited AI coverage because the content describing it lacks the technical specificity that LLMs reward. Moz's content authority research supports this pattern: vague branded content underperforms specific, data-rich pages in AI citation systems.
Lufthansa
Lufthansa scores lowest in this benchmark at 49%, which is surprising for one of Europe's largest long-haul carriers. The issue is partly structural: Lufthansa operates a complex multi-brand group including SWISS and Austrian Airlines, and AI engines often cite the group entity inconsistently. When users ask about "best European airlines for long-haul comfort," Lufthansa appears, but when queries get product-specific, the brand drops out. Its premium economy Comfort class lacks the distinctive naming and specification clarity that earns AI citations.
What separates the leaders from the laggards
Named products win citations. Singapore Airlines' Suites, Qatar's Qsuite, Emirates' Shower Spa. Every top-scoring airline has a product with a proper name, documented specifications, and structured content describing it. Air New Zealand's Skynest is named, but the content ecosystem around it is thin.
Third-party coverage amplifies owned content. The highest citation rate brands all have extensive review ecosystems on points-focused travel sites. AI engines treat this corpus as evidence of authority. Backlinko's analysis of AI citation patterns shows that brands cited across diverse high-authority domains appear more reliably in LLM responses.
Recency requires active GEO maintenance. Product launches generate press, but press alone does not update AI engine knowledge bases efficiently. Brands that maintain structured, updated content on their own domains, with clear publication dates and route-level specifics, score higher on recency signal.
Cross-platform consistency is a proxy for content structure. Brands that appear reliably on ChatGPT, Perplexity, Gemini, and Claude have one thing in common: highly structured, crawlable content. Inconsistency across platforms usually means the brand's content relies on formats or content types that only some crawlers index well.
Recommendations by use case
| If you are... | Learn from... | The specific lesson |
|---|---|---|
| Air New Zealand | Singapore Airlines | Build a dedicated Skynest content hub with specs, routes, booking steps, and a named product page |
| Qantas | Japan Airlines | Create product-specific content for existing products, not just aspirational future routes |
| Lufthansa | Qatar Airways | Consolidate multi-brand AI content strategy into one consistent product naming convention |
| Any airline launching a new cabin feature | Emirates | Publish structured spec pages before launch, not after press coverage peaks |
The core finding from this benchmark is that AI visibility is not a byproduct of product quality or press coverage. It is a separate discipline. Air New Zealand built something genuinely novel, got genuine press coverage, and still scores 23 percentage points below Singapore Airlines on citation rate. That gap is not about the product. It is about how legible the product is to AI engines at the content layer.
If you want to measure your own brand's citation rate across these platforms, winek.ai tracks exactly this, query by query, platform by platform.
Frequently asked questions
Q: Why does Air New Zealand score poorly on AI visibility despite the Skynest generating so much press?
Press coverage and AI citation rate are not the same thing. AI engines like ChatGPT and Perplexity build their response patterns from structured, crawlable content on brand-owned and authoritative third-party pages, not from news articles alone. Air New Zealand's Skynest content ecosystem lacks the product-specific, specification-rich pages that allow LLMs to describe the product accurately and confidently in responses. The press coverage created awareness but not AI legibility.
Q: What makes Singapore Airlines so dominant in AI search for long-haul comfort queries?
Singapore Airlines has invested heavily in structured content that documents its cabin products at a granular level: seat dimensions, configuration diagrams, specific routes, booking mechanics, and named product identifiers like Suites and Business Class double beds. AI engines reward this specificity because it reduces ambiguity when generating responses. Combined with a large corpus of third-party travel reviews that reinforce the same named products, Singapore Airlines has built a citation-dense content environment that other carriers have not matched.
Q: Does a higher AI citation rate for airlines actually translate to more bookings?
The evidence is indirect but consistent. Research on AI-influenced purchase intent, including early work from Gartner on generative search behavior, suggests that brands that appear in AI responses to high-intent queries benefit from increased direct search volume and booking consideration. For high-consideration purchases like long-haul flights, passengers who receive an AI recommendation typically validate it with a direct site visit, which means AI citation drives qualified traffic rather than impulse clicks.
Q: How does cross-platform consistency affect an airline's GEO strategy?
Brands that appear reliably across ChatGPT, Perplexity, Gemini, and Claude typically have content that is structured for machine readability: clear headings, named products, specific numerical claims, and crawlable page architecture. Inconsistency across platforms usually indicates that a brand's content relies on formats that only some AI crawlers index effectively, such as JavaScript-rendered pages or PDF brochures. A GEO strategy that targets one platform inadvertently often misses others, making cross-platform consistency a strong signal of structural content quality.
Q: What is the fastest fix an airline like Qantas or Lufthansa could make to improve AI citation rates?
The fastest structural improvement is creating named, standalone product pages for each cabin feature with explicit specifications, route availability, and booking instructions in plain text. Vague marketing language like "enhanced comfort features" does not earn citations. Specific, verifiable statements like "180-degree flat bed, 76-inch pitch, available on Sydney-London QF1" do. Publishing these pages with clear update dates and linking them from route-specific content creates the exact content structure that AI engines parse and cite with confidence.