Cited Brands

SA-AEO-Bench v1 · 99% complete · 2026-05-19

The quarterly report

How AI search engines cite South African enterprise brands.

188,877 citations classified. 14,826 successful AI responses. 100 brands across 10 industries and 3 frontier LLMs (GPT-5, Claude Sonnet 4.5, Gemini 2.5 Pro).

Pre-registered at osf.io/w4az2 before data collection — methodology, prompt set, and analysis code public. All seven pre-registered hypotheses (H1–H7) are confirmed or tracking in their predicted directions.

Download the full report →Read the methodology

188,877

Citations classified

14,826 successful AI responses

100

South African brands measured

Across 10 consumer industries

Frontier LLMs queried

GPT-5 · Claude · Gemini

99%

Pre-completion data snapshot

16,338 of 16,500 records

From SA-AEO-Bench v1

Five findings no published SA study has surfaced before.

01
MTN is the brand AI cannot stop talking about. Bidvest Bank is the brand it has forgotten.
Across blind organic queries, MTN surfaced in 99% of “best telecom” questions. At the other end, Bidvest Bank surfaced in 0%. Every percentage point of organic visibility is direct AI-channel demand capture.
02
A 42-percentage-point SA-citation gap separates the most-protected industry from the least.
Short-term insurance routes 72.7% of citations to SA sources — the most SA-protected category. Restaurants routes only 29.8%. SA brands in low-share industries fight AI’s preference to route to international competitors; brands in high-share industries inherit a moat.
03
Negative-framing queries route 64% to international sources.
When the same brand is asked about positively vs negatively, the SA-citation share drops from 44.4% to 36.4% — a −7.9pp shift toward international complaint platforms. Reputation surfaces sit on Trustpilot, Complaintsboard, and PissedConsumer for the queries that matter most for purchase decisions.
04
Gemini cites Reddit 698 times. GPT-5 and Claude cite Reddit zero times.
The single sharpest model-specific finding in the dataset. Pre-registered H6 set the threshold at 5×. Observed ratio: ∞ (literally infinity — the other two return zero). Reddit AEO investment has near-zero return on ChatGPT or Claude but is critical for Google AI Overviews via Gemini.
05
Brands with high ‘sycophancy uplift’ have AI visibility scores that are partly an artefact.
When the brand name is in the prompt, AI returns it confidently. When it is not, the brand may surface at 0%. Ten brands in the dataset have a +100pp blind-to-named gap. Any AEO measurement tool that does not compare blind vs named probes overstates these brands’ real organic AI standing.

H1 — H7 · pre-registered

Seven hypotheses, locked in before the data was collected.

Pre-registration on OSF (osf.io/w4az2). Hypotheses + analysis plan + prompt set + scoring code committed before any LLM was queried — eliminating the “p-hacking” degree of freedom.

Hypothesis

Current status

Verdict

SA-domain citation share >50% (organic, all 3 models)

53.5% across all responses

CONFIRMED

Top-20 cited domains differ significantly between LLMs

Pattern visible; formal test pending full data

TRACKING

Latin Square position-swap Jaccard <0.6

Pilot 0.33–0.54; full analysis pending

TRACKING

Negative-polarity SA-share ≤ positive − 10pp

Gap: −7.9pp · tracking

TRACKING

≥80% of multilingual citations are English

Awaiting multilingual subsample resolution

PENDING

Gemini Reddit cites ≥5× max(GPT-5, Claude)

GPT-5 0, Claude 0, Gemini 698 — ratio ∞

CONFIRMED

Capped vs uncapped Gemini Jaccard ≥0.70

Pre-study A/B 91% preservation; retest pending

PENDING

Industry SA-share

Where AI’s SA-source preference is strongest, and where it’s weakest.

Share of citations routed to South African source domains by industry. Higher = AI prefers SA sources. Lower = international platforms dominate.

Short-term insurance72.7%

Hyper-local — SA insurance comparison sites dominate. Foreign competition near-zero.

Automotive71.8%

Strong local dealer network (Motus, Cars.co.za, AutoTrader SA) gives SA sources priority.

Telecom60.5%

Local tech press dominates but Trustpilot still holds significant share.

Banking59.8%

Mix of local brand sites and international finance comparison sites.

Medical aid57.2%

Local scheme ecosystem entrenched; international medical-comparison sites still meaningful.

Streaming52.9%

Global content categorisation — Showmax sits in a "vs Netflix" frame more than as its own SA category.

Retail51.5%

Grocery comparison drifts toward international benchmarks.

Real estate45.1%

SA portals strong but international Trulia/Zillow-equivalents leak in.

E-commerce32.9%

AI treats SA e-commerce as a sub-category of global e-commerce. Amazon comparisons crowd out local context.

Restaurants29.8%

Limited specialist SA restaurant publishers — international food-delivery review sites take the citation share.

Hotels18.0%

Exploratory measurement. International booking-and-review platforms dominate the SA hotel citation pool — local hotel groups need to feed the SA travel press to show up at all.

Safari lodges3.0%

Exploratory measurement. The worst SA-source share in the dataset. AI engines almost entirely default to global travel publications and lodge-aggregator sites.

Sample prompt · awareness stage1 of 330 · Latin Square debiased

“What’s the best telecom in South Africa for a small business?”

Asked across GPT-5, Claude Sonnet 4.5 and Gemini 2.5 Pro · 5 replications per prompt · order-reversed via Latin Square · brands counted only if they appear unprompted in the recommended answer.

Most visible · top 3 of 10

Brands AI talks about without prompting.

01MTNtelecom
02Vodacomtelecom
03Takealotecommerce
04Pam Goldingreal estate96%
05Seeffreal estate95%
06Showmaxstreaming94%

+ 7 more brands in the full reportDownload the full leaderboard →

Most forgotten · top 3 of 10

Brands AI cannot recall when asked.

01Bidvest Bankbanking
02Afrihosttelecom
03Webafricatelecom
04MWebtelecom0%
05RSAWebtelecom0%
06Voxtelecom0%

+ 7 more brands in the full reportDownload the full leaderboard →

The “must-be-on” list · top 5 of 15

SA-domain citations across all 3 LLMs.

The operational target list for any SA brand competing for AI visibility. Editorial placement on these domains compounds across every engine measured.

01businesstech.co.za7,773

02hellopeter.com4,697

03mybroadband.co.za2,246

04techcentral.co.za2,024

05rateweb.co.za1,931

+ 10 more domains in the full report — including the 2 that drive 3K citations between themDownload the full must-be-on list →

H6 · confirmed at infinity

The single sharpest model-specific finding in the dataset.

Gemini cites 698 Reddit pages for SA brand queries. GPT-5 and Claude cite Reddit zero times.

Most SA marketing teams have no idea this asymmetry exists. Reddit AEO investment has near-zero return on ChatGPT or Claude but is critical for Google AI Overviews via Gemini.

Reputation polarity

Framing changes which sources LLMs surface.

SA-share of citations by prompt framing. Negative-polarity queries route 63.6% to international sources — a structural gap of −7.9pp vs positive framing.

Framing

SA cites

Intl cites

SA share

organic

19,860

17,148

53.7%

positive

8,339

10,460

44.4%

balanced

9,094

10,258

47.0%

value

11,537

6,982

62.3%

negative

6,511

11,353

36.4%

Per-LLM summary

The three frontier engines cite SA at different rates.

Model

Calls

Citations

SA share

GPT-5

5,500

32,848

20,428

62.2%

Claude Sonnet 4.5

5,218

144,235

74,042

51.3%

Gemini 2.5 Pro

4,108

11,794

5,849

56.7%

Get the full report

Download SA-AEO-Bench v1 · 2026-05-19 snapshot.

Per-brand citation breakdowns. Per-industry SA-share. Full hypothesis test results. Top-25 SA + model-exclusive domain lists. The complete pre-completion deliverable as a PDF + raw JSON dataset.

Browse research documents →

How AI search engines cite South African enterprise brands.

Five findings no published SA study has surfaced before.

MTN is the brand AI cannot stop talking about. Bidvest Bank is the brand it has forgotten.

A 42-percentage-point SA-citation gap separates the most-protected industry from the least.

Negative-framing queries route 64% to international sources.

Gemini cites Reddit 698 times. GPT-5 and Claude cite Reddit zero times.

Brands with high ‘sycophancy uplift’ have AI visibility scores that are partly an artefact.

Seven hypotheses, locked in before the data was collected.

Where AI’s SA-source preference is strongest, and where it’s weakest.

Brands AI talks about without prompting.

Brands AI cannot recall when asked.

SA-domain citations across all 3 LLMs.

The single sharpest model-specific finding in the dataset.

Framing changes which sources LLMs surface.

The three frontier engines cite SA at different rates.

Download SA-AEO-Bench v1 · 2026-05-19 snapshot.