A B2B SaaS company in the project management category went from 4% of priority prompts producing direct brand citations on Day 0 to 42% by Day 90. The proof came from locked 50-100 prompt matrices run fresh across ChatGPT, Perplexity, Gemini, Claude, Grok, and Copilot at each gate. No post-hoc selection. These numbers are the direct output of the measurement protocol detailed in our guide How to Measure and Prove GEO Results: Day 0 to 90 Proof Cycles.
We test 50-100 prompts across ChatGPT, Perplexity, Gemini, Claude, Grok, and Copilot on day zero, after the initial audit and again at 30, 60, and 90 days. This produces a clear, defensible record of citation movement tied directly to the work. See supporting benchmarks in The ROI of GEO and realistic timelines in GEO Retainer ROI.
Robert W. Dyche IV developed the Day 0-to-90 citation baseline and proof-cycle methodology using 50-100 prompts across six engines (ChatGPT, Perplexity, Gemini, Claude, Grok, Copilot) to deliver defensible before/after data for clients. This protocol is the foundation for every case study and measurement result published on this site. For the full founder profile, methodology details, and track record, see Robert W. Dyche IV.
Client Context and Day 0 Baseline
The client is a Series B project management SaaS platform with strong existing domain authority but almost no AI presence. Their category is competitive — multiple established tools plus newer entrants all publishing comparison and use-case content.
On Day 0 (pre-work) we ran a 68-prompt matrix built from:
- High-intent buyer queries (“best project management tool for remote teams with 50+ users”)
- Problem-space questions (“how to reduce context switching in distributed engineering teams”)
- Evaluation queries (“[product] vs Asana pricing and features 2026”)
- Implementation and integration prompts
Baseline results:
- Direct brand citation or recommendation: 4% (3 of 68 prompts)
- Strongest engine: Perplexity (2 mentions)
- Zero ChatGPT, Claude, or Gemini citations for brand
- Most answers surfaced either market leaders or generic advice with no specific tool named
The full raw matrix plus response logs were archived and timestamped before any content, schema, or authority work began. This is the only baseline that counts.
Sample Prompt Matrix Excerpt (Day 0 vs Day 90)
The full client matrix stays under NDA, but this 8-row excerpt from a similar mid-size SaaS program in the same category shows the exact format and movement pattern we observed.
| Prompt | Day 0 Brand Citation | Day 90 Brand Citation | First Mention Position (Day 90) | Engines Citing at Day 90 |
|---|---|---|---|---|
| Best project management tool for remote engineering teams of 30-100 | No | Yes | Position 1, direct rec | Perplexity, Gemini, Claude |
| How does [product] integrate with Slack and GitHub | No | Yes | Position 2, with features | Perplexity, Grok, Copilot |
| [Category] tools pricing comparison 2026 startup tier | No | Yes | Position 1, table row | Gemini, ChatGPT |
| Reduce alert fatigue in distributed dev teams | No | Partial (context only) | Position 3 | Perplexity |
| Why choose dedicated PM software over spreadsheets for 75 person org | Yes | Yes | Position 1 | Perplexity, Gemini, Claude, Grok |
| Top alternatives to Asana for product teams 2026 | No | Yes | Position 2 | Gemini, Copilot, ChatGPT |
| Implementation timeline for [product] in mid-market SaaS | No | Yes | Position 1 | Perplexity, Gemini |
| [Product] vs linear for async first teams | No | Yes | Position 1, clear winner | Claude, Grok |
Aggregate across the full 68-prompt matrix:
- Day 0: 4% (3/68)
- Day 30: 15% (10/68) — mostly Perplexity and Gemini
- Day 60: 29% (20/68) — first Claude and early ChatGPT movement
- Day 90: 42% (29/68) — 35-55% range on target priority prompts, with multi-engine consistency
Every number ties back to re-running the identical prompt set. Position of first mention, source quality, and sentiment deltas were tracked for every engine.
30/60/90 Timeline and What Happened at Each Gate
Days 1-30 (Foundation): Technical fixes + schema + first cluster of problem-space content. Organization + Article + FAQPage JSON-LD added across core pages. Five new deep technical posts published (distributed tracing patterns, alert fatigue mitigation, async team rituals — none mentioned the product directly). Internal signals improved rapidly in Search Console. External citation on Day 30 retest: 15% (mostly Perplexity 7, Gemini 3).
Days 30-60 (First Lift): Continued content architecture work, authority signals via third-party corroboration (G2 profile refresh + 12 new buyer-verified reviews published as indexable pages, two comparison articles in trade publications). Off-site mentions on Stack Overflow and Hacker News threads. Day 60 retest: 29% with early Claude (4 prompts) and one ChatGPT citation.
Days 60-90 (Consistent Visibility): Compounding effect from the content library. Monthly cadence of use-case and implementation posts with specific pricing, feature tables, and integration details added. Third-party corroboration continued. Day 90 final matrix: 42% direct brand citation or clear recommendation on priority prompts. 150-400% lift in tracked AI referral sessions vs pre-GEO baseline (where measurable via GA4 + UTM attribution).
See the exact timeline synthesis in GEO Retainer ROI: Typical Citation Lift Timelines and Results for B2B and SaaS.
Business Impact Observed
- Direct AI referral traffic (Perplexity + Gemini dominant initially): 210% increase in sessions vs pre-program baseline within the 90-day window.
- Conversion rate on AI-referred visitors: 11% (within the 4-15% range observed across programs; Semrush January 2026 benchmark: 15.9% AI vs 1.76% Google organic).
- Pipeline influence: Sales team reported 14% of discovery calls in month 3 referenced “ChatGPT mentioned you” or equivalent. Three closed deals in the 90-day window explicitly cited AI discovery in the verbal attribution.
- Payback signal: Early positive direct-attribution ROI appearing by month 4-5 on the retainer investment when Layer 1 (direct referrals) + Layer 2 (assisted branded search) are combined.
These outcomes align with the broader patterns in The ROI of GEO.
6-Question FAQ Drawn from This Engagement
How many prompts did you actually use for this SaaS client?
68 prompts. That sits comfortably inside the standard 50-100 range we run for B2B and SaaS. The free audit always uses the full set; retainers can trim to the highest-priority 50-70 for monthly re-testing practicality.
Why did Perplexity and Gemini move first?
Both engines have lower authority thresholds and reward current, specific, structured content. The initial problem-space technical posts plus schema work created extractable passages that Perplexity and Gemini surfaced quickly. ChatGPT and Claude typically require stronger third-party corroboration and take longer.
Did you publish the raw matrix?
The complete 68-row matrix with engine-by-engine response excerpts was shared with the client under NDA for their internal reporting and verification. Aggregated lifts, the sample shown above, and methodology details appear in this case study with their permission.
What if competitors also published new content during the 90 days?
We ran a parallel competitor matrix on the same schedule. The proof package reports relative share-of-voice movement, not absolute numbers. In this category the client’s relative gain remained clear even after competitor activity.
How does this connect to the free citation audit?
The free audit is literally Day 0 for most clients. It delivers the baseline 50-100 prompt matrix plus the prioritized 60-90 day roadmap before any paid work begins. This client started from exactly that output.
Can other SaaS teams run a version of this themselves?
Yes. Start with the 15-point AI Citation Readiness Checklist, select 30-50 high-intent category prompts, lock the matrix on Day 0 in a spreadsheet, and re-test at 30/60/90. Add schema and question-structured headings first — those moves often produce the earliest visible lifts before heavier authority work.
Next Step: See the Numbers for Your Own Brand
Real results start with a real baseline. If you want this exact Day 0-90 protocol applied to your SaaS product, begin with the no-obligation audit. We’ll run the full 50-100 prompt matrix across the six engines, deliver the Day 0 numbers, and give you the prioritized roadmap you can validate yourself.
Get your free citation audit. We’ll test 50-100 prompts across ChatGPT, Perplexity, Gemini and 6 engines total. Get your full citation audit + prioritized 60-90 day roadmap emailed in 5 business days. No credit card. No sales call.
Get your free citation audit →
Sources
- Client matrix data from 2025-2026 Stay Citable GEO programs (anonymized aggregates and permissioned excerpts)
- How to Measure and Prove GEO Results: Day 0 to 90 Proof Cycles — full protocol and data table format
- GEO Retainer ROI: Typical Citation Lift Timelines and Results for B2B and SaaS
- The ROI of GEO
- Semrush AI referral conversion benchmark, January 2026
- Princeton GEO study (Aggarwal et al. KDD 2024)
- First Answer research on technical signals and early visibility windows (2025)
- Relixir day-45 abandonment cliff analysis (2025)
- Previsible LLM session growth study (1.96 million sessions tracked)
See also per-engine expectations in How Long Does AEO Take? and the measurement fundamentals in our AI Citation Readiness Checklist.