Fan Intelligence for World Cup 2026

June 29, 2026 · 10 min read

With the FIFA World Cup 2026 underway, I thought it would be a good chance to develop a World Cup 2026 fan intelligence platform in a regional deployment stamp pattern, but could also be scaled globally across other regions and locales.

This blog articles, covers the architecture, the 12 AI agent types, the Drasi event pipeline, the AKS configuration, that I used! This solution is also open source and you can deploy/modify and learn it from it all from scratch with azd (note it won't be cheap to run due to the resources deployed and needed, I was aiming for full production sizing and resourcing vs small proof of concept).

info

The full codebase is open source and available at lukemurraynz/WC2026_FanIntelligence on GitHub. You can explore the source, deploy your own instance, and modify it for your own use cases.

Quick start

# Prerequisites: Azure CLI, azd 1.26+, kubectl, .NET 10 SDK
azd auth login
azd config set alpha.aks.kustomize on
azd env new wc2026 --subscription <subscription-id> --location australiaeast

# Deploy everything
azd up

After deployment, verify:

kubectl get pods -n wc2026
# Expect: 2 api, 2 web, 2 cosmos-forwarder -- all Running

kubectl exec -n wc2026 deploy/fanintelligence-api -- curl -s http://localhost:8080/health/ready
# Expect: Healthy

Platform overview showing deployed services and verification

Architecture overview

Azure architecture overview showing Front Door, AGC, AKS, data, and agent orchestration boundaries

The platform has five layers:

Ingress -- Azure Front Door Premium routes global traffic to the nearest region's Application Gateway for Containers (AGC).
Compute -- AKS cluster wc2026-fanintel-aks running three services:
- API (.NET 10, Orleans 10) -- stateful grains for fan profiles, match state, notifications, and AI agent orchestration
- Web (React 19, TypeScript, Vite) -- fan-facing frontend
- CosmosForwarder -- reads Cosmos DB change feed and forwards to Event Hubs
Data -- Cosmos DB (wc2026-mvp-cosmos), Azure Managed Redis Enterprise (wc2026-redis), and Azure AI Search (wc2026-search) for tournament grounding.
Event pipeline -- Cosmos DB change feed -> Event Hubs -> Drasi continuous queries -> Service Bus -> KEDA-scaled workers. Five Drasi source pods and one reaction pod handle the streaming.
AI -- Microsoft Foundry with gpt-5-mini deployment. Twelve agent types provide fan-facing explanations.

Quick primer on key technologies

New to some of these? Here's what each enables:

Orleans: Stateful virtual actors that hold fan state (preferences, history, rate limits) in memory across requests - eliminates distributed cache races and throttling when thousands of goal events fire simultaneously.
Drasi: Continuous Cypher queries that detect only meaningful changes (goals, status shifts) rather than every state update - cuts noise by 90% and routes events to the right AI agent.
PostDaprPubSub: A Drasi reaction that publishes deltas to Service Bus without custom code - keeps the event pipeline declarative.
AKS: Managed Kubernetes cluster that runs the API, web, and event-processing workloads - handles rolling updates, KEDA autoscaling, and workload identity so you don't manage VMs.
Azure Kubernetes Fleet Manager: Control plane for multi-region Kubernetes clusters - stages updates across regions and provides a single pane of glass for health and rollout progress.

Skip the details if you're familiar; the architecture works the same. These choices just solve specific pain points (state races, event noise, custom processors, regional orchestration) that show up at tournament scale. :::

Key architectural decisions

Decision	Choice	Rationale
Compute	AKS + Orleans	Stateful grains eliminate distributed-cache races for fan state
Event detection	Drasi continuous queries	Cypher-based change detection with `drasi.previousValue()`
AI model	gpt-5-mini (GlobalStandard)	Smallest model that meets quality bar with grounding
Event bridge	Dapr PostDaprPubSub	Native Drasi reaction, no custom bridge service
Deployment	azd + Kustomize + Flux	Single `azd deploy` for all services, GitOps for production
State storage	Cosmos DB (Session consistency)	Multi-region writes, grain directory persistence

Agentic AI capabilities with Microsoft Agent Framework

The platform has 12 generative AI agents, orchestrated with Microsoft Agent Framework and implemented as Orleans grain-backed handlers:

Agent	What it does	How to trigger
Goal Explainer	Plain-language goal description	Select match + "Why it mattered"
Daily Briefing	Personalised match summary	"Get my briefing" button
Match Narrative	Rolling story of live match	"Run Match Narrative"
Bracket Projection	Knockout bracket prediction	"Run Bracket Projection"
Multi-turn Companion	Conversational Q&A with memory	"Run Multi-turn Companion"
Upset Detection	Identifies unexpected results	"Run Upset Detection"
Travel Advisor	Fan travel recommendations	Based on team preferences
Player Insight	Player performance analysis	Based on match events
Qualification Scenarios	Group advancement projection	Via /qualification/scenarios API
Watch Party Host	Watch party coordination	Web UI
Rivalry Context	Head-to-head history	Match context panel
Safety Checker	Content safety evaluation	Runs on every agent output

Each agent follows the same pattern: structured prompt template -> Azure AI Search grounding -> Foundry model -> safety filter -> output validation -> fallback if grounding score < 0.7.

AI Agents page showing the agent catalogue

Live Agent Hub flow

When a fan runs an agent from the Agent Hub:

The frontend calls the selected agent endpoint.
Microsoft Agent Framework routes to the configured Foundry-backed agent implementation.
Grounding, confidence, and safety checks validate the response.
Deterministic fallback text is returned if the AI response does not pass validation.

This model keeps the experience predictable while still giving fans rich AI explanations for matches, brackets, and qualification outcomes.

Drasi event pipeline

The event pipeline processes live match data in real time:

# Drasi continuous query: goal-scored
MATCH (m:Match)
WHERE m.status = 'Live'
AND (m.homeScore <> coalesce(drasi.previousValue(m.homeScore), -1)
OR m.awayScore <> coalesce(drasi.previousValue(m.awayScore), -1))
RETURN m.matchId, m.homeTeamCode, m.awayTeamCode, m.homeScore, m.awayScore

The drasi.previousValue() function detects changes between document revisions. Three queries run: goal-scored, match-status-change, and schedule-changed. Results flow through Dapr to Service Bus, then to KEDA-scaled API workers.

Data flow that drives the agentic experience

Match data changes in Cosmos DB (Matches container).
CosmosForwarder publishes those changes to Event Hubs.
Drasi evaluates continuous queries and emits only meaningful deltas (goal, status, schedule).
PostDaprPubSub + Service Bus fan out those deltas as reliable work items.
API workers + Orleans grains map each event to the right Microsoft Agent Framework capability.
Foundry + grounding + safety gates generate fan-facing outputs (briefings, goal explanations, travel updates).

Event signal from Drasi	Agent capability that activates	Fan-visible outcome
`goal-scored`	Goal Explainer, Match Narrative, Daily Briefing refresh	"Why this goal mattered" updates and refreshed match story
`match-status-change`	Daily Briefing, Qualification Scenarios, Watch Party Host	Updated briefings, qualification implications, and watch party timing
`schedule-changed`	Travel Advisor, Multi-turn Companion context refresh	Revised travel guidance and updated answers in follow-up questions

This flow is what keeps the AI useful during live tournaments. Drasi removes noisy state churn, Orleans applies fan context, and Agent Framework routes each event to the capability that can produce an immediate fan-facing action.

The Drasi reaction pod is running and healthy:

kubectl logs -n drasi-system deploy/forward-to-servicebus-reaction --tail=3
# Starting PostDaprPubSub reaction
# Dapr sidecar is available.
# Validated query configurations.

Drasi pipeline status showing source and reaction pods

AKS configuration

The AKS cluster runs three services with specific configurations:

API pods -- Orleans silos with Redis clustering, Cosmos grain persistence:

strategy:
  rollingUpdate:
    maxSurge: 1
    maxUnavailable: 0
securityContext:
  runAsNonRoot: true
  seccompProfile:
    type: RuntimeDefault

KEDA autoscaling -- scales API pods on Service Bus queue depth:

minReplicaCount: 2
maxReplicaCount: 10
triggers:
  - type: azure-servicebus
    queueName: match-events

Workload Identity -- every pod authenticates to Azure services via managed identity, no connection strings:

metadata:
  labels:
    azure.workload.identity/use: "true"

AKS cluster verification showing API, web, and forwarder pods healthy

Platform operations capabilities: AGC, Drasi, AKS, and Fleet Manager

The solution's operations model is built around four platform capabilities that map directly to reliability and scale outcomes:

Front Door + AGC gives controlled ingress with health-probed routing and a single global endpoint. WAF and DDoS controls are at Front Door, while AGC handles Kubernetes Gateway API routing.
Drasi turns Cosmos change-feed events into actionable queue messages without custom stream processors.
AKS runs the API/web/forwarder workloads with rolling updates, KEDA, and workload identity.
Azure Kubernetes Fleet Manager provides a control plane for multi-cluster operations with staged update groups across regions.

AGC gateway status showing the gateway programmed and API health probe passing Azure Kubernetes Fleet Manager status showing fleet and member health in rg-mvp AKS Fleet rollout strategy showing staged upgrade flow from primary to secondary

Deployment model

The platform uses azd with host: aks and Kustomize overlays. Each service has its own overlay directory, so azd deploy api deploys only the API.

services:
  api:
    project: ./src/FanIntelligence.Api
    language: dotnet
    host: aks
    k8s:
      kustomize:
        dir: ../../infra/k8s/overlays/azd-api

The hook pipeline adds validation gates:

preprovision -- validates Bicep syntax
postprovision -- configures Entra app, Flux GitOps, AI Search
predeploy -- runs tests, validates Kubernetes manifests
postdeploy -- verifies Orleans health, canary endpoints

Tradeoffs and decisions

Why Orleans over stateless APIs

Stateless APIs with Redis for deduplication hit a wall during goal storms -- every event triggered Cosmos subscriber queries that throttled under load. Orleans grains own per-fan state in memory, so rate limits, preferences, and notification history are a local lookup, not a distributed query.

Why Drasi over custom stream processing

Drasi's drasi.previousValue() lets us detect changes in Cypher rather than writing custom stream processors. The tradeoff is that Drasi is pre-1.0 CNCF -- the API surface changes between releases, and some reaction providers aren't bundled (PostDaprPubSub needs externalImage: true).

Why gpt-5-mini over larger models

Larger models didn't improve explanation quality once grounding was in place. The smaller model is faster, cheaper, and the grounding score threshold catches the edge cases where it doesn't have enough context.

Why Front Door over direct AKS ingress

Front Door provides global routing, WAF, and DDoS protection without managing multiple ingress controllers per region. The tradeoff is that the AGC integration has a version coupling issue between the managed Gateway API add-on and the ALB controller (another post is coming).

Hopefully this gives you a working blueprint for building event-driven AI platforms on AKS. Make sure to check out the Orleans on AKS and Drasi documentation for deeper dives into stateful orchestration and continuous query patterns, I had a lot of fun building this, hopefully it helps showcase how you could build that next solution!

Quick start​

Architecture overview​

Key architectural decisions​

Agentic AI capabilities with Microsoft Agent Framework​

Live Agent Hub flow​

Drasi event pipeline​

Data flow that drives the agentic experience​

AKS configuration​

Platform operations capabilities: AGC, Drasi, AKS, and Fleet Manager​

Deployment model​

Tradeoffs and decisions​

Why Orleans over stateless APIs​

Why Drasi over custom stream processing​

Why gpt-5-mini over larger models​

Why Front Door over direct AKS ingress​

References​