Scoring Audit

Model v7.0Updated Jun 28, 2026

Every week, we backtest every winning Powerball draw against our scoring criteria and use AI to flag drift. This page shows the results — no black boxes.

This audit shows how Balliqa performs over time — not by predicting wins, but by consistently selecting structured combinations.

All combinations have equal odds. This audit demonstrates consistency in selection, not an increase in probability.

Draws Analyzed

1,373

Avg Score (All-Time)

63.9 / 100

Avg Score (Recent 50)

65.9 / 100

Score Drift

+2

Latest audit: Jul 12, 2026 · Recent window: Mar 16, 2026 – Jul 8, 2026

Criteria Performance

Criterion	Max Pts	All-Time Pass	Recent Pass	Drift	Confidence	Avg Pts (All)	Avg Pts (Recent)
Parity	10	63.1%	60%	-3.1	34%	6.3	6
High/Low	10	66.1%	70%	+3.9	42%	6.6	7
Sum Range	9	67.4%	74%	+6.6	65%	7	7.6
Spread	10	63.8%	76%	+12.2	91.6%	7.5	8.5
Unique Digits	19	33.7%	38%	+4.3	46%	10.4	10.9
Modular Balance	10	62.6%	64%	+1.4	16%	6.3	6.4
Tens Diversity	9	68.6%	74%	+5.4	55%	6.2	6.7
Even Spacing	14	53.5%	48%	-5.5	56%	7.5	6.7
Primes	9	69.6%	68%	-1.6	18%	6.3	6.1

Stable (<5)Watch (5-10)Flagged (>10)

What This Audit Proves

Consistency

The system behaves the same way every draw. Criteria pass rates stay within expected ranges across thousands of draws.

Structure

Picks are not random. They follow defined combinatorial criteria that align with how real draws are structured.

Transparency

No cherry-picking. Every criterion, every weight, every drift metric is published here. You see exactly what we see.

Every lottery combination has the same odds. What changes is how combinations are selected. This audit shows that Balliqa consistently selects from a structured subset — rather than picking randomly.

AI Analysis

medium confidenceLast run: Jul 12, 2026

Overall scoring engine is well-calibrated with all-time average of 63.9/100 on winning draws. Recent 50-draw window shows +2.0 point drift, indicating stable performance. One criterion shows concerning drift requiring investigation.

Flagged Criteria

Spread (10pts): drift of +12.2pp (63.8% → 76%) exceeds 10pp threshold; recent pass rate 76% suggests possible over-fitting or temporary clustering in sample
Even Spacing (14pts): negative drift of -5.5pp (53.5% → 48%) shows consistent underperformance; well below 68% benchmark for well-calibrated criterion

Suggestions

Investigate Spread criterion: verify calculation logic and check if recent 50-draw window contains statistical anomaly; if validated, modest weight increase justified
Review Even Spacing: pass rate of 48% in recent draws indicates poor calibration; consider reducing weight from 14pts or recalibrating thresholds to approach 65-70% target
Monitor Unique Digits (33.7% all-time) which remains below benchmark but is not drifting; systematic underperformance may reflect inherent draw distribution rather than model error

Generated by Claude (Anthropic) · AI suggestions are reviewed by humans before any scoring changes are made.

Current Weights

v7.0100 pts

Unique Digits19

Even Spacing14

Parity10

Spread10

High/Low10

Modular Balance10

Primes9

Sum Range9

Tens Diversity9

Drift Rebalance+0-7 bonus

Model Changelog

Current-matrix calibration + honest reweighting. All analysis now scoped to the current 5/69 era (draws from 2015-10-07 on); ~591 older draws from the retired 5/59 matrix were being mixed in, biasing every frequency, σ-band, and drift figure. Verified against a 5M-combination Monte-Carlo: on current-era data all criteria sit within their 95% combinatorial bands (the lottery is fair). Recomputed every weight from exact combinatorial filter strength — weight ∝ (1 − pass rate), normalized to 100 by largest-remainder rounding; the prior v6.0 weights deviated from this formula by up to 7 pts (Even Spacing was the 2nd-hardest filter yet weighted lowest). Dropped Range Coverage as redundant (φ up to 0.41 with Sum Range, 0.33 with Tens Diversity) — down to 9 criteria.

Purely combinatorial model. Replace Consecutive Pair (empirical, 26.6% pass rate) with Modular Balance (all 3 residues mod 3, 64.3% pass rate). Remove Drift Rebalance overlay (empirical mean-reversion assumption). Reweight all criteria by filter strength — weight proportional to (1 - pass rate), normalized to 100. Unique Digits (35.1% pass rate) now highest at 17 pts. Protected criteria updated to Parity, High/Low, Unique Digits. All 10 criteria now derivable from pure combinatorial math with zero reliance on historical patterns.

Add Drift Rebalance overlay (+0-7 bonus pts). Compares last 20 draws against full-history baselines across 8 structural metrics (parity, high/low, sum, digit sum, primes, gap entropy, consecutive pairs, decades). Picks that lean against metrics drifted beyond 1σ earn up to 1 pt each, capped at 7. Operates at the distribution level only — no individual number targeting. Base 100-pt model unchanged.

Remove gambler's fallacy criteria. Replace Co-occurrence (5 pts, worse than random prospectively at 36% vs 50%), Hot/Cold Mix (5 pts, 35.6% vs 50%), and PB Weighting (5 pts, 70.1% vs 75%) with three combinatorially-grounded criteria: Range Coverage (6 pts, all 3 thirds represented, 62%), Tens Diversity (5 pts, 4+ tens groups, 66%), and Even Spacing (4 pts, all gaps ≤ 2x ideal, 59%). All criteria now verifiable from pure combinatorial math with zero reliance on historical patterns persisting.

Major model overhaul. Remove Decade Spread (97% pass rate, no discrimination) and Drought Bonus (gambler's fallacy, −20 drift). Add Consecutive Pair (6 pts, 28% of draws have one) and PB Weighting (5 pts, PB drought above 25th percentile). Graduate Sum Range (18/9/0 at 1σ/1.5σ), Spread (13/6/0), and Unique Digits (7/3/0 at 5/4 unique). Loosen Hot/Cold Mix to include 2+ hot or 2+ cold. Reduce Primes 8→5 pts, bump Co-occurrence 4→5 pts.

Replace Anti-Birthday (4 pts) with Co-occurrence (4 pts) — scores picks containing historically co-occurring number pairs. Recalibrate Drought Bonus thresholds (35→20 floor, 80→60 divisor) to fix −21.4 drift.

Redistribute points to maintain 100-point scale. Parity/High-Low/Sum Range 15→18 pts each, Spread 10→13 pts. Max stays at 100.

Drought Bonus 10→5 pts, Decade Spread 8→4 pts, Anti-Birthday 7→4 pts. First AI audit-driven recalibration.

Initial scoring model with 10 criteria, max 100 pts.

Reading the Audit

All-Time Avg

How the average winning draw scores against our criteria. Each criterion's all-time pass rate should closely match its combinatorial pass rate — confirming real draws behave like random samples from C(69,5).

Recent Avg

The average score across the last 50 draws. Comparing this to the all-time average reveals whether the model's criteria are holding steady or drifting — the difference between these two is the drift value.

Drift

The difference between recent (last 50 draws) and all-time pass rates. Low drift (±5) means the criterion is stable. High drift (>10) triggers auto-adjustment after 3 consecutive audits.

AI Confidence

How confident the AI analyst is in its assessment. Based on sample size and drift significance. High = clear signals, medium = some uncertainty, low = insufficient data.

Adjusted

Whether the model auto-adjusted its weights during this audit. Adjustments only fire when a criterion drifts >±10 for 3+ consecutive weeks, are capped at 2 points per cycle, and must pass validation.

Audit History

Date	Model	Draws	All-Time Avg	Recent Avg	Drift	AI Confidence	Adjusted
Jul 12, 2026	v7.0	1,373	63.9 / 100	65.9 / 100	+2	medium	—
Jul 5, 2026	v7.0	1,370	63.9 / 100	65.3 / 100	+1.4	high	—
Jun 28, 2026	v7.0	1,368	63.9 / 100	65.9 / 100	+2	high	—
Jun 21, 2026	v6.0	1,955	67.4 / 100	69.7 / 100	+2.3	medium	—
Jun 14, 2026	v6.0	1,952	67.4 / 100	69.7 / 100	+2.3	medium	—
Jun 7, 2026	v6.0	1,949	67.4 / 100	68.2 / 100	+0.8	medium	—
May 31, 2026	v6.0	1,946	67.4 / 100	68.3 / 100	+0.9	medium	—
May 24, 2026	v6.0	1,943	67.4 / 100	68.7 / 100	+1.3	medium	—
May 17, 2026	v6.0	1,940	67.4 / 100	69.5 / 100	+2.1	medium	—
May 10, 2026	v6.0	1,937	67.4 / 100	70 / 100	+2.6	medium	—
May 3, 2026	v6.0	1,934	67.4 / 100	70.8 / 100	+3.4	medium	—
Apr 26, 2026	v6.0	1,931	67.4 / 100	71.2 / 100	+3.8	medium	—
Apr 19, 2026	v6.0	1,928	67.4 / 100	71.3 / 100	+3.9	medium	—
Apr 12, 2026	v6.0	1,925	67.4 / 100	71.9 / 100	+4.5	medium	—
Apr 5, 2026	v6.0	1,922	67.4 / 100	71 / 100	+3.6	medium	—
Mar 29, 2026	v5.1	1,919	65.2 / 100	70.4 / 100	+5.2	medium	—
Mar 26, 2026	v5.0	1,918	65.2 / 100	70 / 100	+4.8	medium	—
Mar 22, 2026	v3.x	1,916	63.3 / 100	63.9 / 100	+0.6	high	—
Mar 17, 2026	v3.x	1,914	63.3 / 100	63.8 / 100	+0.5	high	—
Mar 15, 2026	v3.x	1,913	62.4 / 100	63 / 100	+0.6	medium	—

How Our AI Audit Works

We use AI to audit ourselves — not to predict numbers. Read the full breakdown of our scoring audit pipeline.

Read the Blog Post

The system is transparent. The math is published. The rest is up to you.

Get your picks before the next draw →