🐻⬇️🏀

Team model

Dynamic Bradley-Terry

Time-evolving paired-comparison model with latent team strength drift.

Status implemented Short Name DynBT
Rating Style elo Where Shown teams rankings, team pages, model performance

Model Card

Glicko-style paired-comparison model with team uncertainty that expands during idle stretches and tightens with new games. This is the first explicit temporal paired-comparison model beyond static BT plus recency weighting.

Validation on this page uses stored season-level full-fit rows plus strict weekly rolling holdouts. Rolling ranks are based on average weekly log loss in the selected league/season. Current scope: All Games. All scored games in the selected league and season. AP Poll is excluded here.

Selected League / Season

Use the selected scope to inspect this model against peers and across rolling weekly holdouts.

Full Fit AUC - · 0 games
7-Day Holdout Log Loss 0.564 (#10) · 285 games
Rolling Avg Log Loss 0.596 (#10) · 13 windows
Window Wins 0 · 67.1% rolling accuracy

Full-fit AUC is only available where the older stored-rating batch produced that model for the selected scope. Recency and ensemble families can still have valid 7-day and rolling holdout results even when the full-fit columns are blank.

Rolling Holdout Curves

Selected model plus comparison peers for NCAAMD2 2025-2026. The focus line is Dynamic Bradley-Terry. Click legend items to hide/show peers.

Log Loss Brier AUC Accuracy

Weekly strict holdout log loss. Lower is better. Showing 6 models across 14 windows.

Recent Windows

Holdout Log Loss Rank Winner
Feb 4 - Feb 7 0.559 #9/16 Points Off/Def (0.528)
Jan 28 - Feb 3 0.581 #7/16 Core Ensemble (0.570)
Jan 21 - Jan 27 0.565 #8/16 Margin Recency (0.533)
Jan 14 - Jan 20 0.584 #7/16 Margin (0.551)
Jan 7 - Jan 13 0.611 #9/16 Margin (0.569)
Dec 31 - Jan 6 0.596 #10/16 Margin (0.549)
Dec 24 - Dec 30 0.437 #11/16 Margin (0.237)
Dec 17 - Dec 23 0.656 #11/16 Margin (0.569)

Selected Scope Peer Table

Comparison snapshot for NCAAMD2 2025-2026. Rows are ordered by rolling log-loss rank; full-fit AUC may be blank for models that do not have legacy full-season rows stored for this scope.

Model Full AUC AUC Rank 7d LogLoss 7d Rank Rolling LL Rolling Rank Windows
Core Ensemble - - 0.541 #6 0.539 #1 14
Recency Ensemble - - 0.541 #7 0.540 #2 14
Margin 0.802 #3 0.535 #4 0.541 #3 14
Margin Recency - - 0.536 #5 0.543 #4 14
Avg Margin Baseline 0.874 #1 0.560 #8 0.554 #5 14
Dynamic Bradley-Terry - - 0.564 #10 0.596 #10 13

League × Season Summary

Cross-scope validation history for Dynamic Bradley-Terry. Rolling ranks are by average weekly log loss.

League Season Full AUC AUC Rank 7d LL 7d Rank Rolling LL Rolling Rank Windows Wins
WNBA 2025 - - 0.549 #6 0.628 #5 21 1
NCAAWD2 2025-2026 - - 0.508 #8 0.568 #6 14 2
NCAAW 2025-2026 - - 0.603 #9 0.555 #8 22 0
NCAAMD2 2025-2026 - - 0.564 #10 0.596 #10 13 0
NCAAM 2025-2026 - - 0.688 #14 0.595 #10 22 0
NBA 2025-2026 - - 0.710 #3 0.623 #6 26 1
GLEAGUE 2025-2026 - - 0.860 #14 0.691 #13 22 3

Output Coverage

Stored site-output rows for this model.

League Season Rows
WNBA 2025 17
NCAAWD3 2025-2026 581
NCAAWD2 2025-2026 530
NCAAW 2025-2026 592
NCAAMD3 2025-2026 644
NCAAMD2 2025-2026 517
NCAAM 2025-2026 704
NBA 2025-2026 37
GLEAGUE 2025-2026 31