Dynamic Bradley-Terry
Time-evolving paired-comparison model with latent team strength drift.
| Status | implemented | Short Name | DynBT |
|---|---|---|---|
| Rating Style | elo | Where Shown | teams rankings, team pages, model performance |
Model Card
Glicko-style paired-comparison model with team uncertainty that expands during idle stretches and tightens with new games. This is the first explicit temporal paired-comparison model beyond static BT plus recency weighting.
Validation on this page uses stored season-level full-fit rows plus strict weekly rolling holdouts. Rolling ranks are based on average weekly log loss in the selected league/season. Current scope: All Games. All scored games in the selected league and season. AP Poll is excluded here.
Selected League / Season
Use the selected scope to inspect this model against peers and across rolling weekly holdouts.
| Full Fit AUC | - · 0 games |
|---|---|
| 7-Day Holdout Log Loss | 0.540 (#7) · 20 games |
| Rolling Avg Log Loss | 0.607 (#2) · 3 windows |
| Window Wins | 1 · 69.4% rolling accuracy |
Full-fit AUC is only available where the older stored-rating batch produced that model for the selected scope. Recency and ensemble families can still have valid 7-day and rolling holdout results even when the full-fit columns are blank.
Rolling Holdout Curves
Selected model plus comparison peers for WNBA 2026. The focus line is Dynamic Bradley-Terry. Click legend items to hide/show peers.
Weekly strict holdout log loss. Lower is better. Showing 6 models across 3 windows.
Recent Windows
| Holdout | Log Loss | Rank | Winner |
|---|---|---|---|
| Jun 5 - Jun 8 | 0.518 | #8/16 | Adjusted Efficiency (0.465) |
| May 29 - Jun 4 | 0.589 | #1 | Dynamic Bradley-Terry |
| May 22 - May 28 | 0.713 | #11/16 | Bradley-Terry Recency (0.660) |
Selected Scope Peer Table
Comparison snapshot for WNBA 2026. Rows are ordered by rolling log-loss rank; full-fit AUC may be blank for models that do not have legacy full-season rows stored for this scope.
| Model | Full AUC | AUC Rank | 7d LogLoss | 7d Rank | Rolling LL | Rolling Rank | Windows |
|---|---|---|---|---|---|---|---|
| Bradley-Terry | - | - | 0.558 | #13 | 0.605 | #1 | 3 |
| Dynamic Bradley-Terry | - | - | 0.540 | #7 | 0.607 | #2 | 3 |
| Bradley-Terry Recency | - | - | 0.588 | #14 | 0.612 | #3 | 3 |
| Recency Ensemble | - | - | 0.522 | #6 | 0.616 | #4 | 3 |
| Core Ensemble | - | - | 0.521 | #5 | 0.617 | #5 | 3 |
| Elo | - | - | 0.597 | #15 | 0.624 | #6 | 3 |
| Points Off/Def Recency | - | - | 0.545 | #9 | 0.633 | #7 | 3 |
| Avg Margin Baseline | - | - | 0.510 | #3 | 0.636 | #8 | 3 |
League × Season Summary
Cross-scope validation history for Dynamic Bradley-Terry. Rolling ranks are by average weekly log loss.
| League | Season | Full AUC | AUC Rank | 7d LL | 7d Rank | Rolling LL | Rolling Rank | Windows | Wins |
|---|---|---|---|---|---|---|---|---|---|
| WNBA | 2026 | - | - | 0.540 | #7 | 0.607 | #2 | 3 | 1 |
| WNBA | 2025 | - | - | 0.549 | #6 | 0.628 | #5 | 21 | 1 |
| NCAAWD2 | 2025-2026 | - | - | 0.504 | #8 | 0.567 | #6 | 14 | 2 |
| NCAAW | 2025-2026 | - | - | 0.603 | #9 | 0.555 | #8 | 22 | 0 |
| NCAAMD2 | 2025-2026 | - | - | 0.564 | #9 | 0.597 | #10 | 13 | 0 |
| NCAAM | 2025-2026 | - | - | 0.688 | #14 | 0.595 | #10 | 22 | 0 |
| NBA | 2025-2026 | - | - | 0.769 | #11 | 0.634 | #11 | 33 | 1 |
| GLEAGUE | 2025-2026 | - | - | 0.860 | #14 | 0.691 | #13 | 22 | 3 |
Output Coverage
Stored site-output rows for this model.
| League | Season | Rows |
|---|---|---|
| WNBA | 2025 | 17 |
| NCAAWD3 | 2025-2026 | 585 |
| NCAAWD2 | 2025-2026 | 521 |
| NCAAW | 2025-2026 | 592 |
| NCAAMD3 | 2025-2026 | 648 |
| NCAAMD2 | 2025-2026 | 516 |
| NCAAM | 2025-2026 | 704 |
| NBA | 2025-2026 | 37 |
| GLEAGUE | 2025-2026 | 31 |