Betting For A Living
Back to articles

Backtest

Challenger model backtest

A comparison of alternative model shapes after the first bad longshot day, covering 1X2, totals and AH movement.

Updated 2026-05-11 21:31 UTC. Paper-trading research only.

Generated: 2026-05-11T21:31:22+00:00 Train before: 2022-07-01 Test from: 2022-07-01

This is a challenger run after the 2026-05-11 longshot failure. The purpose is not to find a magic model. It is to decide which market expressions deserve more live paper trading.

Diagnostic strategies use closing prices or closing movement, so they are CLV/research checks rather than fully live betting rules.

StrategyBetsProfit uROIWinsLossesPushesAvg odds
naive 1x2 price gap2117.2782.2%13803.076
guarded 1x2 no draw longshots6-0.25-4.2%3302.14
strong favourite price shop18-0.27-1.5%10801.699
over 2 5 bucket 170 190486724.790.5%2728213901.794
calibrated positive buckets53606.230.1%3029233101.816
totals price move hindsight (diagnostic)853891.161.1%4541399701.926
ah line move hindsight (diagnostic)8525367.694.3%448734515871.84
ah close best price (diagnostic)16934.2820.3%8967132.169

Read

  • The naive 1X2 price-gap model is not trustworthy. Even when it shows a positive historical result, the sample is tiny and it is exactly the failure mode that produced the bad day.
  • Strong favourites are not automatically good. They are price-sensitive and should usually be expressed through handicap/DNB when the outright is awkward.
  • Totals are cleaner than 1X2, but simple bucket rules are not enough on their own.
  • Asian handicap line movement is the strongest research signal in this run, but it is diagnostic until the bot stores openers and current lines live.
  • The better backbone is not a single model. It is a market router: longshot catalyst gate, then choose AH/DNB/totals before 1X2.

Proposed Backbone V2

1. Store opener immediately for 1X2, totals and Asian handicap. 2. Refresh current odds at least daily, then more often near kickoff. 3. Block 1X2 longshots unless there is news or systemic evidence. 4. If the view is favourite vulnerability, test handicap/DNB first. 5. Use totals only when the price bucket and team/news profile agree. 6. Publish rarely. No selection without a market expression that matches the thesis.

Raw report: challenger-model-backtest.md. Machine-readable data: challenger-model-backtest.json.