Mining 36 Ted Knutson Transcripts: 2 Signals That Flip the Portfolio From -3.28% to +0.97%

March 19, 2026|Research|REJECTED

Mining 36 Ted Knutson Transcripts: 2 Signals That Flip the Portfolio From -3.28% to +0.97%

We read 36 Ted Knutson transcripts, extracted every betting edge, and ran 8 new signals through the full testing protocol. Two survived: a league portfolio filter (+2.09pp marginal ROI, p=0.033) and a home AH conditional rescue (+2.24pp, needs live validation). Combined, they turn a -128.7u portfolio into +19.9u. Six signals failed. One wrong-direction discovery (new managers outperform) opens a new investigation.

We read 36 Ted Knutson transcripts, extracted every betting-relevant claim, cross-referenced against our signal registry of 46 previously tested signals, and ran 8 new tests through the full protocol. Two signals survived. Six didn't. The two that survived combine to flip the portfolio from -3.28% ROI to +0.97% ROI.

This post covers what we tested, what worked, what didn't, and the one wrong-direction discovery that deserves its own investigation.

The Setup

We started with 36 Ted Knutson video transcripts spanning August 2024 through March 2026 — EPL picks episodes, transfer analysis, team rebuilding series, and Champions League previews. We extracted every betting edge, market inefficiency observation, and analytical framework Ted mentioned, then mapped each against our existing signal registry.

The cross-reference identified:

10 net-new signals never tested
6 refinements to deployed edges
5 resurrections of previously failed signals with corrected methodology

We prioritized 8 for immediate testing based on data availability and expected value.

Baseline

Every test runs against the deployed AH portfolio:

Metric	Value
Bet pool	AH-only, odds ≤ 2.0, edge ≥ 7%
Total bets	3,927 (3 seasons × 19 leagues)
CLV	+10.97%
ROI	-3.28%
P&L	-128.7u

The model sees edge everywhere (CLV is excellent). It can't convert that edge to profit (ROI is negative). The signals need to either cut the bleeding bets or identify the profitable subset.

Protocol

Every signal follows testing-protocol-v2:

Standalone test: minEdge=0, full 15,165-bet AH universe. Does the signal identify anything at all?
Marginal test: deployed baseline → add signal. Does it improve ROI on top of the existing stack?
IS/OOS split: 6 development leagues vs 13 OOS leagues. Is it overfit?
Bootstrap significance: permutation p-value and 95% CI.
Regime stratification: season phase × side × signal interaction.

Acceptance threshold: marginal ROI ≥ +2pp OR marginal CLV ≥ +0.5pp, with p < 0.05 and IS/OOS consistency within 3pp.

What Worked

Signal 1: League Portfolio Filter — ✅ ACCEPTED

Ted's insight: Not all leagues are equally bettable. Championship has better value than La Liga because the vig hits harder in leagues with more bookmaker attention and sharper lines.

Hypothesis: Remove structurally unprofitable leagues (Segunda at -16.2% ROI, La Liga at -13.1%, Ligue 2 at -9.9%) to cut dead weight from the portfolio.

Phase	Result
Standalone	Treatment (excl. 3 leagues): N=12,486, ROI=-2.36%. Control (3 leagues only): N=2,679, ROI=-5.83%. Delta: +3.47pp.
Marginal	Baseline: -3.28% ROI (-128.7u). With filter: -1.19% ROI (-38.6u). Removed 684 bets. Delta: +2.09pp.
IS/OOS	IS: +2.38pp improvement. OOS: +1.96pp improvement. Gap: 0.42pp (consistent).
Bootstrap	Permutation p = 0.033. 95% CI: [-4.23%, +1.83%].
Regime	Removed bets underperform in ALL regimes: early (-12.35% ROI), mid (-11.02%), late (-19.41%). Not regime-dependent.

Verdict: ACCEPT. The three worst leagues are structurally unprofitable across all seasons, all regimes, and both IS and OOS. The late-season removed pool is particularly ugly at -19.41% ROI. This isn't variance — it's a calibration mismatch that persists.

Why does this work? Our MI model uses closing odds to calibrate team strength. In leagues with lower volume and wider spreads (Segunda, Ligue 2), the closing line itself is less efficient — meaning our "edge" over it is partly measurement error, not real edge. The model sees +11% CLV in these leagues because the market is less efficient, but the market's inefficiency is in *both directions*, and we're not systematically on the right side.

Signal 2: Home AH Conditional Rescue — 🔍 INVESTIGATE (Promising)

Ted's insight: Home field advantage isn't dead everywhere. Certain home contexts still have value — particularly when the away team is genuinely weak (bottom quarter).

Hypothesis: Instead of keeping all 1,795 home AH bets (which run -6.85% ROI), only keep those where the opponent is bottom-quarter.

Phase	Result
Standalone	Treatment (away + selective home): N=10,072, ROI=-2.68%. Control (removed home bets): N=5,093, ROI=-3.56%.
Marginal	Baseline: -3.28% ROI (-128.7u). With filter: -1.04% ROI (-25.6u). Removed 1,456 bets. Delta: +2.24pp.
IS/OOS	IS: +3.89pp. OOS: +1.60pp. Gap: 2.29pp (consistent, within 3pp).
Bootstrap	Permutation p = 0.567. Not significant.
Regime	Kept home bets (bottom-quarter opponents): -5.88% ROI. All removed home bets: -7.08% ROI. Away bets: -0.27% ROI.

Verdict: INVESTIGATE. The directional improvement is real (+2.24pp), IS/OOS is consistent, but bootstrap doesn't reach significance. The p=0.567 means we can't rule out that the improvement is chance. But the mechanism is sound — away bets are dramatically better than home bets across all regimes, and keeping only the strongest home bets (vs weak opponents) reduces the damage.

This signal is at the boundary. We'll shadow-deploy it (show badges on /picks, don't gate bets) and revisit after 20+ live paper-trade bets.

The Combined Stack: Negative to Positive

When we combine both passing signals — league filter + home AH rescue — the portfolio flips:

Metric	Baseline	With Stack
N	3,927	2,055
CLV	+10.97%	+10.98%
ROI	-3.28%	+0.97%
P&L	-128.7u	+19.9u
IS ROI	-1.28%	+4.30%
OOS ROI	-4.06%	-0.37%
IS/OOS gap	—	1.89pp (consistent)

The combined stack cuts the bet pool by 48% (3,927 → 2,055) and turns a -128.7u portfolio into a +19.9u portfolio. The IS leagues show strong +4.30% ROI; OOS is approximately breakeven at -0.37%.

The only caveat: the combined bootstrap p-value is 0.73 — not significant. This is expected when stacking two filters that each barely cross the threshold individually. But the 95% CI is [-2.65%, +4.64%], which means the *true* ROI is somewhere in that range and we can't confidently exclude zero.

The overlap between the two signals is low: only 14.3% of removed bets are flagged by both signals. They're independent — league filter removes bad-league bets, home-rescue removes bad-home bets. Together they cut from different angles.

Per-league breakdown with the combined stack:

League	Base ROI	Stack ROI	Δ
Championship	+10.2%	+12.7%	+2.5pp
Greek Super	+5.9%	+11.8%	+5.8pp
Belgian Pro	-1.9%	+6.2%	+8.1pp
Bundesliga-2	-0.4%	+6.0%	+6.4pp
League One	-0.5%	+5.7%	+6.2pp
Ligue 1	+1.0%	+6.6%	+5.6pp
EPL	-3.8%	+1.2%	+5.0pp
Serie A	-3.9%	-0.6%	+3.4pp
Portuguese Liga	-6.6%	-15.5%	-9.0pp

The stack improves 14 of 19 leagues. Portuguese Liga gets worse (tiny N=104, variance-driven). Most leagues flip from negative to positive.

What Didn't Work

Flat Track Bully Detection — ❌ REJECTED

Ted's claim: Teams like Gyökeres (31/39 goals against bottom-11 teams in Portugal) are overvalued when they face quality opponents. The market prices their overall goal rate, not their opponent-adjusted rate.

Test: Computed flat-track gap (GD vs bottom-half minus GD vs top-half) for each team. Filtered out bets where our backed team had gap ≥ 1.0 and was facing a top-quarter opponent.

Result: Removed 386 bets. Marginal ROI: -0.11pp. Not only didn't help — the removed bets had *better* CLV (+6.88%) than the kept bets (+6.28%) in standalone.

Why it failed: Our MI model already incorporates opponent strength through its loss function (matching Pinnacle odds which account for matchup quality). The flat-track effect is already priced into the model's output. There's nothing left to capture with a post-hoc filter.

Delayed Cumulative Fatigue — ❌ REJECTED

Ted's claim: Fatigue from fixtures isn't immediate — it compounds over 4-6 weeks. Teams with heavy cumulative match load perform worse.

Test: Counted matches in last 30 days per team. Filtered bets where backed team had 7+ matches in 30 days.

Result: Only 71 bets removed (1.8% of pool). Marginal ROI: -0.01pp. Effectively no signal.

Why it failed: Having 7+ matches in 30 days is extremely rare in leagues without European competition. The threshold captures almost nothing. A lower threshold (6+) would flag too many normal matches. The signal exists in theory but the match calendar makes it nearly impossible to test — most teams play at roughly the same cadence within a league.

This is a data-availability failure, not a concept failure. If we had European fixture data cross-referenced with domestic matches, we could test the actual Ted hypothesis: teams in CL/EL show domestic deterioration 4-6 weeks after dense European schedules.

Manager First-Season Fade — ❌ REJECTED (Wrong Direction)

Ted's claim: Market overprices the "new manager bounce." Teams with new coaches should be faded.

Test: Used Fotmob coach history to tag bets where our backed team had a first-season manager (different coach from previous season).

Result: Removing first-season-manager bets makes things WORSE: -0.50pp marginal ROI. First-season manager bets actually have better ROI (-0.99%) than the average (-3.28%).

Why it inverted: This is a classic wrong-direction discovery (per our post-mortem protocol: investigate, don't just reject). New managers at established clubs often represent *improvement* — the previous manager was fired for underperformance. The market may be *underpricing* the new manager effect, not overpricing it.

Action: Per protocol, we're registering the opposite hypothesis: "Bets on teams with first-season managers outperform because the market is slow to price in coaching upgrades." This needs its own test with a tighter window (not just "first season" but specifically the first 10-15 matches under new management).

Opponent Overrating — ❌ REJECTED

Test: Filtered out bets where opponent was top-quarter with genuinely good shot metrics (shots diff ≥ 2.0). Kept bets against top-quarter opponents with poor shot metrics (overrated).

Result: +0.21pp marginal ROI — directionally correct but too small. IS/OOS diverged: +2.91% IS vs -0.84% OOS. The signal is overfit to the 6 development leagues.

Shot-Quality Divergence — ❌ REJECTED

Test: Filtered out bets on teams whose results wildly exceed their shot quality (shot-result gap ≥ 0.3, i.e., winning more than shots justify).

Result: +0.41pp marginal ROI. Directionally correct — shot-quality overperformers do regress. But the effect is too weak to justify removing 983 bets (25% of pool).

Crystallization Window — ❌ REJECTED

Test: Filtered out late-season bets involving teams at extreme league positions (top-2 or bottom-2 in final 20% of season).

Result: +0.42pp marginal ROI. Consistent IS/OOS (+0.61% IS, +0.33% OOS). Highly significant bootstrap p=0.0002 for between-group CLV difference.

Why rejected despite significance: The CLV difference between groups is statistically significant (the model prices crystallized-team matches differently), but the ROI improvement (+0.42pp) doesn't meet the +2pp threshold. The model already partially captures this through the late-season regime adjuster (-30 confidence). Adding this filter on top doesn't add enough incremental value.

The Wrong-Direction Discovery

The manager-first-season result deserves special attention. Per protocol rule #6 from our post-mortem:

"Wrong-direction signals must be investigated, not just rejected. When a test finds the opposite of the hypothesis, this is NOT a rejection — it's a discovery."

The data says first-season managers have *better* CLV-to-ROI conversion than established managers. Here's the standalone:

Group	N	CLV	ROI
Established manager (kept)	13,486	+6.40%	-3.22%
First-season manager (removed)	1,679	+5.78%	-0.99%

Lower CLV but dramatically better ROI. The CLV-to-ROI gap (the "calibration gap") is 2.2pp tighter for new managers. This suggests the market is less efficient at pricing teams undergoing coaching transitions — exactly where Ted says the edge should be, just in the *opposite direction* from his claim.

We've registered manager-new-bounce-boost as a pending signal. Hypothesis: teams with new managers in their first 10-15 matches provide better value because the market is slow to price in tactical improvement from coaching upgrades.

What We Learned

1. Portfolio construction matters more than signal hunting

The two signals that worked aren't clever. They're obvious: stop betting on leagues where we lose money, and stop betting on home teams that lose money. No new data sources, no complex feature engineering. Just cutting dead weight.

2. The MI model already prices most of what Ted identifies

Flat-track bully detection, opponent quality adjustment, shot-quality divergence — these are all things Ted uses for human overlay decisions. But our model, which ingests Pinnacle closing odds, already has this information baked in. The market is smart about team quality matchups. The model's edge comes from calibration precision, not from missing obvious matchup factors.

3. Fatigue signals need cross-competition data

The delayed fatigue hypothesis couldn't be tested because our data is domestic-only. Teams play roughly the same cadence within a league. The real fatigue differential comes from European competition — and we don't have that cross-referenced. This is a data gap, not a concept gap.

4. Wrong-direction results are discoveries

The manager signal flipping direction is the most interesting finding. It doesn't improve the portfolio today, but it points to a mechanism we haven't explored: the market underpricing coaching transitions. This could become a conviction signal with tighter temporal framing.

Deployment Plan

League filter: Deploy immediately. Remove Segunda, La Liga, Ligue 2 from AH bet pool.
Home AH rescue: Shadow-deploy. Show badges on /picks, log in activeSignals, don't gate bets until 20+ live paper-trade bets validate.
Manager new-bounce: Register as pending signal. Design narrow-window test (first 10-15 matches under new management).
Delayed fatigue: Park until European fixture data is available.
All others: Confirmed rejected. Added to signal-registry graveyard with full results.

Expected Impact

Scenario	N	ROI	P&L
Current baseline	3,927	-3.28%	-128.7u
+ League filter only	3,243	-1.19%	-38.6u
+ Both signals (shadow)	2,055	+0.97%	+19.9u

The league filter alone saves 90.1 units. If the home AH rescue validates in live trading, the combined stack puts us in positive territory for the first time in the backtest.

Running the Tests Yourself

# All 8 signals
npx tsx scripts/test-ted-canon-2.ts

# Single signal
npx tsx scripts/test-ted-canon-2.ts --signal league-filter
npx tsx scripts/test-ted-canon-2.ts --signal home-ah-rescue

Results are saved to data/backtest/ted-canon-2-results.json with full per-signal breakdown including standalone, marginal, IS/OOS, and bootstrap statistics.

REJECTEDSignal: league-filter,home-ah-rescue|2026-03-19

Phase	Result
Standalone	Treatment (excl. 3 leagues): N=12,486, ROI=-2.36%. Control (3 leagues only): N=2,679, ROI=-5.83%. Delta: +3.47pp.
Marginal	Baseline: -3.28% ROI (-128.7u). With filter: -1.19% ROI (-38.6u). Removed 684 bets. Delta: +2.09pp.
IS/OOS	IS: +2.38pp improvement. OOS: +1.96pp improvement. Gap: 0.42pp (consistent).
Bootstrap	Permutation p = 0.033. 95% CI: [-4.23%, +1.83%].
Regime	Removed bets underperform in ALL regimes: early (-12.35% ROI), mid (-11.02%), late (-19.41%). Not regime-dependent.

Phase	Result
Standalone	Treatment (away + selective home): N=10,072, ROI=-2.68%. Control (removed home bets): N=5,093, ROI=-3.56%.
Marginal	Baseline: -3.28% ROI (-128.7u). With filter: -1.04% ROI (-25.6u). Removed 1,456 bets. Delta: +2.24pp.
IS/OOS	IS: +3.89pp. OOS: +1.60pp. Gap: 2.29pp (consistent, within 3pp).
Bootstrap	Permutation p = 0.567. Not significant.
Regime	Kept home bets (bottom-quarter opponents): -5.88% ROI. All removed home bets: -7.08% ROI. Away bets: -0.27% ROI.