Research notes

Methodology, backtests, durable findings. Informal write-ups with full data tables, not polished essays. Every number is reproducible from the open-source sma200-bt harness.

Informational and educational use only. Specific portfolio compositions discussed here are research outputs, not allocation recommendations. Full disclaimer →

Durable findings

Cross-study conclusions that hold across multiple tests and regimes. Each links to its supporting evidence.

01

The SMA200 trend filter is real on unleveraged equity.

Over 86 years of S&P 500 data the filter adds +0.18 Sharpe over plain buy-and-hold. The benefit holds in 8 of 9 decades, with the only underperformance in the 2010s QE bull. Most durable single-result on the entire research sheet.

Evidence: SMA200 on Extended/Synthetic Long Windows
02

The SMA200 filter is a portfolio construction tool, not a single-asset rescue.

Sharpe lift from the filter scales with how much leveraged-equity vol-decay sits in the portfolio: +0.238 on a 75/25 UPRO/UGL split, +0.019 on a pure 3x equity position with no defensive. Don't expect the filter to "fix" 100% TQQQ; do expect it to make heavy-equity portfolios survivable.

Evidence: Borrow-Cost Correction: Synthetic LETF Methodology + Revi..., Portfolio Archetypes Search: Finding the Sweet Spot
03

Gold helps as the defensive bucket; long bonds hurt.

Across 21 years on a leveraged-equity/defensive strategy, 100% gold gives Sharpe +0.032 over plain T-bills. 100% TLT (long Treasuries, ZROZ-proxy) finishes worst on Sharpe AND worst on max drawdown. The 2022 rate shock broke the bond/equity inverse correlation that pre-2022 portfolio theory assumed.

Evidence: Defensive Bucket Comparison: Cash vs Gold vs TLT vs Combinations, Borrow-Cost Correction: Synthetic LETF Methodology + Revi...
04

Managed futures show promise as a third defensive option.

DBMF (only 2019+ data so far) gives +0.076 Sharpe over plain T-bills and the best single-asset max drawdown protection (-47%) in a 7y test window. Regime-dependent until longer data confirms (the window captured 2022 which was uniquely good for trend-following CTAs).

Evidence: Borrow-Cost Correction: Synthetic LETF Methodology + Revi...
05

Synthetic LETF backtests must model borrow cost to be trusted.

The simple L*daily − ER/252 formula used in most retail synthetic LETF backtests overstates real TQQQ by 62% over 2015-2024. Adding (L-1)*borrow_rate/252 (Testfolio's approach, matching real swap-financing mechanics) tracks real TQQQ to within 5%. This is a methodology requirement, not a refinement.

Evidence: Borrow-Cost Correction: Synthetic LETF Methodology + Revi...
06

Volatility decay destroys 3x leverage above ~25-30% underlying vol.

SPY at 19% vol handles 3x leverage. QQQ at 27% handles it poorly. ^SOX at 37% is destroyed by it. For high-vol underlyings, 2x with a trend filter dominates 3x with a trend filter on multi-decade windows.

Evidence: Leverage vs Volatility per Underlying: 1x vs 3x, B&H vs SMA200, Full Leverage Spectrum: 1x/2x/3x × B&H/SMA200 across 5 Un...
07

Heavy-gold beats heavy-equity in Sharpe terms over 25 years.

The Sharpe-optimal UPRO/UGL ratio over 2000-2026 is roughly 33% UPRO / 67% UGL (Sharpe 0.745). Common "75/25 leveraged equity/gold" intuition gives Sharpe 0.536 on the same window. Gold's risk-adjusted return over a window with two major gold bulls materially exceeded leveraged equity's.

Evidence: Portfolio Archetypes Search: Finding the Sweet Spot, Borrow-Cost Correction: Synthetic LETF Methodology + Revi...
08

Walk-forward parameter tuning on leveraged ETFs is statistical noise.

Train→test Sharpe correlation across ~60 parameter combinations on TQQQ/SOXL/UPRO: +0.19, -0.09, +0.09. Real predictive correlations would be 0.4-0.7. The "optimal" parameters from one decade have no predictive power for the next decade. Beware of in-sample optimization marketed as backtested edge.

Evidence: LRS Family: Replication + Walk-Forward Parameter Sweep
09

SMA200 filter helps equities; it hurts metals.

Applying the same filter to gold reduces Sharpe from 0.71 to 0.59 over 25 years; on silver from 0.49 to 0.34. Metals have slower bull/bear cycles and choppy ranges that produce too many false signals. Apply trend filtering to equity sleeves, not metals sleeves.

Evidence: Full Leverage Spectrum: 1x/2x/3x × B&H/SMA200 across 5 Un...
10

Rebalance frequency matters less than asset selection.

Daily and yearly rebalancing tie for best Sharpe on a tested 9-asset leveraged portfolio (~0.945); middle frequencies cluster at 0.91-0.93. For tax efficiency in taxable accounts, yearly wins. Don't over-optimize this dial.

Evidence: Rebalance Frequency Study on a 9-Asset Leveraged Portfolio

All studies

🟢 May 19, 2026 Methodology
Borrow-Cost Correction: Synthetic LETF Methodology + Revised Numbers
The simple L*daily − ER/252 synthetic LETF formula overstates real TQQQ by 62%. Adding the (L-1)*borrow_rate/252 term tracks real TQQQ within 5%. Walk through the bug, the fix, and revised numbers for synthetic TQQQ 1999-2026 + defensive bucket comparison.
🟡 May 17, 2026 Leveraged ETFs
Full Leverage Spectrum: 1x/2x/3x × B&H/SMA200 across 5 Underlyings
Tests every cell of the leverage × filter matrix across SPY, QQQ, ^SOX, gold, and silver over 25.7 years. Finds a vol break-point at ~25-30%: below it, more leverage helps; above it, vol decay destroys returns even with the filter on.
🟢 May 17, 2026 Portfolio construction
Rebalance Frequency Study on a 9-Asset Leveraged Portfolio
Tests daily / weekly / monthly / quarterly / yearly / 30%-band rebalancing on a tested 9-asset leveraged portfolio. Finds that rebalance frequency matters far less than people assume; daily and yearly tie for best Sharpe, middle frequencies are slightly worse.
🟡 May 17, 2026 Portfolio construction
Portfolio Archetypes Search: Finding the Sweet Spot
Tests 10 candidate portfolio compositions across both stress (2000-2026) and bull (2012-2026) windows, plus sensitivity sweeps for UPRO/UGL ratio and a BTC sleeve. Headline: heavy-gold beats heavy-equity in Sharpe, and SMA200 on the equity sleeve dominates every other configuration.
🟡 May 17, 2026 Defensive bucket
Defensive Bucket Comparison: Cash vs Gold vs TLT vs Combinations
Across 21 years, tested what to hold during SMA200-filter OFF periods on a leveraged-equity strategy. Gold > combinations > cash > long Treasuries. The TLT/ZROZ result is the most surprising and durable: long bonds finish worst on both Sharpe and max drawdown.
🟢 May 17, 2026 Methodology
LRS Family: Replication + Walk-Forward Parameter Sweep
Replicates the Testfolio LRS-style trend strategies on TQQQ, SOXL, UPRO with walk-forward parameter sweeps. The brutal finding: train-test Sharpe correlation is near zero, meaning 'optimal' parameters don't predict next-decade performance.
🟡 May 17, 2026 Leveraged ETFs
Leverage vs Volatility per Underlying: 1x vs 3x, B&H vs SMA200
Per-underlying analysis of SPY, QQQ, and ^SOX showing how leverage-vs-vol-decay interacts with the SMA200 filter. Establishes the vol-break-point hypothesis that the full leverage spectrum study later confirmed.
🟡 May 14, 2026 Leveraged ETFs
SMA200 on Extended/Synthetic Long Windows
Three long-window tests: synthetic TQQQ since 1999, unleveraged S&P 500 since 1940, synthetic 3x S&P since 1940. The unleveraged S&P decade table is the load-bearing finding; the synthetic LETF portions are superseded by the May 19 borrow-cost correction.
🟢 May 13, 2026 Leveraged ETFs
LETF Inception Backtest: Buy-and-Hold vs SMA200 Filter
Per-fund backtest comparing buy-and-hold against the SMA200 trend filter on SOXL, TQQQ, and UPRO from each fund's launch through 2026. Results show the filter dramatically reduces max drawdown on every fund (-80%+ to -30-40%) with similar long-term Sharpe.

Reproducibility

All synthetic LETF return series in these studies come from sma200-bt, an open-source Python module calibrated against real TQQQ to within 5%. Test the numbers yourself:

pip install sma200-bt

Real-fund data comes from yfinance. Underlying indices and macroeconomic series are public; no proprietary data is used anywhere in these studies.