sma200.trade
For informational and educational use only. Not investment advice. Past performance does not predict future returns. Read full disclaimer →

Research notes

Methodology, backtests, durable findings. Informal write-ups with full data tables, not polished essays. Every number is reproducible from the open-source sma200-bt harness.

Informational and educational use only. Specific portfolio compositions discussed here are research outputs, not allocation recommendations. Full disclaimer →

Durable findings

Cross-study conclusions that hold across multiple tests and regimes. Each links to its supporting evidence.

  1. 01

    The SMA200 trend filter is real on unleveraged equity.

    Over 86 years of S&P 500 data the filter adds +0.18 Sharpe over plain buy-and-hold. The benefit holds in 8 of 9 decades, with the only underperformance in the 2010s QE bull. Most durable single-result on the entire research sheet.

    Evidence: SMA200 on Extended/Synthetic Long Windows

  2. 02

    The SMA200 filter is a portfolio construction tool, not a single-asset rescue.

    Sharpe lift from the filter scales with how much leveraged-equity vol-decay sits in the portfolio: +0.238 on a 75/25 UPRO/UGL split, +0.019 on a pure 3x equity position with no defensive. Don't expect the filter to "fix" 100% TQQQ; do expect it to make heavy-equity portfolios survivable.

    Evidence: Borrow-Cost Correction: Synthetic LETF Methodology + Revi..., Portfolio Archetypes Search: Finding the Sweet Spot

  3. 03

    Gold helps as the defensive bucket; long bonds hurt.

    Across 21 years on a leveraged-equity/defensive strategy, 100% gold gives Sharpe +0.032 over plain T-bills. 100% TLT (long Treasuries, ZROZ-proxy) finishes worst on Sharpe AND worst on max drawdown. The 2022 rate shock broke the bond/equity inverse correlation that pre-2022 portfolio theory assumed.

    Evidence: Defensive Bucket Comparison: Cash vs Gold vs TLT vs Combinations, Borrow-Cost Correction: Synthetic LETF Methodology + Revi...

  4. 04

    Managed futures show promise as a third defensive option.

    DBMF (only 2019+ data so far) gives +0.076 Sharpe over plain T-bills and the best single-asset max drawdown protection (-47%) in a 7y test window. Regime-dependent until longer data confirms (the window captured 2022 which was uniquely good for trend-following CTAs).

    Evidence: Borrow-Cost Correction: Synthetic LETF Methodology + Revi...

  5. 05

    Synthetic LETF backtests must model borrow cost to be trusted.

    The simple L*daily − ER/252 formula used in most retail synthetic LETF backtests overstates real TQQQ by 62% over 2015-2024. Adding (L-1)*borrow_rate/252 (Testfolio's approach, matching real swap-financing mechanics) tracks real TQQQ to within 5%. This is a methodology requirement, not a refinement.

    Evidence: Borrow-Cost Correction: Synthetic LETF Methodology + Revi...

  6. 06

    Volatility decay destroys 3x leverage above ~25-30% underlying vol.

    SPY at 19% vol handles 3x leverage. QQQ at 27% handles it poorly. ^SOX at 37% is destroyed by it. For high-vol underlyings, 2x with a trend filter dominates 3x with a trend filter on multi-decade windows.

    Evidence: Leverage vs Volatility per Underlying: 1x vs 3x, B&H vs SMA200, Full Leverage Spectrum: 1x/2x/3x × B&H/SMA200 across 5 Un...

  7. 07

    Heavy-gold beats heavy-equity in Sharpe terms over 25 years.

    The Sharpe-optimal UPRO/UGL ratio over 2000-2026 is roughly 33% UPRO / 67% UGL (Sharpe 0.745). Common "75/25 leveraged equity/gold" intuition gives Sharpe 0.536 on the same window. Gold's risk-adjusted return over a window with two major gold bulls materially exceeded leveraged equity's.

    Evidence: Portfolio Archetypes Search: Finding the Sweet Spot, Borrow-Cost Correction: Synthetic LETF Methodology + Revi...

  8. 08

    Walk-forward parameter tuning on leveraged ETFs is statistical noise.

    Train→test Sharpe correlation across ~60 parameter combinations on TQQQ/SOXL/UPRO: +0.19, -0.09, +0.09. Real predictive correlations would be 0.4-0.7. The "optimal" parameters from one decade have no predictive power for the next decade. Beware of in-sample optimization marketed as backtested edge.

    Evidence: LRS Family: Replication + Walk-Forward Parameter Sweep

  9. 09

    SMA200 filter helps equities; it hurts metals.

    Applying the same filter to gold reduces Sharpe from 0.71 to 0.59 over 25 years; on silver from 0.49 to 0.34. Metals have slower bull/bear cycles and choppy ranges that produce too many false signals. Apply trend filtering to equity sleeves, not metals sleeves.

    Evidence: Full Leverage Spectrum: 1x/2x/3x × B&H/SMA200 across 5 Un...

  10. 10

    Rebalance frequency matters less than asset selection.

    Daily and yearly rebalancing tie for best Sharpe on a tested 9-asset leveraged portfolio (~0.945); middle frequencies cluster at 0.91-0.93. For tax efficiency in taxable accounts, yearly wins. Don't over-optimize this dial.

    Evidence: Rebalance Frequency Study on a 9-Asset Leveraged Portfolio

All studies

  • 🟢 Methodology
    Borrow-Cost Correction: Synthetic LETF Methodology + Revised Numbers

    The simple L*daily − ER/252 synthetic LETF formula overstates real TQQQ by 62%. Adding the (L-1)*borrow_rate/252 term tracks real TQQQ within 5%. Walk through the bug, the fix, and revised numbers for synthetic TQQQ 1999-2026 + defensive bucket comparison.

  • 🟡 Leveraged ETFs
    Full Leverage Spectrum: 1x/2x/3x × B&H/SMA200 across 5 Underlyings

    Tests every cell of the leverage × filter matrix across SPY, QQQ, ^SOX, gold, and silver over 25.7 years. Finds a vol break-point at ~25-30%: below it, more leverage helps; above it, vol decay destroys returns even with the filter on.

  • 🟢 Portfolio construction
    Rebalance Frequency Study on a 9-Asset Leveraged Portfolio

    Tests daily / weekly / monthly / quarterly / yearly / 30%-band rebalancing on a tested 9-asset leveraged portfolio. Finds that rebalance frequency matters far less than people assume; daily and yearly tie for best Sharpe, middle frequencies are slightly worse.

  • 🟡 Portfolio construction
    Portfolio Archetypes Search: Finding the Sweet Spot

    Tests 10 candidate portfolio compositions across both stress (2000-2026) and bull (2012-2026) windows, plus sensitivity sweeps for UPRO/UGL ratio and a BTC sleeve. Headline: heavy-gold beats heavy-equity in Sharpe, and SMA200 on the equity sleeve dominates every other configuration.

  • 🟡 Defensive bucket
    Defensive Bucket Comparison: Cash vs Gold vs TLT vs Combinations

    Across 21 years, tested what to hold during SMA200-filter OFF periods on a leveraged-equity strategy. Gold > combinations > cash > long Treasuries. The TLT/ZROZ result is the most surprising and durable: long bonds finish worst on both Sharpe and max drawdown.

  • 🟢 Methodology
    LRS Family: Replication + Walk-Forward Parameter Sweep

    Replicates the Testfolio LRS-style trend strategies on TQQQ, SOXL, UPRO with walk-forward parameter sweeps. The brutal finding: train-test Sharpe correlation is near zero, meaning 'optimal' parameters don't predict next-decade performance.

  • 🟡 Leveraged ETFs
    Leverage vs Volatility per Underlying: 1x vs 3x, B&H vs SMA200

    Per-underlying analysis of SPY, QQQ, and ^SOX showing how leverage-vs-vol-decay interacts with the SMA200 filter. Establishes the vol-break-point hypothesis that the full leverage spectrum study later confirmed.

  • 🟡 Leveraged ETFs
    SMA200 on Extended/Synthetic Long Windows

    Three long-window tests: synthetic TQQQ since 1999, unleveraged S&P 500 since 1940, synthetic 3x S&P since 1940. The unleveraged S&P decade table is the load-bearing finding; the synthetic LETF portions are superseded by the May 19 borrow-cost correction.

  • 🟢 Leveraged ETFs
    LETF Inception Backtest: Buy-and-Hold vs SMA200 Filter

    Per-fund backtest comparing buy-and-hold against the SMA200 trend filter on SOXL, TQQQ, and UPRO from each fund's launch through 2026. Results show the filter dramatically reduces max drawdown on every fund (-80%+ to -30-40%) with similar long-term Sharpe.

Reproducibility

All synthetic LETF return series in these studies come from sma200-bt, an open-source Python module calibrated against real TQQQ to within 5%. Test the numbers yourself:

pip install sma200-bt

Real-fund data comes from yfinance. Underlying indices and macroeconomic series are public; no proprietary data is used anywhere in these studies.