Skip to content

Instantly share code, notes, and snippets.

View ttarler's full-sized avatar

Thomas Tarler ttarler

  • Denver
View GitHub Profile
@ttarler
ttarler / 05-ensemble-decisioning-system.md
Last active March 3, 2026 21:35
Post 5: The Ensemble That Actually Trades — How PPO, SAC, and TD3 Became One Decisioning System

The Ensemble That Actually Trades: How PPO, SAC, and TD3 Became One Decisioning System

End-to-End Signal Flow

The Problem With One Model

I spent weeks training a single PPO agent to make portfolio decisions. It worked well in calm markets. Then VIX spiked from 14 to 22 and the model froze — its learned policy didn't generalize to a volatility regime it hadn't trained in.

This isn't a novel observation. Any quantitative researcher who's deployed a single model to trade across regimes has hit this wall. The issue isn't the algorithm. It's that markets have structural breaks — volatility regimes, sector rotations, macro shocks — and no single model captures all of them.

@ttarler
ttarler / 01-intro-full-stack-ai-trading.md
Created January 27, 2026 01:34
Medium Post 1: I Built a Full-Stack AI Trading App with LLMs (comprehensive diagrams)

I Built a Full-Stack AI Trading App with LLMs: Here's What I Learned

From Notebooks to Production with AI Pair Programmers

As a professional data scientist, I've spent years building ML models in Jupyter notebooks. But I wanted to go beyond research. I wanted to deploy a full-stack, production-grade AI trading system. The catch? I needed to supplement my data science expertise with full-stack development, infrastructure, and operations knowledge I didn't have.

Enter AI coding agents.

Over the past few months, I used LLMs (primarily Claude via Cursor) to build an automated trading system from scratch: FastAPI backend, Celery workers, Next.js frontend, PostgreSQL database, AWS infrastructure (ECS, SageMaker, EventBridge), and a complete ML pipeline for training and deploying models.

@ttarler
ttarler / 08-proof-it-works-jan-20-24-performance.md
Created January 27, 2026 01:10
Medium Post 8: Proof It Works - A Week of Paper Trading Performance

Proof It Works: A Week of Paper Trading Performance (Jan 20-24, 2026)

After All the Lessons, Does It Actually Work?

Seven posts on architecture, context management, testing, deployment, ML failures, and operational assumptions.

But here's the real question: Does the system actually work?

In this final post, I'll show you a week of paper trading performance—the proof that all those lessons, fixes, and patterns resulted in a functioning trading system.

@ttarler
ttarler / 07-scheduled-not-equal-working.md
Created January 27, 2026 01:09
Medium Post 7: 'Scheduled ≠ Working' and Other Expensive Assumptions

"Scheduled ≠ Working" and Other Expensive Assumptions

The Assumption That Cost 7 Days

Celery beat logs:

[2025-11-18 09:32:00] Sending task: check_stop_loss_take_profit
[2025-11-18 09:34:00] Sending task: check_stop_loss_take_profit
[2025-11-18 09:36:00] Sending task: check_stop_loss_take_profit
@ttarler
ttarler / 06-ml-model-live-6-days-never-used.md
Created January 27, 2026 01:09
Medium Post 6: The ML Model Was 'Live' for 6 Days—It Never Made a Single Decision

The ML Model Was 'Live' for 6 Days—It Never Made a Single Decision

"Deployed and Working"

Jan 8: "RL model deployed to SageMaker endpoint. Trading with trained model."

Jan 9: "RL model is active and making portfolio decisions."

Jan 10-13: Portfolio trades every day. System reports "working."

@ttarler
ttarler / 05-cicd-deployment-with-ai-code.md
Created January 27, 2026 01:09
Medium Post 5: CI/CD and Deployment When AI Writes the Code

CI/CD and Deployment When AI Writes the Code

The Illusion: "Committed = Running"

I committed a critical fix at 5:51 PM. Pushed to GitHub. CI passed. Green checkmark.

At 6:15 PM, the bug was still happening.

Why? The Docker containers were still running the old code.

@ttarler
ttarler / 04-training-rl-portfolio-agent.md
Last active February 23, 2026 15:37
Medium Post 4: Training an RL Portfolio Agent: 47 Commits of "Why Won't You Learn to Sell?"

Training an RL Portfolio Agent: 47 Commits of "Why Won't You Learn to Sell?"

RL Training Journey

The Problem Nobody Warns You About

Every RL tutorial on the internet ends the same way: train PPO on CartPole, watch the reward curve go up, celebrate. Maybe they graduate to Atari and show you a DQN playing Breakout.

None of them prepare you for what happens when your action space has 25 continuous dimensions, your reward signal is delayed by days, and your agent discovers that the optimal policy is to buy everything and never sell.

@ttarler
ttarler / 03-building-the-ml-pipeline.md
Last active March 9, 2026 02:33
Medium Post 3: Managing LLM Context So Your AI Coworker Doesn't Forget

Building the ML Pipeline: From LSTM Experiments to Production RL

Two-Stage ML Architecture

The ML System That Actually Trades

Go read any ML trading tutorial online. It's always the same: download some OHLCV data, train a model, show a backtest chart that goes up and to the right, collect Medium claps.

Nobody talks about what happens when you try to run that model every single day. When you need to retrain monthly without breaking the thing that's already trading. When the feature your model was trained on doesn't exist at inference time. When your processing job takes 8 hours and your data is stale by the time training starts.

@ttarler
ttarler / 02-architecture-evolution.md
Last active February 2, 2026 19:37
Medium Post 2: From Multi-Agent Chaos to a Single Execution Path

From Multi-Agent Chaos to a Single Execution Path

V1 vs V2 Architecture Comparison

I Built Something Too Clever for My Own Good

Here's a confession: when I started this project, I was way more interested in building something impressive than something that worked.

Multiple LLM agents coordinating decisions. A dozen strategies running in parallel. Complex fallback hierarchies. HuggingFace endpoints for model inference. It was beautiful on the whiteboard.