Thomas Tarler ttarler

## 05-ensemble-decisioning-system.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 05-ensemble-decisioning-system.md
            
            
              Last active
              March 3, 2026 21:35
            
              
                Post 5: The Ensemble That Actually Trades — How PPO, SAC, and TD3 Became One Decisioning System
              
          
    The Ensemble That Actually Trades: How PPO, SAC, and TD3 Became One Decisioning System


The Problem With One Model

I spent weeks training a single PPO agent to make portfolio decisions. It worked well in calm markets. Then VIX spiked from 14 to 22 and the model froze — its learned policy didn't generalize to a volatility regime it hadn't trained in.
This isn't a novel observation. Any quantitative researcher who's deployed a single model to trade across regimes has hit this wall. The issue isn't the algorithm. It's that markets have structural breaks — volatility regimes, sector rotations, macro shocks — and no single model captures all of them.

  
## 01-intro-full-stack-ai-trading.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 01-intro-full-stack-ai-trading.md
            
            
              Created
              January 27, 2026 01:34
            
              
                Medium Post 1: I Built a Full-Stack AI Trading App with LLMs (comprehensive diagrams)
              
          
    I Built a Full-Stack AI Trading App with LLMs: Here's What I Learned

From Notebooks to Production with AI Pair Programmers

As a professional data scientist, I've spent years building ML models in Jupyter notebooks. But I wanted to go beyond research. I wanted to deploy a full-stack, production-grade AI trading system. The catch? I needed to supplement my data science expertise with full-stack development, infrastructure, and operations knowledge I didn't have.
Enter AI coding agents.
Over the past few months, I used LLMs (primarily Claude via Cursor) to build an automated trading system from scratch: FastAPI backend, Celery workers, Next.js frontend, PostgreSQL database, AWS infrastructure (ECS, SageMaker, EventBridge), and a complete ML pipeline for training and deploying models.

  
## 08-proof-it-works-jan-20-24-performance.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 08-proof-it-works-jan-20-24-performance.md
            
            
              Created
              January 27, 2026 01:10
            
              
                Medium Post 8: Proof It Works - A Week of Paper Trading Performance
              
          
    Proof It Works: A Week of Paper Trading Performance (Jan 20-24, 2026)

After All the Lessons, Does It Actually Work?

Seven posts on architecture, context management, testing, deployment, ML failures, and operational assumptions.
But here's the real question: Does the system actually work?
In this final post, I'll show you a week of paper trading performance—the proof that all those lessons, fixes, and patterns resulted in a functioning trading system.

  
## 07-scheduled-not-equal-working.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 07-scheduled-not-equal-working.md
            
            
              Created
              January 27, 2026 01:09
            
              
                Medium Post 7: 'Scheduled ≠ Working' and Other Expensive Assumptions
              
          
    "Scheduled ≠ Working" and Other Expensive Assumptions

The Assumption That Cost 7 Days

Celery beat logs:
[2025-11-18 09:32:00] Sending task: check_stop_loss_take_profit
[2025-11-18 09:34:00] Sending task: check_stop_loss_take_profit
[2025-11-18 09:36:00] Sending task: check_stop_loss_take_profit


## 06-ml-model-live-6-days-never-used.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 06-ml-model-live-6-days-never-used.md
            
            
              Created
              January 27, 2026 01:09
            
              
                Medium Post 6: The ML Model Was 'Live' for 6 Days—It Never Made a Single Decision
              
          
    The ML Model Was 'Live' for 6 Days—It Never Made a Single Decision

"Deployed and Working"

Jan 8: "RL model deployed to SageMaker endpoint. Trading with trained model."
Jan 9: "RL model is active and making portfolio decisions."
Jan 10-13: Portfolio trades every day. System reports "working."

  
## 05-cicd-deployment-with-ai-code.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 05-cicd-deployment-with-ai-code.md
            
            
              Created
              January 27, 2026 01:09
            
              
                Medium Post 5: CI/CD and Deployment When AI Writes the Code
              
          
    CI/CD and Deployment When AI Writes the Code

The Illusion: "Committed = Running"

I committed a critical fix at 5:51 PM. Pushed to GitHub. CI passed. Green checkmark.
At 6:15 PM, the bug was still happening.
Why? The Docker containers were still running the old code.

  
## 04-training-rl-portfolio-agent.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 04-training-rl-portfolio-agent.md
            
            
              Last active
              February 23, 2026 15:37
            
              
                Medium Post 4: Training an RL Portfolio Agent: 47 Commits of "Why Won't You Learn to Sell?"
              
          
    Training an RL Portfolio Agent: 47 Commits of "Why Won't You Learn to Sell?"


The Problem Nobody Warns You About

Every RL tutorial on the internet ends the same way: train PPO on CartPole, watch the reward curve go up, celebrate. Maybe they graduate to Atari and show you a DQN playing Breakout.
None of them prepare you for what happens when your action space has 25 continuous dimensions, your reward signal is delayed by days, and your agent discovers that the optimal policy is to buy everything and never sell.

  
## 03-building-the-ml-pipeline.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              1 star
            
          
                ttarler
                / 03-building-the-ml-pipeline.md
            
            
              Last active
              March 9, 2026 02:33
            
              
                Medium Post 3: Managing LLM Context So Your AI Coworker Doesn't Forget
              
          
    Building the ML Pipeline: From LSTM Experiments to Production RL


The ML System That Actually Trades

Go read any ML trading tutorial online. It's always the same: download some OHLCV data, train a model, show a backtest chart that goes up and to the right, collect Medium claps.
Nobody talks about what happens when you try to run that model every single day. When you need to retrain monthly without breaking the thing that's already trading. When the feature your model was trained on doesn't exist at inference time. When your processing job takes 8 hours and your data is stale by the time training starts.

  
## 02-architecture-evolution.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                ttarler
                / 02-architecture-evolution.md
            
            
              Last active
              February 2, 2026 19:37
            
              
                Medium Post 2: From Multi-Agent Chaos to a Single Execution Path
              
          
    From Multi-Agent Chaos to a Single Execution Path


I Built Something Too Clever for My Own Good

Here's a confession: when I started this project, I was way more interested in building something impressive than something that worked.
Multiple LLM agents coordinating decisions. A dozen strategies running in parallel. Complex fallback hierarchies. HuggingFace endpoints for model inference. It was beautiful on the whiteboard.