

Finding Theta Θ

  • Home
  • Blog Categories
    
    Publications
    Tutorials
    Projects
    Guides
  • Projects
    
    AlphaRacerApex
  • Contact

Search for articles


Tutorials

Solving Gymnasium's Lunar Lander with Deep Q Learning (DQN)

Michael Kudlaty
Michael Kudlaty

July 1, 2024
Solving Gymnasium's Lunar Lander with Deep Q Learning (DQN)

Lastest articles

Aligning and Augmenting Intelligence: A Technical Survey of Reinforcement Learning in Large Language Models
Projects

Aligning and Augmenting Intelligence: A Technical Survey of Reinforcement Learning in Large Language Models

Learn how Reinforcement Learning from Human Feedback (RLHF) and emerging techniques like DPO are used to train safer, more helpful, and aligned Large Language Models (LLMs).

Michael Kudlaty
Michael Kudlaty

October 1, 2025
The Unseen Hand: Guiding a Virtual Drone with Sparse and Dense Rewards
Projects

The Unseen Hand: Guiding a Virtual Drone with Sparse and Dense Rewards

Exploring training a drone with reinforcement learning, focusing on the trade-offs between sparse and dense rewards to achieve agile flight while avoiding unintended "reward hacking."

Michael Kudlaty
Michael Kudlaty

September 1, 2025
Ultimate Guide to Contextual Bandits: From Theory to Python Implementation
Projects

Ultimate Guide to Contextual Bandits: From Theory to Python Implementation

Discover the ultimate guide to contextual bandits, covering everything from core theory and key algorithms to a complete Python implementation with code for building powerful personalization and recommendation systems

Michael Kudlaty
Michael Kudlaty

August 1, 2025
Mastering Autonomy: A Comprehensive Guide to the AutoDRIVE Ecosystem and Reinforcement Learning
Projects

Mastering Autonomy: A Comprehensive Guide to the AutoDRIVE Ecosystem and Reinforcement Learning

A complete guide to the AutoDRIVE ecosystem. Learn to train self-driving car agents from scratch using reinforcement learning in this powerful Unity-based simulator.

Michael Kudlaty
Michael Kudlaty

July 1, 2025
Agentic AI: The Autonomous Evolution of Machine Learning and Its Dawn in Robotics
Guides

Agentic AI: The Autonomous Evolution of Machine Learning and Its Dawn in Robotics

Explore Agentic AI, the next frontier in machine learning. Discover how autonomous agents learn, act independently, and are revolutionizing the field of robotics.

Michael Kudlaty
Michael Kudlaty

June 1, 2025
Serving Up Some Robotics: Setting Up a Tennis Environment in MuJoCo
Tutorials

Serving Up Some Robotics: Setting Up a Tennis Environment in MuJoCo

Build a MuJoCo robot tennis simulation! Learn to set up a wall tennis environment, tackle physics/control challenges, understand its architecture, and improve it for robotics or reinforcement learning projects with MuJoCo

Michael Kudlaty
Michael Kudlaty

May 1, 2025
Bridging Worlds: How Visual Language Action Models are Teaching Robots to See, Understand, and Act
Projects

Bridging Worlds: How Visual Language Action Models are Teaching Robots to See, Understand, and Act

Understand how Visual Language Action Models (VLAs) let robots follow commands by fusing AI vision and language for action, demonstrated with a conceptual Python robotics simulation

Michael Kudlaty
Michael Kudlaty

April 1, 2025
From Zero to Dino-Roar: Teaching a T-Rex to Walk with MuJoCo and Reinforcement Learning
Projects

From Zero to Dino-Roar: Teaching a T-Rex to Walk with MuJoCo and Reinforcement Learning

Build a bipedal T-Rex model in MuJoCo and lay the groundwork for teaching it to walk using reinforcement learning techniques like PPO or SAC. Get the XML, Python setup, and conceptual RL overview

Michael Kudlaty
Michael Kudlaty

March 1, 2025
Using Reinforcement Learning for Stock Trading with FinRL
Projects

Using Reinforcement Learning for Stock Trading with FinRL

Learn stock trading with Reinforcement Learning (RL) and FinRL. This guide covers RL theory, FinRL setup, agent training (DQN, PPO), backtesting, and practical considerations, including code examples and pitfalls.

Michael Kudlaty
Michael Kudlaty

February 1, 2025
Mastering Robotic Manipulation with Reinforcement Learning: TQC and DDPG for Fetch Environments
Tutorials

Mastering Robotic Manipulation with Reinforcement Learning: TQC and DDPG for Fetch Environments

Using reinforcement learning (RL), specifically Truncated Quantile Critics (TQC) and Deep Deterministic Policy Gradient (DDPG), to solve the Fetch environments in Gymnasium Robotics

Michael Kudlaty
Michael Kudlaty

January 1, 2025
Beginner's Guide to Model-Based Reinforcement Learning (MBRL) with Atari's Breakout
Tutorials

Beginner's Guide to Model-Based Reinforcement Learning (MBRL) with Atari's Breakout

Learn Model-Based Reinforcement Learning: Build sample-efficient RL agents by modeling environment dynamics and planning with Python in tasks like Atari's Breakout

Michael Kudlaty
Michael Kudlaty

December 1, 2024
Using Multi-Agent Reinforcement Learning to play OpenSpiel's Connect 4 with Ray's RLlib
Projects

Using Multi-Agent Reinforcement Learning to play OpenSpiel's Connect 4 with Ray's RLlib

Discover how self-play can be used to train a reinforcement learning agent to master Connect 4, achieving advanced strategies without human intervention

Michael Kudlaty
Michael Kudlaty

November 1, 2024
Mastering Atari's Pong with Reinforcement Learning: Overcoming Sparse Rewards and Optimizing Performance
Projects

Mastering Atari's Pong with Reinforcement Learning: Overcoming Sparse Rewards and Optimizing Performance

Train an RL agent to master Atari's Pong with sparse rewards and high-dimensional inputs. Explore preprocessing, replay buffers, and performance-boosting strategies

Michael Kudlaty
Michael Kudlaty

October 1, 2024
Solving Gymnasium's Car Racing with Reinforcement Learning
Projects

Solving Gymnasium's Car Racing with Reinforcement Learning

Learn how to apply reinforcement learning to solve Gymnasium's Car Racing game, see how different algorithms perform, and explore whether discrete or continuous action spaces are better.

Michael Kudlaty
Michael Kudlaty

September 1, 2024
Comparing how PPO, SAC, and DQN Perform on Gymnasium's Lunar Lander
Tutorials

Comparing how PPO, SAC, and DQN Perform on Gymnasium's Lunar Lander

Explore how different On-Policy and Off-Policy reinforcement learning algorithms perform on Gymnasium's Lunar Lander

Michael Kudlaty
Michael Kudlaty

August 1, 2024

Subscribe to newsletter


Thanks for joining our newsletter
Oops! Something went wrong while submitting the form.

