Robotics · Stanley Jacob

Projects

Matched 50-seed evaluation

Imitation learning for robot manipulation

Most robot manipulation policies today are learned from demonstrations: people teleoperate a robot, and a model is trained to reproduce that behavior. I wanted to work through that loop myself rather than just read about it. This project trains a diffusion policy from scratch on PushT, an open benchmark where a round pusher has to slide a T-shaped block onto a target, using Hugging Face's LeRobot library and dataset on a single consumer GPU. Because success rates over a few dozen rollouts are noisy and only matched comparisons mean much, the same evaluation harness runs the pretrained reference checkpoint under identical seeds, and every number ships with a confidence interval. The repo also includes a survey I wrote of the current open robot foundation models (GR00T, pi0, RDT2, OpenVLA, SmolVLA, and others), with the claims link-verified. Results, rollout clips, and the write-up live in the repo.

Imitation learning Diffusion policy LeRobot PushT PyTorch

Read the project write-up Source on GitHub ↗ VLA model survey

Monocular depth for navigation

Produces a dense depth map from a single camera frame and uses it for obstacle awareness and visual odometry, enabling navigation without a dedicated depth sensor.

Depth Anything PyTorch Visual odometry

Model: Depth-Anything-V2 ↗

Sensor fusion & state estimation

Tracks position and orientation by fusing inertial and visual measurements with an extended Kalman filter, holding an accurate pose estimate when GPS is unavailable.

Extended Kalman filter Sensor fusion C++

Projects

Imitation learning for robot manipulation

Monocular depth for navigation

Sensor fusion & state estimation

Notes