Study sandbox · last verified 2026-05-08

Physical AI in 2026 — research, projects, and a learning roadmap.

Sixteen cross-cutting research docs (including a glossary, a canonical reading list, and a four-part automotive-industry history), three interactive tutorials, and twenty hands-on projects, organized around the four-loop data flywheel: collect → curate → label → eval.

Start with the overview → · Explore tutorials · Browse projects · See the connections graph

Research docs

~780 min total reading

Hands-on projects

From laptop to H100

Phases

Data fluency → Strategy

Loops

Collect · Curate · Label · Eval

Research

All 16 docs →

00 · ~30m

Overview — Physical AI in 2026

Cross-cutting synthesis of the field: data is the bottleneck, the data engine is the product, the modular/E2E pendulum is dissolving.

01 · ~90m

AV industry and data

Who collects what at what scale: Tesla, Waymo, Mobileye, Wayve, Waabi. Fleet logs vs customer-shadow vs sim vs world models.

02 · ~60m

Robotics foundation models

The VLA recipe — RT-X, OpenVLA, π0/π0.5, Helix, Gemini Robotics. Open X-Embodiment, demo collection economics.

03 · ~90m

Simulation and synthetic data

The competitor map. Applied Intuition Simian, Nvidia Cosmos + Isaac, CARLA, Waymax, the death of pure-rendering shops.

04 · ~120m

Labeling and data curation

The most important doc for the role. Data-engine philosophy; FiftyOne, SAM 2, and OpenSCENARIO tooling; foundation models as labelers.

05 · ~60m

World models and generative

Cosmos, GAIA, GR00T-Dreams, Wayve PRISM-1. World models as data engine and as eval substrate.

Interactive tutorials

All 3 tutorials →

Self-contained guides with live simulations — open one full-screen and poke at it.

6 parts · ~75m

LiDAR & the Autonomy Stack

A six-part interactive series walking up the stack from the physical sensor to the data engine everything depends on: point clouds, perception, SLAM, corruption, robustness, and building a simulator. Grounded in published benchmarks; figures current as of early 2026.

3 parts · ~45m

SLAM & Localization

Three interactive field guides on how self-driving cars know where they are: a tour of the SLAM back-end (ORB-SLAM, LOAM, Cartographer), the localization spectrum the industry is converging on, and why one LiDAR sweep does localization and perception at once.

Guide · ~40m

Physical AI Data Strategy Explorer

A deep dive into 30 companies across self-driving and robotics in the US, China, and beyond — how they collect real-world data, manufacture synthetic data, auto-label, and evaluate. The 2025–26 shift: the flywheel collapsing into a single foundation/world model that drives, simulates, and evaluates at once.

The 9-phase project arc

All 20 projects →

Phase A · Data fluency · 2 projects
Phase B · Labeling fundamentals · 3 projects
Phase C · Production hygiene · 1 project
Phase D · Simulation and world models · 4 projects
Phase E · Robotics adjacency · 2 projects
Phase F · Behavior, sim agents, closed-loop · 3 projects
Phase G · Active learning + capstone · 2 projects
Phase H · Strategy · 1 project
Phase I · Frontier extensions (Round 3) · 2 projects

Compressed-time critical chain

If you must compress 18 weeks into 6, this is the minimum-viable order. One thing not to skip: project 18 — the strategy memo.

01 FiftyOne scenario mining →
02 MCAP / ROS 2 / Foxglove plumbing →
03 SAM 2 + Grounding DINO auto-labeling →
04 BEVFormer 3D detection →
06 Privacy + provenance pipeline →
07 CARLA OpenSCENARIO scenarios →
15 Bench2Drive closed-loop eval →
16 Active learning loop →
17 BDD100K mini data-engine (capstone) →
18 Strategy memo

Four loops

Every project is tagged by which part of the data flywheel it touches.

COLLECT · Fleet logs, customer-shadow, simulation, world-model generation. Weeks–months.
CURATE · Triage, embedding mining, scenario taxonomy, dedup, slicing. Hours–days.
LABEL · Auto-label, human verify, distill, pretrain, fine-tune. Days–weeks.
EVAL · Open-loop, closed-loop sim, scenario coverage, safety case. Continuous.
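
One way to picture the loop tagging is as a small lookup table from project to the set of loops it touches. The sketch below is illustrative only: the project titles come from the critical chain, but the loop assignments, the `PROJECT_LOOPS` table, and both helper functions are assumptions made for this example, not the site's actual metadata.

```python
from enum import Enum


class Loop(Enum):
    """The four loops of the data flywheel, in flywheel order."""
    COLLECT = "collect"
    CURATE = "curate"
    LABEL = "label"
    EVAL = "eval"


# Hypothetical tags: titles are taken from the critical chain, but these
# loop assignments are illustrative guesses, not the site's real tagging.
PROJECT_LOOPS = {
    "01 FiftyOne scenario mining": {Loop.CURATE},
    "03 SAM 2 + Grounding DINO auto-labeling": {Loop.LABEL},
    "07 CARLA OpenSCENARIO scenarios": {Loop.COLLECT},
    "15 Bench2Drive closed-loop eval": {Loop.EVAL},
    "16 Active learning loop": {Loop.CURATE, Loop.LABEL},
}


def projects_for(loop):
    """Return the titles of projects tagged with the given loop, sorted."""
    return sorted(title for title, loops in PROJECT_LOOPS.items() if loop in loops)


def uncovered_loops():
    """Return loops that no tagged project touches, in flywheel order."""
    covered = set().union(*PROJECT_LOOPS.values())
    return [loop for loop in Loop if loop not in covered]
```

Calling `projects_for(Loop.LABEL)` lists every project tagged with the label loop, and `uncovered_loops()` gives a quick check that a chosen subset of projects (such as a compressed plan) still exercises all four loops.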