On AI & Robotics
The Physical AI Deployment Gap
- Major advances, trends in the last few years include:
- Challenge #1: distribution shift, aka discrepency from research to deployment- transitioning from orchestrated lab environment to picking arbitrary objects in cluttered, unstructured places when conditions eg lighting, object texture, background, vary. This compounds with other factors and compounds to an increasingly high failure.
- The most powerful VLA models often have high latency & insufficient control frequencies since they are largely based on transformers and have ginormous (billions) parameter counts.
- Ethical issues - need for developing safety frameworks for learning-based systems. How can they be adapted?
1. sim2real transfer success rate with fewer attempts; VLAs in the last few years have made cross-embodiment a focus; robot dexterity has also significantly improved - it’s now able to handle delicate, small, and deformable objects that used to pose too much uncertainty to the programming.
so… how do we move forward? —> revving up the robotics data flywheel
- Collecting high quality robotics data, then curating and annotating it in non-lab environments;
- collecting data in real world settings (through teleoperation infrastructure)
- collecting data while robots are generating useful work
- Ramping up reliability engineering strategies for failure fallback
- Studying failure, characterize the failure’s root cause
- Human assistance when the system detects its in unfamiliar territory
The State of the Robotics Ecosystem
🤠 What’s the future of robotics hardware?
Two metrics: 1, range of tasks that robots can do; 2, ultimate goal, mass manufacturing, which requires significant affordability.
🤠 What is this age of infinite leverage
- Real time localization when the real world is constantly updating but the map isn’t. Although the vehicle is able to reference the map and understand its whereabouts, it can’t predict spontaneous changes, e.g construction worker repainting double yellow lines.
- Zoox’s Solution: The team built ZRN Monitors to identify safety implications of these unanticipated environmental modifications & notify team of this potential update that’s required.
What self-driving cars can teach us about trust in Physical AI: Zoox
🤠 What’s special about Zoox?
- zoox vehicles have many external sensors mounted and boasts “superhuman detection”. Use both Physical and Semantic maps for extra safety.
- Their core technology CLAM uses gathered visual images and LiDAR data that the team collect by driving around the city in a Zoox vehicle with a retrofitted sensor architecture. Then they use machine learning to remove people, cyclists, cars, trucks from the urban landscapes so that the (sensors) can independently recognize these subjects.
🤠 Zoox’s technical challenges
- Real time localization when the real world is constantly updating but the map isn’t. Although the vehicle is able to reference the map and understand its whereabouts, it can’t predict spontaneous changes, e.g construction worker repainting double yellow lines.
- Zoox’s Solution: The team built ZRN Monitors to identify safety implications of these unanticipated environmental modifications & notify team of this potential update that’s required.
Robotics Control Via Embodied Chain of Thought (ECoT) Reasoning
🤠 How is ECoT more effective
intermediate reasoning tokens
Unlike purely textual CoT, ECoT requires the policy to “look carefully” by grounding reasoning in visual observations and robot proprioception.
proposal: 5-step freezing + asynchronous execution
limitation 1: ECoT is not yet flexible; it still has a fixed sequence and number of reasoning steps. having an adaptive approach might improve efficiency.
limitation 2: ECoT’s inference speed is slow even with optimizations so its application to high frequency control scenarios is limited
Why Design must evolve alongside technology: How mutualism can reframe the way we think about AI
🤠 Why might mutualism be key to a safe and sustainable relationship with Artificial Intelligence?
“It’s not about making technology more human; it’s about recognizing that emerging technological systems possess their own kind of aliveness—not biological, but behavioral, relational, and world-shaping. It no longer serves us to think of them simply as tools to be wielded; we must imagine them as colleagues, with needs and trajectories that intersect with our own.”
Data collection & implementation : What if companies approached data as a resource critical to mutual nourishment, centered in careful cultivation and respect?
This shift from a purely utilitarian, transactional approach to one of relational alliance
Creation of flexible systems with ability to adapt
Accounting for long term implications instead of immediate benefits of the next design
Feb 10th
America Cannot Lose The Robotics Race
“If we don’t act to avert it, this will be another Toyota moment, but on a much greater scale.
🤠 Why does China have a massive advantage?
- “Homegrown Chinese companies can now design and fabricate precision parts like harmonic reducers at competitive quality, cheaper prices, and – most importantly – colocated with their customers in manufacturing superclusters.”
- Many Chinese cities offer robotics companies complete tax deductions on their research expenses, generous subsidies, and preferential corporate income tax rate.
- Robotics in China is a dynamic and open market where market discipline naturally filters out those uncompetitive firms. This combined with strategic government support gives it a passive boost in lead.
🤠 Why is it hard for the US to maintain its lead in this robotics race?
The US’s highly restrictive regulatory approach constricts both demand and supply for robots - making it hard for enterprises to buy & startups to build robots.
Feb 11th
Six Things I Learned Watching a Robotics Startup Die from the Inside
Large Model Chauvinism - superior intelligence cannot compensate for poor hardware in humanoids.
K-scale’s crash was not a technical or a product failure, but an irreversible situation caused by the misalignment of rhythm, narrative, and cash flow.
🤠 Why did K-scale eventually crash?
Former CEO Ben said the failure was rooted in a critical strategic error of switching gears to go all-in on its expensive flagship robot, K-bot, for mass manufacturing over the cheaper Z-bot; they did these because a VC verbally promised to give $20M if they did. However by the time they were ready to launch, there was already a humanoid robot “demo-fatigue”; investors calmed down from their FOMO and switched from "looking at demos" to "looking at industrial level shipment volume" and they were unable to secure the necessary Series A funding.
On the other strategic level, K-scales fell on its subordination of physical engineering to AI optimism, but treating hardware as a secondary commodity rather than a primary constraint. Instead of the “software first, hardware eventually” mentality, Rui implies that humanoid teams should iterate in a tight loop where the hardware’s physical limits define the AI’s boundaries, and the AI’s needs dictate the hardware’s sensor suite.
🤠 What were the 6 lessons learned?
Feb 25th
π0: A Vision-Language-Action Flow Model for General Robot Control
🤠 What are the key differences that make pi0 transformative?
Feb 25th
Toward a Horizontal Robotics Platform
spate of ai-enabled products
The confluence of talent, capital, and technology in the field suggests we are in the midst of a robotics and embodied AI upswing
the Chat GPT moment: an inflection point when the technology experiences a mass-market product breakthrough
🤠 Why does overlapping bodies of research push progress?
🤠 ”AI scaling laws are now connected to robotics embodiment”
- scaling robotics like scaling LLMs - both as sequence modeling problems.
🤠 Why is commodification of robot solution providers for generic pick & place tasks an issue for existing robotics companies?
- commodification means buyers treat these robotics products as interchangeable solutions - since picking & placing has already been tackled, companies with similar capabilities lose their edge and venture appeal; they are forced to compete in prices.
🤠 How do robotics startups build their MOAT for anti-commodification then?
1️⃣ Software Intelligence Layer: solidifying the value is in adaptive learning, not hardware.
2️⃣ Data Exclusivity: if your deployed robots generate data others can’t access.
3️⃣ Platform Ecosystems: become the operating system for robot, kickstart the apps —>infra cycle
Noise is all you need to bridge the Sim-to-Real locomotion gap
🤠
A16z- Good News: AI will eat Application Software
🤠
March 5th
https://a16z.com/toward-a-general-purpose-robotics-platform/
Will scaling solve robotics?
core debate: Is training a large neural network on a very large dataset a feasible way to solve robotics?
What are the main arguments in favor of scaling as a solution to robotics?
it’s easy for the community to fall into a local minimum where we make a lot of progress that’s specific to the tabletop setting and therefore not generalizable. A similar thing could happen if we work predominantly in simulation



