InfoCapability

Physical Intelligence releases π0 and π0-FAST: Vision-Language-Action Models for LeRobot

AI Impact Summary

Physical Intelligence has released π0 and π0-FAST, Vision-Language-Action (VLA) models designed for general robot control, leveraging pre-training and flow matching for dexterous manipulation. These models, available in the LeRobot repository, represent a significant step towards versatile robot intelligence by integrating vision, language, and action, addressing the challenge of bridging the gap between AI and the physical world. The models utilize a state token to represent the robot's environment, action tokens to define motor commands, and a flow matching approach for generating smooth, real-time action trajectories.

Affected Systems

π0π0-FAST

Date: Date not specified
Change type: capability
Severity: info

Physical Intelligence releases π0 and π0-FAST: Vision-Language-Action Models for LeRobot

More from Hugging Face

Get alerts for Hugging Face