Figure Helix Vision Language Action Model Humanoid Robots
Figure’s Helix is a new Vision-Language-Action (VLA) model that can help the company’s robots pick up nearly any small household object, including various items they have never encountered before, simply by following natural language prompts. This model basically unifies perception, language understanding, and learned control to overcome multiple longstanding challenges in robotics.