
An AI agent configured a robot arm, used camera vision to locate objects, and slowly grabbed items. The system also trained another AI model to pick up and place specific objects. Robotics work that previously required substantial expertise is becoming easier as modern AI models handle more of the setup and control. AI-powered coding is presented as a way to connect reliable conventional engineering with more general vision-language-action models. A prebuilt open-source arm setup from HuggingFace, LeRobot 101, uses a controller arm operated by a person and a follower arm with a camera that replicates those movements. Training involves teleoperating the controller while the model learns follower actions from camera observations. After initial connection and calibration challenges, OpenClaw and Codex helped generate a Python program that detected a red ball and closed the gripper, including robot connection configuration, joint calibration, and object identification and gripping logic.
"The AI agent was able to configure the arm, use it to see and slowly grab things, and even train another AI model to pick up and place specific objects. And they say AGI is still a few years away! (I'm joking, it probably is). The results have me convinced that we may be on the brink of a robotics breakthrough. Training and controlling robots used to require considerable skill. Today's AI models can make it almost easy."
"“AI-powered coding is super exciting because it has the potential to bridge the gap between conventional engineering methods, which are reliable but don't generalize, and contemporary vision-language-action models, which generalize but are not yet reliable,” says Ken Goldberg, a roboticist at UC Berkeley who is exploring the approach."
"The LeRobot comes with two arms: a controller arm that a person operates using a handle and a trigger, and a follower arm with a camera that replicates those movements. You can train an AI model by teleoperating the controller arm and having the model learn how to move the follower in response to what it sees on the camera."
"Before using OpenClaw, I spent several hours trying to connect and calibrate the robot, at one point nearly breaking the motors by applying the wrong settings, which caused them to overheat. Then, with help from OpenClaw and Codex, I was able to vibe code a simple program that closed the claw's gripper when it spotted a red ball. In the terminal, Codex went through the tricky work of configuring the connections to the robot. Then, with my help, it calibrated the positions of its joints. It also wrote a Python script that used several libraries to identify and grip the ball in qu"
Read at WIRED
Unable to calculate read time
Collection
[
|
...
]