Google DeepMind released a video on YouTube on September 25 showing that its humanoid robots can now perform complex, multistep tasks using multimodal reasoning. In a series of tests ...
The new models, dubbed Gemini Robotics 1.5 and Gemini Robotics-ER 1.5, greatly expand on the capabilities of the original version to handle multistep, "long-horizon" tasks and are a significant ...