Speaking robot: Our new AI model translates vision and language into robotic actions