From Words and Vision to Robot Grasps: Leveraging Large Models for Object Manipulation

Overview

Preliminary Results