Abstract: In this paper, we propose an efficient multi-level convolution architecture for 3D visual grounding. Conventional methods are difficult to meet the requirements of real-time inference due to ...
🗞️[Nov. 7th '24] Updated and released our video on youtube, and added a preview for our data exploration website. 🗞️[Oct. 31th '24] Release of a script to visualize motions and to save the ...