Language-Conditioned Affordance-Pose Detection
in 3D Point Clouds

Toan Nguyen1,2       Minh Nhat Vu3,4       Baoru Huang5       Tuan Van Vo1
Vy Truong1       Ngan Le6       Thieu Vo7       Bac Le2       Anh Nguyen8

1FPT Software AI Center   2VNUHCM-University of Science   3ACIN - TU Wien   4Austrian Institute of Technology   5Imperial College London  
6University of Arkansas   7Ton Duc Thang University   8University of Liverpool
HyperNeRF architecture.

Our framework allows the simultaneous detection of affordance region and corresponding supporting poses given the input point cloud object and an arbitrary affordance text.

Abstract

Affordance detection and pose estimation are of great importance in many robotic applications. Their combination helps the robot gain an enhanced manipulation capability, in which the generated pose can facilitate the corresponding affordance task. Previous methods for affodance-pose joint learning are limited to a predefined set of affordances, thus limiting the adaptability of robots in real-world environments. In this paper, we propose a new method for language-conditioned affordance-pose joint learning in 3D point clouds. Given a 3D point cloud object, our method detects the affordance region and generates appropriate 6-DoF poses for any unconstrained affordance label. Our method consists of an open-vocabulary affordance detection branch and a language-guided diffusion model that generates 6-DoF poses based on the affordance text. We also introduce a new high-quality dataset for the task of language-driven affordance-pose joint learning. Intensive experimental results demonstrate that our proposed method works effectively on a wide range of open-vocabulary affordances and outperforms other baselines by a large margin. In addition, we illustrate the usefulness of our method in real-world robotic applications.

BibTeX

@inproceedings{nguyen2024language,
      title={Language-Conditioned Affordance-Pose Detection in 3D Point Clouds},
      author={Nguyen, Toan and Vu, Minh Nhat and Huang, Baoru and Van Vo, Tuan and Truong, Vy and Le, Ngan and Vo, Thieu and Le, Bac and Nguyen, Anh},
      booktitle = ICRA,
      year      = {2024}
  }

Acknowledgements

We borrow the page template from HyperNeRF. Special thanks to them!