A Simple Demonstration of Camera Only Perception-Navigation with Metric3D for an Indoor Robot #4830
Owen-Liuyuxuan
started this conversation in
Show and tell
Replies: 1 comment 1 reply
-
Hello I am also working on autoware and I have just started working on autonomous vehicles. I'd very grateful |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello Community,
We are excited to share an update on our internal training project at TIER IV, where we are developing a camera-only perception navigation stack for indoor robots, utilizing Autoware and a monocular depth prediction model, specifically Metric3D. This model converts images from a single camera into dense point clouds, which then serve as the sole input for the Autoware Perception stack. Our aim is to assess the efficacy of the camera-only perception and planning modules within an indoor robotic setup.
Overview of Metric3D
Metric3D is an advanced monocular depth prediction model that processes a single image to estimate the depth of each pixel. Key characteristics of Metric3D include:
Screencast.from.2024.05.28.17.44.17.webm
Exporting to ONNX and ROS2
Our contributions to the community include code for converting Metric3D's pretrained models to the ONNX format, facilitating integration with different systems. The code can be accessed here: YvanYin/Metric3D#102
Additionally, we developed a ROS2 module for processing images into point clouds, which are then aligned with the robot’s base_link for use in Autoware: https://github.com/Owen-Liuyuxuan/ros2_vision_inference
Performance Benchmarks
We evaluated the model's inference time across various platforms at its native resolution (616 x 1064).
Despite acceptable performance on higher-end GPUs, the model struggles on less powerful devices like the Jetson Xavier without further optimization. But we can conduct the following test with the robot running on my laptop.
Modifications to Autoware
Modifications were necessary within the Autoware framework to accommodate the unique inputs from Metric3D:
tier4_perception_launch/launch/perception.launch.xml
to accept data from Metric3D.Test on Indoor Robot
Our preliminary tests with an indoor robot have provided valuable insights, helping us identify areas for enhancement.
VID_20240530_143957.mp4
Future Directions
Looking forward, we aim to refine our approach by:
Special Thanks
Special thanks to Metric3D for the great work.
Beta Was this translation helpful? Give feedback.
All reactions