当前位置:网站首页>[Detr for 3D object detection] detr3d: 3D object detection from multi view images via 3D-to-2D queries
[Detr for 3D object detection] detr3d: 3D object detection from multi view images via 3D-to-2D queries
2022-07-25 19:09:00 【Bit reachable duck】
DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries
Brief introduction of the paper :
This paper introduces a framework for multi camera 3D target detection . The existing work is to estimate the 3D bounding box directly from monocular images , Or use the depth prediction network to generate the input of three-dimensional target detection from two-dimensional information , Unlike the , The method in this paper operates prediction directly in three-dimensional space .
DETR3D Extract two-dimensional features from multiple camera images , Then use a sparse set of 3D Object query to index these two-dimensional features , Use the camera conversion matrix to 3D The location is linked to the multi view image , Then the bounding box prediction is performed for each object query , Use the set to set loss to measure the difference between the ground truth and the prediction .
This top-down approach is better than the bottom-up approach , That is, the object boundary box prediction follows the depth estimation per pixel , Because it is not affected by the composite error introduced by the depth prediction model . Besides , This method does not require post-processing , If not the maximum inhibition , Significantly improve the reasoning speed , And in nuScenes The self driving benchmark has achieved the most advanced performance .
Contribution of thesis :
- The original name is based on RGB 3D object detection model of image . Different from the existing work ,DETR3D At the last stage
边栏推荐
- Fearless of high temperature and rainstorm, how can Youfu network protect you from worry?
- Basic mode of music theory
- 常用的开发软件下载地址
- Modelsim and quartus jointly simulate PLL FIFO and other IP cores
- With 8 years of product experience, I have summarized these practical experience of continuous and efficient research and development
- 【小程序开发】常用组件及基本使用详解
- 果链“围城”:傍上苹果,是一场甜蜜与苦楚交错的旅途
- The difference between QT exec and show
- How to change the chords after the tune of the song is changed
- telnet安装以及telnet(密码正确)无法登录!
猜你喜欢

Baklib: make excellent product instruction manual

【DETR用于3D目标检测】3DETR: An End-to-End Transformer Model for 3D Object Detection

SQL Server 2019 installation tutorial

2022 IAA industry category development insight series report - phase II

Clip can also do segmentation tasks? The University of Gottingen proposed a model clipseg that uses text and image prompt and can do three segmentation tasks at the same time, squeezing out the clip a

【919. 完全二叉树插入器】

【Web技术】1391- 页面可视化搭建工具前生今世

21 days proficient in typescript-4 - type inference and semantic check

基于FPGA的1080P 60Hz BT1120接口调试过程记录
![[iniparser] simple use of the project configuration tool iniparser](/img/2b/1d20b4ef44dfe2544891d9c72b676e.png)
[iniparser] simple use of the project configuration tool iniparser
随机推荐
[encryption weekly] has the encryption market recovered? The cold winter has not thawed yet! Check the major events in the encryption market last week!
qt之编译成功但程序无法运行
Pymoo learning (5): convergence analysis
Clip can also do segmentation tasks? The University of Gottingen proposed a model clipseg that uses text and image prompt and can do three segmentation tasks at the same time, squeezing out the clip a
房地产行业大洗牌
Basic music theory -- configuring chords
【919. 完全二叉树插入器】
Introduction of this course (Introduction to machine learning)
HTTP缓存通天篇,可能有你想要的
这种动态规划你见过吗——状态机动态规划之股票问题(上)
Virtual machine VMware installation steps (how to install software in virtual machine)
小程序毕设作品之微信校园维修报修小程序毕业设计成品(8)毕业设计论文模板
How to create an effective help document?
小程序毕设作品之微信校园维修报修小程序毕业设计成品(6)开题答辩PPT
【DETR用于3D目标检测】DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries
How to change the chords after the tune of the song is changed
聊聊接口性能优化的11个小技巧
Modelsim and quartus jointly simulate PLL FIFO and other IP cores
Ping command details [easy to understand]
Gan, why ".Length! == 3??