Method

[st] Efficient Mamba-based Voxel Feature Modeling for Stereo 3D Object Detection [MSVN]
[Anonymous Submission]

Submitted on 8 Jun. 2026 09:27 by [Anonymous Submission]

Running time: 0.15 s
Environment: 1 core @ 2.5 GHz (Python)

Method Description:
Existing voxel-based stereo methods still suffer
from redundant voxel computation, limited global
dependency modeling, and insufficient local
geometric perception. This paper proposes MSVN, an
efficient Mamba-based voxel feature framework for
stereo 3D object detection.
Parameters:
The model is trained on the KITTI stereo 3D object
detection dataset without using any LiDAR data
during training. The batch size is set to 1, and
the network is trained for 120 epochs. AdamW is
adopted as the optimizer, with an initial learning
rate of 0.001 and a weight decay of 0.01.

Detailed Results

Object detection and orientation estimation results. Results for object detection are given in terms of average precision (AP) and results for joint object detection and orientation estimation are provided in terms of average orientation similarity (AOS).
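
The two metrics named above can be illustrated with a simplified sketch. It assumes AP is the mean of interpolated precision over N equally spaced recall levels (the default N = 40 here is an assumption; the source does not state the number of recall positions), and it computes the orientation similarity term that AOS averages over recall levels in the same way AP averages precision. The function names are hypothetical, and this is not the official benchmark evaluation code.

```python
import math


def interpolated_ap(recalls, precisions, num_points=40):
    """Mean interpolated precision over recall levels 1/N, 2/N, ..., 1.

    recalls and precisions are paired points of a precision-recall curve.
    ASSUMPTION: num_points is not stated in the source.
    """
    total = 0.0
    for i in range(1, num_points + 1):
        r = i / num_points
        # Interpolated precision: best precision at any recall >= r
        candidates = [p for rc, p in zip(recalls, precisions) if rc >= r]
        total += max(candidates, default=0.0)
    return total / num_points


def orientation_similarity(matched_deltas, num_detections):
    """Orientation similarity at one recall level.

    matched_deltas holds the angle error (radians) of each detection that
    was matched to a ground-truth box; unmatched detections contribute 0
    but still count in num_detections.
    """
    if num_detections == 0:
        return 0.0
    score = sum((1.0 + math.cos(d)) / 2.0 for d in matched_deltas)
    return score / num_detections
```

A perfect orientation (error 0) scores 1, an error of pi scores 0, so AOS can never exceed the corresponding detection AP. This matches the table, where each Orientation row is at or below its Detection row.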


Benchmark                     Easy      Moderate  Hard
Car (Detection)               96.30 %   93.42 %   87.92 %
Car (Orientation)             96.28 %   93.32 %   87.72 %
Car (3D Detection)            77.77 %   55.88 %   48.95 %
Car (Bird's Eye View)         84.63 %   67.17 %   59.88 %
Pedestrian (Detection)        69.38 %   55.61 %   51.41 %
Pedestrian (Orientation)      55.41 %   43.11 %   39.45 %
Pedestrian (3D Detection)     34.89 %   25.12 %   22.09 %
Pedestrian (Bird's Eye View)  40.47 %   29.62 %   26.40 %
Cyclist (Detection)           73.77 %   58.10 %   51.64 %
Cyclist (Orientation)         63.56 %   47.16 %   42.19 %
Cyclist (3D Detection)        58.09 %   37.75 %   32.76 %
Cyclist (Bird's Eye View)     62.39 %   41.59 %   36.65 %
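
Each row of the table above follows the pattern benchmark name, then Easy, Moderate, and Hard percentages. For readers who want the numbers in machine-readable form, the following is a minimal parsing sketch; the names `ROW` and `parse_row` are hypothetical and the regular expression only assumes the row layout shown here.

```python
import re

# Benchmark name followed by three "NN.NN %" fields, any amount of spacing
ROW = re.compile(
    r"^(?P<name>.+?)\s+"
    r"(?P<easy>\d+\.\d+)\s*%\s+"
    r"(?P<moderate>\d+\.\d+)\s*%\s+"
    r"(?P<hard>\d+\.\d+)\s*%\s*$"
)


def parse_row(line):
    """Return (name, easy, moderate, hard) or None for non-data lines."""
    match = ROW.match(line.strip())
    if match is None:
        return None  # e.g. the header line
    return (
        match["name"],
        float(match["easy"]),
        float(match["moderate"]),
        float(match["hard"]),
    )


row = parse_row("Car (3D Detection) 77.77 % 55.88 % 48.95 %")
```

Lines that do not end in three percentage fields, such as the header, return `None` and can be skipped by the caller.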


Figures (images not reproduced here): the page shows three groups of four plots, each group covering 2D object detection results, orientation estimation results, 3D object detection results, and bird's eye view results.



