Method

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching [MatchStereo]
https://github.com/TingmanYan/MatchAttention

Submitted on 29 Aug. 2025 11:02 by
Tingman Yan (Dalian University of Technology)

Running time: 0.05 s
Environment: GPU @ 2.5 GHz (Python + C/C++)

Method Description:
Cross-view matching is fundamentally achieved
through cross-attention mechanisms. However,
matching of high-resolution images remains
challenging due to the quadratic complexity and
lack of explicit matching constraints in the
existing cross-attention. This paper proposes an
attention mechanism, MatchAttention, that
dynamically matches relative positions. The
relative position determines the attention
sampling center of the key-value pairs given a
query. Continuous and differentiable sliding-
window attention sampling is achieved by the
proposed BilinearSoftmax. The relative positions
are iteratively updated through residual
connections across layers by embedding them into
the feature channels. Since the relative position
is exactly the learning target for cross-view
matching, an efficient hierarchical cross-view
decoder, MatchDecoder, is designed with
MatchAttention as its core component. To handle
cross-view occlusions, gated cross-MatchAttention
and a consistency-constrained loss are
Parameters:
MatchStereo-B, 76M parameters
LaTeX BibTeX:
@article{yan2025matchattention,
title={MatchAttention: Matching the Relative
Positions for High-Resolution Cross-View Matching},
author={Tingman Yan and Tao Liu and Xilian Yang
and Qunfei Zhao and Zeyang Xia},
journal={arXiv preprint arXiv:2510.14260},
year={2025}
}

Detailed Results

This page provides detailed results for the selected method(s). For each of the first 20 test images, the table lists the percentage of erroneous pixels at each error threshold, together with the average error. Underneath, the left input image, the disparity / end-point error map and the estimated (and interpolated) disparity / optical flow map are shown. The error map scales linearly between 0 (black) and >=5 (white) pixels error. Red denotes all occluded pixels, falling outside the image boundaries. The false color map is scaled to the largest ground truth disparity / flow value.
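The table columns and the error map scaling described above can be illustrated with a minimal sketch. It is not the benchmark's evaluation code: the function names, the sample error values, the strict `>` comparison against the threshold, and the 8-bit gray output are all assumptions made for illustration. In this sketch, the Out-* columns correspond to the percentage of evaluated pixels whose error exceeds the threshold, and the Avg-* columns to the mean error; the Noc variants would be computed over non-occluded pixels only and the All variants over every pixel with ground truth.

```python
def outlier_percentage(errors, threshold):
    """Percentage of evaluated pixels whose error exceeds `threshold` (in pixels).

    Assumption: a pixel counts as erroneous when error > threshold.
    """
    if not errors:
        raise ValueError("no pixels to evaluate")
    bad = sum(1 for e in errors if e > threshold)
    return 100.0 * bad / len(errors)


def average_error(errors):
    """Mean end-point error over the evaluated pixels (in pixels)."""
    if not errors:
        raise ValueError("no pixels to evaluate")
    return sum(errors) / len(errors)


def error_to_gray(error, max_error=5.0):
    """Map an error linearly to a gray level: 0 px -> 0 (black), >= max_error -> 255 (white)."""
    clipped = min(max(error, 0.0), max_error)
    return round(255 * clipped / max_error)


# Hypothetical end-point errors (in pixels) for eight evaluated pixels.
errors = [0.1, 0.4, 2.5, 3.2, 0.3, 6.0, 0.2, 4.5]

for t in (2, 3, 4, 5):
    print(f"{t} pixels: {outlier_percentage(errors, t):.2f} %")
print(f"average: {average_error(errors):.2f} px")
print([error_to_gray(e) for e in errors])
```

Because every error that exceeds a larger threshold also exceeds a smaller one, the outlier percentage can only stay equal or decrease from the 2-pixel row to the 5-pixel row, which matches the pattern in the tables below.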

Test Set Average

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.63 % 2.08 % 0.4 px 0.4 px
3 pixels 1.09 % 1.39 % 0.4 px 0.4 px
4 pixels 0.85 % 1.06 % 0.4 px 0.4 px
5 pixels 0.68 % 0.85 % 0.4 px 0.4 px

Reflective Regions

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 5.53 % 6.64 % 0.7 px 0.8 px
3 pixels 2.99 % 3.60 % 0.7 px 0.8 px
4 pixels 1.95 % 2.31 % 0.7 px 0.8 px
5 pixels 1.39 % 1.63 % 0.7 px 0.8 px

Test Image 0

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.03 % 1.25 % 0.3 px 0.3 px
3 pixels 0.78 % 1.00 % 0.3 px 0.3 px
4 pixels 0.64 % 0.87 % 0.3 px 0.3 px
5 pixels 0.57 % 0.80 % 0.3 px 0.3 px

Test Image 1

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.84 % 2.87 % 0.4 px 0.4 px
3 pixels 1.18 % 1.72 % 0.4 px 0.4 px
4 pixels 0.95 % 1.31 % 0.4 px 0.4 px
5 pixels 0.79 % 1.02 % 0.4 px 0.4 px

Test Image 2

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.82 % 0.87 % 0.2 px 0.3 px
3 pixels 0.64 % 0.68 % 0.2 px 0.3 px
4 pixels 0.49 % 0.53 % 0.2 px 0.3 px
5 pixels 0.44 % 0.48 % 0.2 px 0.3 px

Test Image 3

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.27 % 0.29 % 0.2 px 0.2 px
3 pixels 0.15 % 0.15 % 0.2 px 0.2 px
4 pixels 0.10 % 0.10 % 0.2 px 0.2 px
5 pixels 0.09 % 0.09 % 0.2 px 0.2 px

Test Image 4

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.45 % 0.58 % 0.2 px 0.3 px
3 pixels 0.27 % 0.34 % 0.2 px 0.3 px
4 pixels 0.23 % 0.30 % 0.2 px 0.3 px
5 pixels 0.21 % 0.28 % 0.2 px 0.3 px

Test Image 5

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.67 % 0.89 % 0.2 px 0.3 px
3 pixels 0.45 % 0.65 % 0.2 px 0.3 px
4 pixels 0.34 % 0.43 % 0.2 px 0.3 px
5 pixels 0.25 % 0.26 % 0.2 px 0.3 px

Test Image 6

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 3.00 % 4.21 % 0.5 px 0.5 px
3 pixels 1.57 % 2.17 % 0.5 px 0.5 px
4 pixels 1.01 % 1.25 % 0.5 px 0.5 px
5 pixels 0.83 % 0.99 % 0.5 px 0.5 px

Test Image 7

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.98 % 1.14 % 0.4 px 0.4 px
3 pixels 0.69 % 0.82 % 0.4 px 0.4 px
4 pixels 0.58 % 0.68 % 0.4 px 0.4 px
5 pixels 0.48 % 0.56 % 0.4 px 0.4 px

Test Image 8

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.86 % 0.94 % 0.3 px 0.3 px
3 pixels 0.60 % 0.62 % 0.3 px 0.3 px
4 pixels 0.48 % 0.48 % 0.3 px 0.3 px
5 pixels 0.37 % 0.37 % 0.3 px 0.3 px

Test Image 9

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.96 % 1.93 % 0.4 px 0.6 px
3 pixels 0.57 % 0.77 % 0.4 px 0.6 px
4 pixels 0.45 % 0.63 % 0.4 px 0.6 px
5 pixels 0.41 % 0.59 % 0.4 px 0.6 px

Test Image 10

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.21 % 1.71 % 0.3 px 0.4 px
3 pixels 0.86 % 0.94 % 0.3 px 0.4 px
4 pixels 0.68 % 0.74 % 0.3 px 0.4 px
5 pixels 0.61 % 0.65 % 0.3 px 0.4 px

Test Image 11

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.95 % 0.92 % 0.4 px 0.4 px
3 pixels 0.56 % 0.54 % 0.4 px 0.4 px
4 pixels 0.38 % 0.37 % 0.4 px 0.4 px
5 pixels 0.32 % 0.31 % 0.4 px 0.4 px

Test Image 12

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 2.98 % 4.96 % 0.5 px 0.7 px
3 pixels 2.31 % 4.11 % 0.5 px 0.7 px
4 pixels 2.14 % 3.82 % 0.5 px 0.7 px
5 pixels 1.43 % 3.01 % 0.5 px 0.7 px

Test Image 13

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.02 % 1.02 % 0.3 px 0.3 px
3 pixels 0.68 % 0.68 % 0.3 px 0.3 px
4 pixels 0.56 % 0.56 % 0.3 px 0.3 px
5 pixels 0.48 % 0.48 % 0.3 px 0.3 px

Test Image 14

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.21 % 0.23 % 0.2 px 0.2 px
3 pixels 0.14 % 0.14 % 0.2 px 0.2 px
4 pixels 0.13 % 0.13 % 0.2 px 0.2 px
5 pixels 0.12 % 0.12 % 0.2 px 0.2 px

Test Image 15

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 2.16 % 2.13 % 0.4 px 0.4 px
3 pixels 1.44 % 1.40 % 0.4 px 0.4 px
4 pixels 0.96 % 0.93 % 0.4 px 0.4 px
5 pixels 0.51 % 0.50 % 0.4 px 0.4 px

Test Image 16

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.26 % 1.24 % 0.4 px 0.4 px
3 pixels 0.73 % 0.72 % 0.4 px 0.4 px
4 pixels 0.49 % 0.48 % 0.4 px 0.4 px
5 pixels 0.35 % 0.34 % 0.4 px 0.4 px

Test Image 17

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 2.76 % 3.07 % 0.4 px 0.5 px
3 pixels 1.78 % 2.02 % 0.4 px 0.5 px
4 pixels 1.35 % 1.48 % 0.4 px 0.5 px
5 pixels 1.14 % 1.24 % 0.4 px 0.5 px

Test Image 18

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 0.32 % 0.32 % 0.2 px 0.2 px
3 pixels 0.26 % 0.26 % 0.2 px 0.2 px
4 pixels 0.19 % 0.19 % 0.2 px 0.2 px
5 pixels 0.17 % 0.16 % 0.2 px 0.2 px

Test Image 19

Error Out-Noc Out-All Avg-Noc Avg-All
2 pixels 1.00 % 0.97 % 0.4 px 0.4 px
3 pixels 0.72 % 0.70 % 0.4 px 0.4 px
4 pixels 0.60 % 0.58 % 0.4 px 0.4 px
5 pixels 0.54 % 0.53 % 0.4 px 0.4 px