Method

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching [MatchStereo]
https://github.com/TingmanYan/MatchAttention

Submitted on 29 Aug. 2025 11:00 by
Tingman Yan (Dalian University of Technology)

Running time:0.05 s
Environment:GPU @ 2.5 Ghz (Python + C/C++)

Method Description:
Cross-view matching is fundamentally achieved
through cross-attention mechanisms. However,
matching of high-resolution images remains
challenging due to the quadratic complexity and
lack of explicit matching constraints in the
existing cross-attention. This paper proposes an
attention mechanism, MatchAttention, that
dynamically matches relative positions. The
relative position determines the attention
sampling center of the key-value pairs given a
query. Continuous and differentiable sliding-
window attention sampling is achieved by the
proposed BilinearSoftmax. The relative positions
are iteratively updated through residual
connections across layers by embedding them into
the feature channels. Since the relative position
is exactly the learning target for cross-view
matching, an efficient hierarchical cross-view
decoder, MatchDecoder, is designed with
MatchAttention as its core component. To handle
cross-view occlusions, gated cross-MatchAttention
and a consistency-constrained loss are
Parameters:
MatchStereo-B, 76M parameters
Latex Bibtex:
@article{yan2025matchattention,
title={MatchAttention: Matching the Relative
Positions for High-Resolution Cross-View Matching},
author={Tingman Yan and Tao Liu and Xilian Yang
and Qunfei Zhao and Zeyang Xia},
journal={arXiv preprint arXiv:2510.14260},
year={2025}
}

Detailed Results

This page provides detailed results for the method(s) selected. For the first 20 test images, the percentage of erroneous pixels is depicted in the table. We use the error metric described in Object Scene Flow for Autonomous Vehicles (CVPR 2015), which considers a pixel to be correctly estimated if the disparity or flow end-point error is <3px or <5% (for scene flow this criterion needs to be fulfilled for both disparity maps and the flow map). Underneath, the left input image, the estimated results and the error maps are shown (for disp_0/disp_1/flow/scene_flow, respectively). The error map uses the log-color scale described in Object Scene Flow for Autonomous Vehicles (CVPR 2015), depicting correct estimates (<3px or <5% error) in blue and wrong estimates in red color tones. Dark regions in the error images denote the occluded pixels which fall outside the image boundaries. The false color maps of the results are scaled to the largest ground truth disparity values / flow magnitudes.

Test Set Average

Error D1-bg D1-fg D1-all
All / All 1.34 2.32 1.50
All / Est 1.34 2.32 1.50
Noc / All 1.23 2.27 1.40
Noc / Est 1.23 2.27 1.40
This table as LaTeX

Test Image 0

Error D1-bg D1-fg D1-all
All / All 1.85 0.69 1.69
All / Est 1.85 0.69 1.69
Noc / All 1.83 0.69 1.67
Noc / Est 1.83 0.69 1.67
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 1

Error D1-bg D1-fg D1-all
All / All 1.60 2.23 1.67
All / Est 1.60 2.23 1.67
Noc / All 1.53 2.23 1.61
Noc / Est 1.53 2.23 1.61
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 2

Error D1-bg D1-fg D1-all
All / All 2.02 2.92 2.07
All / Est 2.02 2.92 2.07
Noc / All 1.97 2.92 2.01
Noc / Est 1.97 2.92 2.01
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 3

Error D1-bg D1-fg D1-all
All / All 2.29 3.45 2.40
All / Est 2.29 3.45 2.40
Noc / All 2.22 3.45 2.34
Noc / Est 2.22 3.45 2.34
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 4

Error D1-bg D1-fg D1-all
All / All 0.27 0.05 0.24
All / Est 0.27 0.05 0.24
Noc / All 0.27 0.05 0.24
Noc / Est 0.27 0.05 0.24
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 5

Error D1-bg D1-fg D1-all
All / All 2.15 2.52 2.18
All / Est 2.15 2.52 2.18
Noc / All 2.09 2.52 2.13
Noc / Est 2.09 2.52 2.13
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 6

Error D1-bg D1-fg D1-all
All / All 1.67 2.17 1.73
All / Est 1.67 2.17 1.73
Noc / All 1.71 2.17 1.76
Noc / Est 1.71 2.17 1.76
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 7

Error D1-bg D1-fg D1-all
All / All 0.27 2.24 0.65
All / Est 0.27 2.24 0.65
Noc / All 0.27 2.24 0.66
Noc / Est 0.27 2.24 0.66
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 8

Error D1-bg D1-fg D1-all
All / All 0.26 2.00 0.58
All / Est 0.26 2.00 0.58
Noc / All 0.25 2.00 0.57
Noc / Est 0.25 2.00 0.57
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 9

Error D1-bg D1-fg D1-all
All / All 0.28 0.85 0.42
All / Est 0.28 0.85 0.42
Noc / All 0.28 0.89 0.43
Noc / Est 0.28 0.89 0.43
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 10

Error D1-bg D1-fg D1-all
All / All 1.33 2.36 1.56
All / Est 1.33 2.36 1.56
Noc / All 1.34 2.36 1.57
Noc / Est 1.34 2.36 1.57
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 11

Error D1-bg D1-fg D1-all
All / All 1.21 0.46 1.07
All / Est 1.21 0.46 1.07
Noc / All 1.22 0.46 1.08
Noc / Est 1.22 0.46 1.08
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 12

Error D1-bg D1-fg D1-all
All / All 0.64 0.53 0.63
All / Est 0.64 0.53 0.63
Noc / All 0.49 0.53 0.49
Noc / Est 0.49 0.53 0.49
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 13

Error D1-bg D1-fg D1-all
All / All 0.58 0.15 0.53
All / Est 0.58 0.15 0.53
Noc / All 0.50 0.15 0.45
Noc / Est 0.50 0.15 0.45
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 14

Error D1-bg D1-fg D1-all
All / All 1.39 0.00 1.37
All / Est 1.39 0.00 1.37
Noc / All 1.25 0.00 1.23
Noc / Est 1.25 0.00 1.23
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 15

Error D1-bg D1-fg D1-all
All / All 2.08 0.11 1.90
All / Est 2.08 0.11 1.90
Noc / All 2.12 0.11 1.94
Noc / Est 2.12 0.11 1.94
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 16

Error D1-bg D1-fg D1-all
All / All 3.69 0.81 3.27
All / Est 3.69 0.81 3.27
Noc / All 3.54 0.81 3.13
Noc / Est 3.54 0.81 3.13
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 17

Error D1-bg D1-fg D1-all
All / All 1.10 0.04 0.99
All / Est 1.10 0.04 0.99
Noc / All 1.08 0.04 0.97
Noc / Est 1.08 0.04 0.97
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 18

Error D1-bg D1-fg D1-all
All / All 3.94 2.09 3.06
All / Est 3.94 2.09 3.06
Noc / All 3.86 2.09 3.01
Noc / Est 3.86 2.09 3.01
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 19

Error D1-bg D1-fg D1-all
All / All 0.63 0.29 0.60
All / Est 0.63 0.29 0.60
Noc / All 0.64 0.29 0.60
Noc / Est 0.64 0.29 0.60
This table as LaTeX

Input Image

D1 Result

D1 Error




eXTReMe Tracker