TY - GEN
T1 - Complex-Valued Embeddings of Generic Proximity Data
AU - Münch, Maximilian
AU - Straat, Michiel
AU - Biehl, Michael
AU - Schleif, Frank-Michael
PY - 2021
Y1 - 2021
N2 - Proximities are at the heart of almost all machine learning methods. In a more generic view, objects are compared by a (symmetric) similarity or dissimilarity measure, which may not obey particular mathematical properties. This renders many machine learning methods invalid, leading to convergence problems and the loss of generalization behavior. In many cases, the preferred dissimilarity measure is not metric. If the input data are non-vectorial, like text sequences, proximity-based learning is used or embedding techniques can be applied. Standard embeddings lead to the desired fixed-length vector encoding, but are costly and are limited in preserving the full information. As an information preserving alternative, we propose a complex-valued vector embedding of proximity data, to be used in respective learning approaches. In particular, we address supervised learning and use extensions of prototype-based learning. The proposed approach is evaluated on a variety of standard benchmarks showing good performance compared to traditional techniques in processing non-metric or non-psd proximity data.
AB - Proximities are at the heart of almost all machine learning methods. In a more generic view, objects are compared by a (symmetric) similarity or dissimilarity measure, which may not obey particular mathematical properties. This renders many machine learning methods invalid, leading to convergence problems and the loss of generalization behavior. In many cases, the preferred dissimilarity measure is not metric. If the input data are non-vectorial, like text sequences, proximity-based learning is used or embedding techniques can be applied. Standard embeddings lead to the desired fixed-length vector encoding, but are costly and are limited in preserving the full information. As an information preserving alternative, we propose a complex-valued vector embedding of proximity data, to be used in respective learning approaches. In particular, we address supervised learning and use extensions of prototype-based learning. The proposed approach is evaluated on a variety of standard benchmarks showing good performance compared to traditional techniques in processing non-metric or non-psd proximity data.
UR - https://link.springer.com/chapter/10.1007%2F978-3-030-73973-7_2
U2 - 10.1007/978-3-030-73973-7_2
DO - 10.1007/978-3-030-73973-7_2
M3 - Conference contribution
SN - 978-3-030-73973-7
T3 - Lecture Notes in Computer Science
SP - 14
EP - 23
BT - Structural, Syntactic, and Statistical Pattern Recognition
A2 - Torsello, Andrea
A2 - Rossi, Luca
A2 - Pelillo, Marcello
A2 - Biggio, Battista
A2 - Robles-Kelly, Antonio
PB - Springer International Publishing
CY - Cham
ER -