On Tree-Based Methods for Similarity Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In many situations, the choice of an adequate similarity measure or metric on the feature space dramatically determines the performance of machine learning methods. Building automatically such measures is the specific purpose of metric/similarity learning. In [21], similarity learning is formulated as a pairwise bipartite ranking problem: ideally, the larger the probability that two observations in the feature space belong to the same class (or share the same label), the higher the similarity measure between them. From this perspective, the (formula presented) curve is an appropriate performance criterion and it is the goal of this article to extend recursive tree-based (formula presented) optimization techniques in order to propose efficient similarity learning algorithms. The validity of such iterative partitioning procedures in the pairwise setting is established by means of results pertaining to the theory of U-processes and from a practical angle, it is discussed at length how to implement them by means of splitting rules specifically tailored to the similarity learning task. Beyond these theoretical/methodological contributions, numerical experiments are displayed and provide strong empirical evidence of the performance of the algorithmic approaches we propose.

Original languageEnglish
Title of host publicationMachine Learning, Optimization, and Data Science - 5th International Conference, LOD 2019, Proceedings
EditorsGiuseppe Nicosia, Panos Pardalos, Renato Umeton, Giovanni Giuffrida, Vincenzo Sciacca
PublisherSpringer
Pages676-688
Number of pages13
ISBN (Print)9783030375980
DOIs
Publication statusPublished - 1 Jan 2019
Externally publishedYes
Event5th International Conference on Machine Learning, Optimization, and Data Science, LOD 2019 - Siena, Italy
Duration: 10 Sept 201913 Sept 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11943 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th International Conference on Machine Learning, Optimization, and Data Science, LOD 2019
Country/TerritoryItaly
CitySiena
Period10/09/1913/09/19

Keywords

  • Metric-learning
  • Rate bound analysis
  • Similarity learning
  • Tree-based algorithms
  • U-processes

Fingerprint

Dive into the research topics of 'On Tree-Based Methods for Similarity Learning'. Together they form a unique fingerprint.

Cite this