Orthogonal Matching Pursuit for Text Classification

Konstantinos Skianis, Nikolaos Tziortziotis, Michalis Vazirgiannis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In text classification, the problem of overfitting arises due to the high dimensionality, making regularization essential. Although classic regularizers provide sparsity, they fail to return highly accurate models. On the contrary, state-of-the-art group-lasso regularizers provide better results at the expense of low sparsity. In this paper, we apply a greedy variable selection algorithm, called Orthogonal Matching Pursuit, for the text classification task. We also extend standard group OMP by introducing overlapping Group OMP to handle overlapping groups of features. Empirical analysis verifies that both OMP and overlapping GOMP constitute powerful regularizers, able to produce effective and very sparse models.

Original languageEnglish
Title of host publication4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages93-103
Number of pages11
ISBN (Electronic)9781948087797
Publication statusPublished - 1 Jan 2018
Externally publishedYes
Event4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Brussels, Belgium
Duration: 1 Nov 2018 → …

Publication series

Name4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop

Conference

Conference4th Workshop on Noisy User-Generated Text, W-NUT 2018
Country/TerritoryBelgium
CityBrussels
Period1/11/18 → …

Fingerprint

Dive into the research topics of 'Orthogonal Matching Pursuit for Text Classification'. Together they form a unique fingerprint.

Cite this