Skip to main navigation Skip to search Skip to main content

Determining the Need for Multi-label Classifiers by Measuring Unexplained Covariance

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Multi-label classifiers make use of associations between labels in multi-label data to increase the accuracy of prediction. Before using a multi-label classifier, the data should be analysed to identify if there are associations between labels. If sets of independent labels are found, the data can be split in to multiple smaller data sets for analysis. Unfortunately, each label is dependent on the set of observations and so measuring label dependence is futile. What we actually seek is independence after taking the observations into account. In this article, we examine the concepts of explained and unexplained label covariance for measuring label dependence. We explore the use of a Normal copula model for modelling the label dependence/covariance and show that it is not able to measure conditional covariance directly. We then propose a new statistical model that allows direct measurement of label covariance (both constant and conditional). The model is validated using generated data and it is also used to examine the label covariance in real world data, allowing us to build simpler multi-label models.

Original languageEnglish
Title of host publicationData Science
Subtitle of host publicationFoundations and Applications - 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025, Sydney, NSW, Australia, June 10-13, 2025, Proceedings
EditorsXintao Wu, Myra Spiliopoulou, Can Wang, Vipin Kumar, Longbing Cao, Xiangmin Zhou, Guansong Pang, Joao Gama
PublisherSpringer Science and Business Media Deutschland GmbH
Pages250-261
Number of pages12
ISBN (Print)9789819682942
DOIs
Publication statusPublished - 1 Jan 2025
Event29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025 - Sydney, Australia
Duration: 10 Jun 202513 Jun 2025

Publication series

NameLecture Notes in Computer Science
Volume15875 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025
Country/TerritoryAustralia
CitySydney
Period10/06/2513/06/25

Fingerprint

Dive into the research topics of 'Determining the Need for Multi-label Classifiers by Measuring Unexplained Covariance'. Together they form a unique fingerprint.

Cite this