TY - GEN
T1 - Set labelling using multi-label classification
AU - Ngurah Agus Sanjaya, E. R.
AU - Abdessalem, Talel
AU - Read, Jesse
AU - Bressan, Stéphane
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/11/19
Y1 - 2018/11/19
N2 - We propose the task of set labelling. Starting from some examples members of a set, set labelling tries to infer the most appropriate labels for the given set. For this work, we consider sets of words. We illustrate the task and a possible solution with an application to the classification of cosmetic products and hotels. The novel solution proposed in this research is to incorporate a multi-label classifier trained from the labeled datasets. We use vectorization of the description of the seeds as input to the classifier as well as labels assigned to it. Given a previously unseen data, the trained classifier returns a ranked list of candidate labels (i.e., additional seeds) for the set. These results could then be used to infer the labels for the set. We implement our proposed solution to the classification of cosmetic products and hotels. We show that the solution is effective and efficient.
AB - We propose the task of set labelling. Starting from some examples members of a set, set labelling tries to infer the most appropriate labels for the given set. For this work, we consider sets of words. We illustrate the task and a possible solution with an application to the classification of cosmetic products and hotels. The novel solution proposed in this research is to incorporate a multi-label classifier trained from the labeled datasets. We use vectorization of the description of the seeds as input to the classifier as well as labels assigned to it. Given a previously unseen data, the trained classifier returns a ranked list of candidate labels (i.e., additional seeds) for the set. These results could then be used to infer the labels for the set. We implement our proposed solution to the classification of cosmetic products and hotels. We show that the solution is effective and efficient.
KW - Classification
KW - Multi-label
KW - Set labelling
U2 - 10.1145/3282373.3282391
DO - 10.1145/3282373.3282391
M3 - Conference contribution
AN - SCOPUS:85061138399
T3 - ACM International Conference Proceeding Series
SP - 216
EP - 220
BT - 20th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2018 - Proceedings
A2 - Anderst-Kotsis, Gabriele
A2 - Pardede, Eric
A2 - Steinbauer, Matthias
A2 - Indrawan-Santiago, Maria
A2 - Salvadori, Ivan Luiz
A2 - Salvadori, Ivan Luiz
A2 - Khalil, Ismail
PB - Association for Computing Machinery
T2 - 20th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2018
Y2 - 19 November 2018 through 21 November 2018
ER -