TY - GEN
T1 - Cat2Type
T2 - 11th ACM International Conference on Knowledge Capture, K-CAP 2021
AU - Biswas, Russa
AU - Sofronova, Radina
AU - Sack, Harald
AU - Alam, Mehwish
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/12/2
Y1 - 2021/12/2
N2 - The entity type information in Knowledge Graphs (KGs) such as DBpedia, Freebase, etc. is often incomplete due to automated generation. Entity Typing is the task of assigning or inferring the semantic type of an entity in a KG. This paper introduces an approach named Cat2Type which exploits the Wikipedia Categories to predict the missing entity types in a KG. This work extracts information from Wikipedia Category names and the Wikipedia Category graph which are the sources of rich semantic information about the entities. In Cat2Type, the characteristic features of the entities encapsulated in Wikipedia Category names are exploited using Neural Language Models. On the other hand, a Wikipedia Category graph is constructed to capture the connection between the categories. The Node level representations are learned by optimizing the neighbourhood information on the Wikipedia category graph. These representations are then used for entity type prediction via classification. The performance of Cat2Type is assessed on two real-world benchmark datasets DBpedia630k and FIGER. The experiments depict that Cat2Type obtained a significant improvement over state-of-the-art approaches.
AB - The entity type information in Knowledge Graphs (KGs) such as DBpedia, Freebase, etc. is often incomplete due to automated generation. Entity Typing is the task of assigning or inferring the semantic type of an entity in a KG. This paper introduces an approach named Cat2Type which exploits the Wikipedia Categories to predict the missing entity types in a KG. This work extracts information from Wikipedia Category names and the Wikipedia Category graph which are the sources of rich semantic information about the entities. In Cat2Type, the characteristic features of the entities encapsulated in Wikipedia Category names are exploited using Neural Language Models. On the other hand, a Wikipedia Category graph is constructed to capture the connection between the categories. The Node level representations are learned by optimizing the neighbourhood information on the Wikipedia category graph. These representations are then used for entity type prediction via classification. The performance of Cat2Type is assessed on two real-world benchmark datasets DBpedia630k and FIGER. The experiments depict that Cat2Type obtained a significant improvement over state-of-the-art approaches.
KW - entity type prediction
KW - language models
KW - node embeddings
KW - wikipedia categories
UR - https://www.scopus.com/pages/publications/85120878834
U2 - 10.1145/3460210.3493575
DO - 10.1145/3460210.3493575
M3 - Conference contribution
AN - SCOPUS:85120878834
T3 - K-CAP 2021 - Proceedings of the 11th Knowledge Capture Conference
SP - 81
EP - 88
BT - K-CAP 2021 - Proceedings of the 11th Knowledge Capture Conference
PB - Association for Computing Machinery, Inc
Y2 - 2 December 2021 through 3 December 2021
ER -