Character and subword-based word representation for neural language modeling prediction

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most of neural language models use different kinds of embeddings for word prediction. While word embeddings can be associated to each word in the vocabulary or derived from characters as well as factored morphological decomposition, these word representations are mainly used to parametrize the input, i.e. the context of prediction. This work investigates the effect of using subword units (character and factored morphological decomposition) to build output representations for neural language modeling. We present a case study on Czech, a morphologically-rich language, experimenting with different input and output representations. When working with the full training vocabulary, despite unstable training, our experiments show that augmenting the output word representations with character-based embeddings can significantly improve the performance of the model. Moreover, reducing the size of the output look-up table, to let the character-based embeddings represent rare words, brings further improvement.

Original languageEnglish
Title of host publicationEMNLP 2017 - 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Proceedings of the Workshop
EditorsManaal Faruqui, Hinrich Schutze, Isabel Trancoso, Yaghoobzadeh Yadollah
PublisherAssociation for Computational Linguistics (ACL)
Pages1-13
Number of pages13
ISBN (Electronic)9781945626913
Publication statusPublished - 1 Jan 2017
EventEMNLP 2017 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Copenhagen, Denmark
Duration: 7 Sept 2017 → …

Publication series

NameEMNLP 2017 - 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Proceedings of the Workshop

Conference

ConferenceEMNLP 2017 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017
Country/TerritoryDenmark
CityCopenhagen
Period7/09/17 → …

Fingerprint

Dive into the research topics of 'Character and subword-based word representation for neural language modeling prediction'. Together they form a unique fingerprint.

Cite this