Comparing LSTM and FOFE-based Architectures for Named Entity Recognition

Research output: Contribution to conference › Paper, not in proceeding

Abstract

LSTM architectures (Hochreiter and Schmidhuber, 1997) have become the standard approach to named entity recognition (NER) in text (Lample et al., 2016; Chiu and Nichols, 2016). Nonetheless, Zhang et al. (2015) recently proposed an approach based on fixed-size ordinally forgetting encoding (FOFE) that translates variable-length contexts into fixed-length features. This encoding method can be used with feed-forward neural networks and, despite its simplicity, reaches accuracy rates matching those of LSTMs in NER tasks (Xu et al., 2017). However, the figures reported in NER articles are difficult to compare precisely, as the experiments often rely on external resources such as gazetteers and corpora. In this paper, we describe an experimental setup in which we reimplemented the two core algorithms to level out the differences in initial conditions. This allowed us to measure the accuracy of both architectures more precisely and to report what we believe are unbiased results on English and Swedish datasets.
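
As background for the comparison, the FOFE scheme referenced above can be summarized in a few lines. The sketch below is not the authors' implementation; it is a minimal illustration, assuming a one-hot vocabulary representation and a hypothetical forgetting factor alpha, of how a variable-length context is folded into a fixed-length vector that a feed-forward network can consume (z_t = alpha * z_{t-1} + e_t, following Zhang et al., 2015).

```python
import numpy as np

def fofe_encode(token_ids, vocab_size, alpha=0.5):
    """Fixed-size ordinally forgetting encoding of a token sequence.

    Illustrative sketch only: encodes a variable-length list of token ids
    into one vector of length `vocab_size` via z_t = alpha * z_{t-1} + e_t,
    where e_t is the one-hot vector of the t-th token and 0 < alpha < 1.
    """
    z = np.zeros(vocab_size)
    for tid in token_ids:
        z *= alpha        # decay the contribution of earlier tokens
        z[tid] += 1.0     # add the one-hot vector of the current token
    return z

# Example: a left context of three tokens over a toy five-word vocabulary.
# The earliest token is weighted by alpha**2, the latest by 1.0.
vector = fofe_encode([0, 2, 4], vocab_size=5, alpha=0.5)
```
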

Details

Authors
Organisations
Research areas and keywords

Subject classification (UKÄ)

  • Language Technology (Computational Linguistics)

Keywords

  • LSTM, FOFE, Named Entity Recognition
Original language: English
Publication status: Published - 2018 Nov 7
Publication category: Research
Peer-reviewed: Yes
Event: Seventh Swedish Language Technology Conference: Third National Swe-Clarin Workshop: Making ends meet - Stockholm, Sweden
Duration: 2018 Nov 7 - 2018 Nov 9
Conference number: 7
http://sltc2018.su.se/

Conference

Conference: Seventh Swedish Language Technology Conference
Abbreviated title: SLTC 2018
Country: Sweden
City: Stockholm
Period: 2018/11/07 - 2018/11/09
Internet address: http://sltc2018.su.se/
