Research output per year
Research output per year
Research output: Chapter in Book/Report/Conference proceeding › Paper in conference proceeding › peer-review
Named entity linking is the task of identifying mentions of named things in text, such as “Barack Obama” or “New York”, and linking these mentions to unique identifiers. In this paper, we describe Hedwig, an end-to-end named entity linker, which uses a combination of word and character BILSTM models for mention detection, a Wikidata and Wikipedia-derived knowledge base with global information aggregated over nine language editions, and a PageRank algorithm for entity linking. We evaluated Hedwig on the TAC2017 dataset, consisting of news texts and discussion forums, and we obtained a final score of 59.9% on CEAFmC+, an improvement over our previous generation linker Ugglan, and a trilingual entity link score of 71.9%.
Original language | English |
---|---|
Title of host publication | LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings |
Editors | Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis |
Publisher | European Language Resources Association |
Pages | 4501-4508 |
Number of pages | 8 |
ISBN (Electronic) | 9791095546344 |
Publication status | Published - 2020 |
Event | 12th International Conference on Language Resources and Evaluation, LREC 2020 - Marseille, France Duration: 2020 May 11 → 2020 May 16 |
Name | LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings |
---|
Conference | 12th International Conference on Language Resources and Evaluation, LREC 2020 |
---|---|
Country/Territory | France |
City | Marseille |
Period | 2020/05/11 → 2020/05/16 |
Research output: Thesis › Doctoral Thesis (compilation)