Constructing large proposition databases

Forskningsoutput: Kapitel i bok/rapport/Conference proceedingKonferenspaper i proceeding

Abstract

With the advent of massive online encyclopedic corpora such as Wikipedia, it has become possible to apply a systematic analysis to a wide range of documents covering a significant part of human knowledge. Using semantic parsers, it has become possible to extract such knowledge in the form of propositions (predicate―argument structures) and build large proposition databases from these documents. This paper describes the creation of multilingual proposition databases using generic semantic dependency parsing. Using Wikipedia, we extracted, processed, clustered, and evaluated a large number of propositions. We built an architecture to provide a complete pipeline dealing with the input of text, extraction of knowledge, storage, and presentation of the resulting propositions

Detaljer

Författare
Enheter & grupper
Forskningsområden

Ämnesklassifikation (UKÄ) – OBLIGATORISK

  • Datavetenskap (datalogi)

Nyckelord

Originalspråkengelska
Titel på värdpublikationProceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)
FörlagEuropean Language Resources Association (ELRA)
Sidor3836-3839
ISBN (elektroniskt) 978-2-9517408-7-7
StatusPublished - 2012
PublikationskategoriForskning
Peer review utfördJa
EvenemangThe eighth international conference on Language Resources and Evaluation (LREC 2012) - Istanbul, Turkiet
Varaktighet: 2012 maj 212012 maj 27

Konferens

KonferensThe eighth international conference on Language Resources and Evaluation (LREC 2012)
LandTurkiet
OrtIstanbul
Period2012/05/212012/05/27

Nedladdningar

Ingen tillgänglig data