Sammanfattning
Semantic role labeling has become a key module of many language processing applications. To build an unrestricted semantic
role labeler, the first step is to develop a comprehensive proposition bank. However, building such a bank is a costly enterprise,
which has only been achieved for a handful of languages. In this paper, we describe a technique to build proposition banks
for new languages using distant supervision. Starting from PropBank in English and loosely parallel corpora such as versions
of Wikipedia in different languages, we carried out a mapping of semantic propositions we extracted from English to syntactic
structures in Swedish using named entities. We could identify 2,333 predicate–argument frames in Swedish.
role labeler, the first step is to develop a comprehensive proposition bank. However, building such a bank is a costly enterprise,
which has only been achieved for a handful of languages. In this paper, we describe a technique to build proposition banks
for new languages using distant supervision. Starting from PropBank in English and loosely parallel corpora such as versions
of Wikipedia in different languages, we carried out a mapping of semantic propositions we extracted from English to syntactic
structures in Swedish using named entities. We could identify 2,333 predicate–argument frames in Swedish.
Originalspråk | engelska |
---|---|
Status | Published - 2014 |
Evenemang | The Fifth Swedish Language Technology Conference (SLTC 2014) - Uppsala, Sverige Varaktighet: 2014 nov. 13 → 2014 nov. 14 |
Konferens
Konferens | The Fifth Swedish Language Technology Conference (SLTC 2014) |
---|---|
Land/Territorium | Sverige |
Ort | Uppsala |
Period | 2014/11/13 → 2014/11/14 |
Ämnesklassifikation (UKÄ)
- Datavetenskap (datalogi)