Sammanfattning
Almost all automatic semantic role labeling (SRL) systems rely on a preliminary parsing step that derives a syntactic structure from the sentence being analyzed. This makes the choice of syntactic representation an essential design
decision. In this paper, we study the influence of syntactic representation on the performance of SRL systems. Specifically, we compare constituent-based and dependency-based representations for SRL of English in the FrameNet paradigm.
Contrary to previous claims, our results demonstrate that the systems based on dependencies perform roughly as well as those based on constituents: For the argument classification task, dependency-based systems perform
slightly higher on average, while the opposite holds for the argument identification task. This is remarkable because dependency parsers are still in their infancy while constituent parsing is more mature. Furthermore, the results show that dependency-based semantic role classifiers rely less on lexicalized features, which makes them more robust to domain changes and makes them learn more efficiently with respect to the amount of training data.
decision. In this paper, we study the influence of syntactic representation on the performance of SRL systems. Specifically, we compare constituent-based and dependency-based representations for SRL of English in the FrameNet paradigm.
Contrary to previous claims, our results demonstrate that the systems based on dependencies perform roughly as well as those based on constituents: For the argument classification task, dependency-based systems perform
slightly higher on average, while the opposite holds for the argument identification task. This is remarkable because dependency parsers are still in their infancy while constituent parsing is more mature. Furthermore, the results show that dependency-based semantic role classifiers rely less on lexicalized features, which makes them more robust to domain changes and makes them learn more efficiently with respect to the amount of training data.
Originalspråk | engelska |
---|---|
Titel på värdpublikation | Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008) |
Förlag | Association for Computational Linguistics |
Sidor | 393-400 |
Antal sidor | 8 |
Status | Published - 2008 |
Evenemang | International Conference on Computational Linguistics (Coling) - Manchester Varaktighet: 2008 aug. 18 → 2008 aug. 22 |
Konferens
Konferens | International Conference on Computational Linguistics (Coling) |
---|---|
Period | 2008/08/18 → 2008/08/22 |
Ämnesklassifikation (UKÄ)
- Datavetenskap (datalogi)