Marker words for negation and speculation in health records and consumer reviews

Research output: Contribution to journalArticle


Conditional random fields were trained to detect marker words for negation and speculation in two corpora belonging to two very different domains: clinical text and consumer review text. For the corpus of clinical text, marker words for speculation and negation were detected with results in line with previously reported interannotator agreement scores. This was also the case for speculation markers in the consumer review corpus, while detection of negation markers was unsuccessful in this genre. Also a setup in which models were trained on markers in consumer reviews, and applied on the clinical text genre, yielded low results. This shows that neither the trained models, nor the choice of appropriate machine learning algorithms and features, were transferable across the two text genres.


External organisations
  • Linnaeus University
  • Gavagai AB
Research areas and keywords

Subject classification (UKÄ) – MANDATORY

  • Language Technology (Computational Linguistics)
Original languageEnglish
Pages (from-to)64-69
Number of pages6
JournalCEUR Workshop Proceedings
Publication statusPublished - 2016
Publication categoryResearch
Event7th International Symposium on Semantic Mining in Biomedicine - Hasso-Plattner Institute, Potsdam, Germany
Duration: 2016 Aug 42016 Aug 5
Conference number: 7

Related projects

Carita Paradis, Andreas Kerren, Magnus Sahlgren, Kostiantyn Kucher, Maria Skeppstedt & Vasiliki Simaki

Swedish Research Council


Project: Research

View all (1)