English dictionaries, gold and silver standard corpora for biomedical natural language processing related to SARS-CoV-2 and COVID-19

Research output: Other contribution › Miscellaneous

BibTeX

@misc{ec656521c625426d8ef9e93344481819,
title = "English dictionaries, gold and silver standard corpora for biomedical natural language processing related to SARS-CoV-2 and COVID-19",
abstract = "Here we present a toolbox for natural language processing tasks related to SARS-CoV-2. It comprises English dictionaries of synonyms for SARS-CoV-2 and COVID-19, a silver standard corpus generated with the dictionaries, and a gold standard corpus of 10 PubMed abstracts manually annotated for disease, virus, symptom and protein/gene terms. This toolbox is freely available on GitHub and can be used for text analytics in a variety of settings related to the COVID-19 crisis. It will be expanded and applied in NLP tasks over the coming weeks, and the community is invited to contribute.",
keywords = "SARS-CoV-2, COVID-19, Text mining, BioNLP, Artificial Intelligence",
author = "{Kazemi Rashed}, Salma and Johan Frid and Lim, {Jong Chan} and Sonja Aits",
year = "2020",
month = mar,
day = "22",
language = "English",
series = "arXiv",
publisher = "Cornell University Library",
type = "Other",

}