A Machine Learning Approach for Semi-Automated Search and Selection in Literature Studies

Research output: Chapter in Book/Report/Conference proceedingPaper in conference proceedingpeer-review

Abstract

Background. Search and selection of primary studies in Systematic Literature Reviews (SLR) is labour intensive, and hard to replicate and update. Aims. We explore a machine learning approach to support semi-automated search and selection in SLRs to address these weaknesses. Method. We 1) train a classi er on an initial set of papers, 2) extend this set of papers by automated search and snowballing, 3) have the researcher validate the top paper, selected by the classi er, and 4) update the set of papers and iterate the process until a stopping criterion is met. Results. We demonstrate with a proof-of-concept tool that the proposed automated search and selection approach generates valid search strings and that the performance for subsets of primary studies can reduce the manual work by half. Conclusions. Thee approach is promising and the demonstrated advantages include cost savings and replicability. e next steps include further tool development and evaluate the approach on a complete SLR.
Original languageEnglish
Title of host publicationEASE'17 Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering
Place of PublicationKarlskrona, Sweden
PublisherAssociation for Computing Machinery (ACM)
ISBN (Print)978-1-4503-4804-1
DOIs
Publication statusPublished - 2017 Jun 1
Event21st International Conference on Evaluation and Assessment in Software Engineering (EASE'17) - Karlskrona, Sweden
Duration: 2017 Jun 152017 Jun 16

Conference

Conference21st International Conference on Evaluation and Assessment in Software Engineering (EASE'17)
Country/TerritorySweden
CityKarlskrona
Period2017/06/152017/06/16

Subject classification (UKÄ)

  • Computer Sciences

Fingerprint

Dive into the research topics of 'A Machine Learning Approach for Semi-Automated Search and Selection in Literature Studies'. Together they form a unique fingerprint.

Cite this