Entrepreneurship in the Age of Big Data: A Researcher's Guide to Data Mining, Inference and Prediction

Frederik Witte, Alan R. Johnson

Research output: Working paper/PreprintWorking paper

Abstract

The bottleneck of Big Data today is the analysis of large amounts of information, including data mining, inference and prediction. Entrepreneurship researchers who want to take advantage of Big Data, need tools and workflows to find important trends and patterns within massive data sets, and understand "what the data tell us."
Here, we apply gradient boosting to big, register-based data, and establish the intensity of
individual-level risk factors to predict entrepreneurship entry. We find structural differences
between unincorporated and incorporated entry, and test two separate prediction trees: we
correctly predict 20.4% of incorporated entries, using only six risk factors. Data mining
techniques, like gradient boosting, offer unique opportunities for entrepreneurship researchers to use objective methods and learn from the data, prior to model inference and prediction.
Original languageEnglish
Publication statusUnpublished - 2015

Subject classification (UKÄ)

  • Business Administration

Fingerprint

Dive into the research topics of 'Entrepreneurship in the Age of Big Data: A Researcher's Guide to Data Mining, Inference and Prediction'. Together they form a unique fingerprint.

Cite this