Entrepreneurship in the Age of Big Data: A Researcher's Guide to Data Mining, Inference and Prediction

Forskningsoutput: Working paper


The bottleneck of Big Data today is the analysis of large amounts of information, including data mining, inference and prediction. Entrepreneurship researchers who want to take advantage of Big Data, need tools and workflows to find important trends and patterns within massive data sets, and understand "what the data tell us."
Here, we apply gradient boosting to big, register-based data, and establish the intensity of
individual-level risk factors to predict entrepreneurship entry. We find structural differences
between unincorporated and incorporated entry, and test two separate prediction trees: we
correctly predict 20.4% of incorporated entries, using only six risk factors. Data mining
techniques, like gradient boosting, offer unique opportunities for entrepreneurship researchers to use objective methods and learn from the data, prior to model inference and prediction.


  • Frederik Witte
  • Alan R. Johnson
Enheter & grupper

Ämnesklassifikation (UKÄ) – OBLIGATORISK

  • Företagsekonomi
StatusUnpublished - 2015