01/12/2016 Computer Science Medicine
DOI: 10.3389/fpubh.2016.00248 SemanticScholar ID: 16393394 MAG: 2557659274

The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap

Publication Summary

Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating health-care delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit “big data.”

CAER Authors

Avatar Image for Faisal Mushtaq

Dr. Faisal Mushtaq

University of Leeds - Associate Professor in Cognitive Neuroscience

Avatar Image for Mark Mon-Williams

Prof. Mark Mon-Williams

University of Leeds - Chair in Cognitive Psychology

Share this

Next publication

2009 Psychology

The Dynamics of Category Conjunctions

R. Hutter, R. Crisp, G. Humphreys, Gillian. M. Waters + 1 more