Title :
Crowdsourcing Predictors of Behavioral Outcomes
Author :
Bongard, Josh C. ; Hines, Paul D H ; Conger, Dylan ; Hurd, Peter ; Lu, Zhenyu
Author_Institution :
Coll. of Eng. & Math. Sci., Univ. of Vermont, Burlington, VT, USA
Abstract :
Generating models from large data sets-and determining which subsets of data to mine-is becoming increasingly automated. However, choosing what data to collect in the first place requires human intuition or experience, usually supplied by a domain expert. This paper describes a new approach to machine science which demonstrates for the first time that nondomain experts can collectively formulate features and provide values for those features such that they are predictive of some behavioral outcome of interest. This was accomplished by building a Web platform in which human groups interact to both respond to questions likely to help predict a behavioral outcome and pose new questions to their peers. This results in a dynamically growing online survey, but the result of this cooperative behavior also leads to models that can predict the user´s outcomes based on their responses to the user-generated survey questions. Here, we describe two Web-based experiments that instantiate this approach: The first site led to models that can predict users´ monthly electric energy consumption, and the other led to models that can predict users´ body mass index. As exponential increases in content are often observed in successful online collaborative communities, the proposed methodology may, in the future, lead to similar exponential rises in discovery and insight into the causal factors of behavioral outcomes.
Keywords :
behavioural sciences computing; data mining; groupware; peer-to-peer computing; social networking (online); Web platform; Web-based experiment; behavioral outcome; causal factor; cooperative behavior; crowdsourcing predictor; data collection; data mining; dynamically growing online survey; human experience; human group interaction; human intuition; machine science; nondomain expert; online collaborative community; peer questions; user body mass index prediction; user monthly electric energy consumption prediction; user outcome prediction; user-generated survey question; Data models; Electricity; Energy consumption; Humans; Predictive models; Water heating; Web sites; Crowdsourcing; human behavior modeling; machine science; social media; surveys;
Journal_Title :
Systems, Man, and Cybernetics: Systems, IEEE Transactions on
DOI :
10.1109/TSMCA.2012.2195168