<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The CrowdFlower Blog &#187; Economics</title>
	<atom:link href="http://blog.crowdflower.com/tag/economics/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.crowdflower.com</link>
	<description></description>
	<lastBuildDate>Tue, 10 Jan 2012 20:00:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Designing Incentives for Crowdsourcing Workers</title>
		<link>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/</link>
		<comments>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/#comments</comments>
		<pubDate>Tue, 24 May 2011 19:19:45 +0000</pubDate>
		<dc:creator>Aaron Shaw</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Human Behavior]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[behavior]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[incentives]]></category>
		<category><![CDATA[motivation]]></category>
		<category><![CDATA[social science]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2572</guid>
		<description><![CDATA[In a recent paper, presented at the ACM Conference on Computer Supported Cooperative Work (CSCW), John Horton, Daniel Chen and I used a large-scale experiment to test the effect of different incentive schemes on the quality of crowdsourcing work. The results surprised us. They suggest that workers perform most accurately when the task design credibly [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/" data-text="Designing Incentives for Crowdsourcing Workers" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/" data-counter="top"></script></div></div><p>In a <a title="Designing Incentives for Inexpert Human Raters, Berkman Center" href="http://cyber.law.harvard.edu/publications/2011/Designing_Incentives_Inexpert_Human_Raters">recent paper</a>, presented at the ACM Conference on Computer Supported Cooperative Work (CSCW), <a title="John Horton, oDesk" href="https://sites.google.com/site/johnjosephhorton/">John Horton</a>, <a title="Daniel Chen, Duke Law School" href="http://www.law.duke.edu/fac/chen">Daniel Chen</a> and <a title="Aaron Shaw, UC Berkeley &amp; Harvard" href="http://aaronshaw.org">I</a> used a large-scale experiment to test the effect of different incentive schemes on the quality of crowdsourcing work.</p>
<p>The results surprised us. They suggest that workers perform most accurately when the task design credibly links payoffs to a worker&#8217;s ability to think about the answers that their peers are likely to provide.</p>
<p style="text-align: center;">
<div id="attachment_2577" class="wp-caption aligncenter" style="width: 549px"><a href="http://www.flickr.com/photos/iyoupapa/"><img class="size-full wp-image-2577 " title="Horserace!" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/3757438159_horserace-iyoupapa-altered.jpg" alt="Horserace!" width="539" height="264" /></a><p class="wp-caption-text">a horserace experiment! (photo cc-by-sa by iyoupapa)</p></div>
<p><span id="more-2572"></span></p>
<p>The idea for this study came out of our sense that, as social scientists, we had something unique to offer the existing research on human computation. <a title="AMT is fast, cheap, and good for machine learning data" href="http://blog.crowdflower.com/2008/09/amt-fast-cheap-good-machine-learning/">Early</a> and <a title="&quot;Get Another Label?&quot; Ipeirotis et al. 2008" href="http://archive.nyu.edu/handle/2451/25882">influential</a> crowdsourcing research has focused on how to filter the judgments of the crowd to find the best answers. We wanted to know whether simple task-design changes could improve the quality of data coming into a crowdsourcing system in the first place.</p>
<p>To test this idea, we chose 14 different incentive schemes and framing techniques developed and validated across the social sciences and set up a horse race experiment to see which schemes/techniques would work best.</p>
<p>Consistent with our personal biases (John and Daniel are both economists, and I&#8217;m a sociologist), some of the schemes were financially oriented, some were social or psychological, and some were hybrids combining social and financial incentives. The details of all the schemes are included <a title="Designing Incentives for Inexpert Human Raters" href="http://cyber.law.harvard.edu/publications/2011/Designing_Incentives_Inexpert_Human_Raters">in the paper</a> (it&#8217;s a long list, and some of them are kind of involved), but it&#8217;s worth giving some examples.</p>
<p>On the financial end of the incentives spectrum, we had one condition we called &#8220;reward-accuracy,&#8221; which was pretty much what you&#8217;d expect: we told workers, &#8220;we&#8217;ll pay you a bonus if you get the answers right.&#8221; We also had one called &#8220;punishment-accuracy,&#8221; the gist of which you can deduce. On the purely social-psychological side, we had one we called &#8220;trust,&#8221; in which we told workers, &#8220;we&#8217;ll pay you for this job no matter how bad your performance, we trust that you&#8217;ll still make your best effort.&#8221;</p>
<p>One of the weirdest schemes turns out to be important, so I need to explain that one. Called &#8220;Bayesian Truth Serum&#8221; (BTS), it incorporates a design from the work of <a title="Drazen Prelec" href="http://econ-www.mit.edu/faculty/dprelec">Drazen Prelec</a>, a behavioral economist at MIT, who realized that research subjects could probably provide useful information regarding the expected distribution for subjective, qualitative questions (<em>nb</em>, the mechanics of how he does this are arcane in a way that is almost sure to delight the geeks among you, so I encourage you to <a title="Bayesian Truth Serum" href="http://econ-www.mit.edu/files/1966">read his paper</a>). Few of the details of <em>real</em> BTS are important, except that we incorporated the piece about asking workers to answer the questions themselves <em>and predict the distribution of other workers&#8217; responses</em>. We also told them we&#8217;d give them a bonus if their predictions were correct.</p>
<p>We then created a task that asked workers to answer five questions. In this case, the questions were drawn from another study examining participatory features of websites, for which we already possessed validated data collected by research assistants.</p>
<p>All workers answered the same five questions about the same website (<a href="http://www.kiva.org">www.kiva.org</a>) while being exposed to one and only one of the 14 incentive schemes (or a control condition of no scheme). Roughly 2,000 individuals participated in the study, resulting in over 100 subjects in each of the experimental conditions. (The statistics and science nerds out there will be pleased to know that both the drop-out rate and demographic covariates were distributed evenly across conditions.)</p>
<p>To measure worker performance, we used the research assistant responses as correct answers to the questions and then calculated the total number of matching answers (out of five) provided by each worker. The results (aggregated across all treatments) are plotted in a histogram below and show that the average worker answered just over two questions out of five correctly.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/aggperf/" rel="attachment wp-att-2578"><img class="aligncenter size-full wp-image-2578" title="Inexpert raters - Aggregate Performance" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/AggPerf.png" alt="Aggregate performance histogram" width="280" height="280" /></a></p>
<p>&nbsp;</p>
<p>Then, in order to see how the treatments compared against each other relative to the control group, we calculated the mean correct response rate for each condition and conducted difference of means tests to see which of these means were significantly greater than the control group. The results of this comparison appear below (in a new plot that doesn&#8217;t even appear in the paper!):</p>
<p><a href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/inexpert-itt/" rel="attachment wp-att-2579"><img class="aligncenter size-full wp-image-2579" title="inexpert raters - ITT estimates" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/inexpert-ITT.png" alt="ITT estimates per treatment" width="500" height="500" /></a></p>
<p>The orange dots show the value of the mean in each condition, and the blue bars illustrate the 95% confidence interval around that mean. The treatments are sorted by the size of the difference in means from the control. (More hard-core nerd stuff: the means are adjusted using Intent-To-Treat estimators).</p>
<p>From these results, we concluded that our horse race had two clear front-runners: the &#8220;Bayesian Truth Serum&#8221; (BTS) and &#8220;Punishment &#8211; disagreement&#8221; conditions, each of which improved average worker performance by almost half of a correct answer above the 2.08 correct answers in the control group. A few of the other financial and hybrid incentives had fairly large point estimates, but were not significantly different from control once we adjusted the test statistics and corresponding p-values to account for the fact that we were making so many comparisons at once (apologies if this doesn&#8217;t make sense — it&#8217;s yet another precautionary measure to avoid upsetting the stats nerds among you). In a tough turn for the sociologists and psychologists, none of the purely social/psychological treatments had any signficant effects at all.</p>
<p>Why do BTS and punishing workers for disagreement succeed in improving performance significantly where so many of the other incentive schemes failed? The answer hinges on the fact that both conditions tied workers&#8217; payoffs to their ability to think about their peers&#8217; likely responses. (We elaborate on the argument in more detail in the paper.)</p>
<p>Does this mean that we should give up on simple financial or social-psychological incentives? Probably not. The fact that we conducted the experiment on MTurk means that the deck may have been stacked against incentives like the &#8220;trust&#8221; condition I described earlier. Because requesters on MTurk have little oversight, workers are more likely to respond to financial incentives than stated promises. In this sense, the marketplace has structured the interaction between workers and requesters in a way that may limit the opportunities to harness motivations that are not linked to money in some explicit way.</p>
<p>You can <a title="Designing Incentives for Inexpert Human Raters" href="http://cyber.law.harvard.edu/sites/cyber.law.harvard.edu/files/Shaw-Horton-Chen_Designing_Incentives_Inexpert_Human_Raters_2011.pdf">download the full paper</a> to read more.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Breaking Monotony with Meaning: Motivation in Crowdsourcing Markets</title>
		<link>http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/</link>
		<comments>http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/#comments</comments>
		<pubDate>Sun, 23 May 2010 22:06:54 +0000</pubDate>
		<dc:creator>Lukas Biewald</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Motivation]]></category>
		<category><![CDATA[motivation]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=531</guid>
		<description><![CDATA[This is a guest post written by my friend Dana Chandler on how the context of a task motivates the person working on it.  He has a longer academic paper on the topic you can find at the bottom of this post.  It once again shows how traditional economic incentives can&#8217;t fully explain workers&#8217; behaviors [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/" data-text="Breaking Monotony with Meaning: Motivation in Crowdsourcing Markets" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/"></g:plusone></div></div><p>This is a guest post written by my friend Dana Chandler on how the context of a task motivates the person working on it.  He has a longer academic paper on the topic you can find at the bottom of this post.  It once again shows how traditional economic incentives can&#8217;t fully explain workers&#8217; behaviors on Mechanical Turk.</p>
<p><img src="http://assets.doloreslabs.com/blog/dana_chandler.jpg" alt="" /></p>
<p>Imagine for a moment that you were a turker from either the US or India, looking at the above image. You are given the task of clicking on the blue circular objects with red borders. What you see is only a fraction of the full image. Each image has 90 blue objects to identify. If you’re as good as the average worker, you’ll complete your first image in a little over five minutes and you’ll earn 10 cents (for an hourly wage of $1.20).</p>
<p><span id="more-531"></span></p>
<p>After your first image, you can either quit and take your 10 cents, or identify points on another image. Over the next four hours, you’ll have the chance to label as many images you want. But there’s a catch—you’ll only be paid 9 cents for the second image, 8 cents for the third, and so on, all the way down to 2 cents. This will lower the hourly wage even more.</p>
<p><!--more--></p>
<p>Before you even qualify for the task, you&#8217;ll have to spend five minutes watching a training video and passing a quiz. During the video, half of you will be given only basic work instructions on how to identify “objects of interest.” The other half will be given both instructions and cues of meaning: recognition for your contribution and an explanation of your task&#8217;s purpose<sup>1</sup>. The reason given here? To help researchers identify cancerous tumor cells.</p>
<p>We posted these HITs on MTurk in January, 2010. Almost 300 people from the U.S. and India accepted the task, becoming unknowing participants in our experiment examining MTurk worker motivation. It is commonly believed (and other researchers have verified with demographic surveys) that Indian workers are more motivated by pecuniary concerns and that US turkers are primarily doing tasks for leisure or other non-pecuniary motives. Is this true?</p>
<p>In both countries, half of the turkers in the experiment were randomly assigned to label nondescript &#8220;objects of interest&#8221; without being given any context or greater purpose &#8212; they were our zero-context group. The other half, our meaningful group, were told they were helping researchers identify cancerous tumor cells. Which group of turkers do you think worked harder? You might be surprised.</p>
<p>Therefore, our experiment compared two groups with and without a clear wage motivation, to see if workers behave differently responded to meaningfulness in their tasks.</p>
<p><strong>Results</strong></p>
<p>We measured three metrics: &#8220;showing up&#8221;, the quantity of work, and the quality of that work. The first two metrics are straightforward. Showing up meant that you sat through our training video, passed our qualification test and helped label at least one image. Quantity of work was simply the number of images labeled.</p>
<p>We repeatedly told both groups of turkers that they needed to click on all points and as closely as possible to each point. Work quality was determined by the fraction of cells that a person clicked on (the recall) and the average distance between the “true center” of each cell and where the user clicked (the centrality).</p>
<p>Our most interesting finding was the extent to which a meaningful task (and giving recognition) motivated US workers, but not Indian workers, to complete a task. As any requester knows, attrition on MTurk is a real problem. We found that adding cues of meaning could motivate turkers to undergo training and label at least one image. In the US, adding cues-of-meaning raised the fraction of turkers who completed our task from, 92% of people who sat through our training video, took our quiz, and labeled an image showed up. This figure compares to only 83% of zero-context group (see figure which also has standard errors). In India, there was no difference between the groups and both groups had a 66% completion rate (attrition being higher due to possible language barriers, slow connection speeds, hardware issues, etc.).</p>
<p>However, once a person did some work, both treatment and control groups did a similar quantity of work: The cues-of-meaning group labeled 6.0 images and the zero-context group labeled 5.7 images. This difference was not statistically significant, so it suggests that once you get turkers to work on a task, they are motivated to label just as many images irrespective of the task’s meaningfulness. Notably, of the people who worked, Indians worked longer and labeled an average of 7.3 images vs. 5.2 in the US.</p>
<p>Surprisingly, all workers did an equally good job identifying points whether they had zero-context or whether they thought they were identifying tumor cells. The quality as measured by the fraction of points identified (the recall) or the average pixel distance (the centrality) was statistically insignificant irrespective of the task&#8217;s meaningfulness.</p>
<p>This finding has important implications for those who employ labor in crowdsourcing markets. Companies and intermediaries should develop an understanding of what motivates the people who work on tasks. Employers must think beyond monetary incentives and consider how they can reward workers through non-monetary incentives such as by changing how workers perceive their task. Alienated workers are less likely to do work if they don&#8217;t know the context of the work they are doing and employers may find they can get more work done for the same wages simply by telling turkers why they are working.</p>
<p><img src="http://assets.doloreslabs.com/blog/dana_chandler2.jpg" alt="" /></p>
<p>For more details of this study, please see our full academic paper at: </span><a href="http://danachandler.com/research" target="_blank"><span style="text-decoration: underline;">http://danachandler.com/index.php/research</span></a>. We welcome any comments and feedback.</p>
<p><span style="text-decoration: underline;">About the authors:<br />
</span>Dana Chandler is a researcher at the University of Chicago’s Becker Center where he works with Steven Levitt, author of Freakonomics. He previously worked as a management consultant at the Boston Consulting Group and at Aureos Capitol, a Colombian private equity company. He will begin his Ph.D. at MIT in the Fall.  Dana’s research interests include digital labor markets, development economics, and randomized experiments in companies. email: dchandler {at} uchicago {dot} edu</p>
<p>Adam Kapelner is currently earning his Ph.D. in Statistics at Wharton. Adam is the founder of  <a href="http://dictionarysquared.com" target="_blank">dictionarysquared.com</a> and the inventor of its vocabulary-learning  technology. While working as an undergraduate researcher at Stanford University, he helped engineer the open-source software, <a href="http://gemident.com" target="_blank">www.gemIdent.com</a>, that enables researchers worldwide to locate cells in microscopic images. GemIdent was recently extended to make use of MTurk for outsourcing of medical image identification. The extension, called <a href="http://distributeeyes.com" target="_blank">www.distributeeyes.com</a>, was  adapted to serve as the platform for this experiment. email: kapelner  {at} wharton {dot} upenn {dot} edu</p>
<p><span style="text-decoration: underline;">Acknowledgments:</span> We thank Professor Susan Holmes of Stanford University for allowing us to adopt DistributeEyes (funded under NIH grant #R01GM086884-02) for use in this study. We would also like to thank Panos Ipeirotis for kindly providing us with demographic and market data that we cite in our study. Lawrence Brown, Patrick DeJarnette, John Horton, Emir Kamenica, Steven Levitt, Susanne Neckermann, Jesse Shapiro, Jorg Spenkuch, Jan Stoop, Chad Syverson, Mike Thomas, Abraham Wyner, and seminar participants at the University of Chicago provided especially helpful comments. </span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2010/05/breaking-monotony-with-meaning-motivation-in-crowdsourcing-markets/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Case for Online Experimentation</title>
		<link>http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/</link>
		<comments>http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/#comments</comments>
		<pubDate>Sat, 01 May 2010 15:13:58 +0000</pubDate>
		<dc:creator>John Horton</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[experimentation]]></category>
		<category><![CDATA[self-promotion]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/</guid>
		<description><![CDATA[Online labor markets dramatically lower the cost and hassle of conducting experiments. On Amazon&#8217;s Mechanical Turk, it is easy to run multiple experiments per week. Figuring out how to run experiments isn&#8217;t that hard, as there are already some nice tutorials available. However, what I felt was missing from the field was a discussion of [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/" data-text="The Case for Online Experimentation" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/"></g:plusone></div></div><p>Online labor markets dramatically lower the cost and hassle of conducting experiments. On Amazon&#8217;s Mechanical Turk, it is easy to run multiple experiments per week. Figuring out how to run experiments isn&#8217;t that hard, as there are already some nice  <a href="http://www.decisionsciencenews.com/2009/12/17/how-to-run-experiments-on-mechanical-turk/">tutorials available</a>.      </p>
<p>However, what I felt was missing from the field was a discussion of why, precisely, we can trust results from online experiments. This was the motivation for a new paper, jointly written with <a>Dave Rand</a> (who wrote up part  of this study <a href="http://blog.crowdflower.com/2010/01/altruism-on-amazon-mechanical-turk/">here</a> on the Dolores Labs blog) and <a href="http://www.hks.harvard.edu/fs/rzeckhau/">Richard Zeckhauser</a>. </p>
<p><a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1591202">You can download the paper here</a>. </p>
<p><span id="more-228"></span></p>
<p>While we make the practical and theoretical case for online experimentation, we believe that acceptance of online results as &#8220;valid&#8221; will come after people start seeing how easy and reliably one can replicate previous studies. This is why blogs like <a href="http://experimentalturk.wordpress.com/">Experimental Turk</a> and <a href="http://groups.csail.mit.edu/uid/deneme/">Deneme</a>&#8212;both of which report results from AMT experiments&#8212;are so helpful. In our paper, we continue this process by replicating three results that are fairly well established. </p>
<p>In one experiment for the economists, we show&#8212;contra the usual intuition&#8212;that at least some Turkers are financially motivated, despite the very low stakes. After performing an initial text transcription task, workers were offered some randomly chosen amount of money to do an additional transcription. Results show the counts of people who agreed (&#8220;Yes&#8221;) and the counts of people who did not agree (&#8220;No&#8221;), by amount offered.    </p>
<p><a href='http://blog.crowdflower.com/wp-content/uploads/2010/05/ppl_and_money.png' title='Turkers and Money'><img src='http://blog.crowdflower.com/wp-content/uploads/2010/05/ppl_and_money.png' alt='Turkers and Money' /></a></p>
<p>Nothing too surprising&#8212;offer to pay more and more workers will accept&#8212;but at this stage in the development of online experiments as a methodology, &#8220;surprising&#8221; would probably be bad news. </p>
<p>Anyway, the full paper is <a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1591202">here</a>. We&#8217;d love to get comments and feedback&#8212;it&#8217;s not too late to earn a place in our coveted &#8220;thanks&#8221; footnote!  </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2010/05/the-case-for-online-experimentation/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Why People Participate on Mechanical Turk, Now as a Mosaic Plot</title>
		<link>http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/</link>
		<comments>http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/#comments</comments>
		<pubDate>Sat, 27 Feb 2010 13:48:08 +0000</pubDate>
		<dc:creator>John Horton</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Human Behavior]]></category>
		<category><![CDATA[motivation]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/</guid>
		<description><![CDATA[&#8220;Who are these people?&#8221; and &#8220;Why do they participate?&#8221; are two perennial questions about AMT. Askers are generally incredulous that AMT workers are willing to do rather tedious tasks for small amounts of money. To investigate this question of motivation, NYU Prof. Panos Ipeirotis asked a bunch of workers their reasons and tabulated the responses [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/" data-text="Why People Participate on Mechanical Turk, Now as a Mosaic Plot" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/"></g:plusone></div></div><p>&#8220;Who are these people?&#8221; and &#8220;Why do they participate?&#8221; are two perennial questions about AMT. Askers are generally incredulous that AMT workers are willing to do rather tedious tasks for small amounts of money.  </p>
<p>To investigate this question of motivation, NYU Prof. Panos Ipeirotis asked a bunch of workers their reasons and tabulated the responses <a href="http://behind-the-enemy-lines.blogspot.com/2008/09/why-people-participate-on-mechanical.html">here</a>. His two posts are actually on the syllabus for a <a href="http://bit.ly/c94nJE">course</a> at Stanford (incidentally the course is taught by one of the creators of <a href="http://vis.stanford.edu/protovis/">Protovis</a>, which is very cool and is on my list of things to learn). There is also this amusing <a href="http://waxy.org/2008/11/the_faces_of_mechanical_turk/">investigation</a>.    </p>
<p><span id="more-207"></span></p>
<p>For a joint project with <a href="http://www.people.fas.harvard.edu/~drand/">Dave Rand</a> and <a href="http://www.hks.harvard.edu/about/faculty-staff-directory/richard-zeckhauser">Richard Zeckhauser</a>, we asked ~ 400 AMT workers both (a) where they are from and (b) the primary reason they participate on AMT. Because economic opportunities differ by country, we might expect that motivation and behavior should also differ by country. The cross tabulation plot is below (reasons are in the &#8220;rows&#8221;, countries in the &#8220;columns&#8221;&#8211;the size of each rectangle is proportional to the number of responses in that cell):</p>
<p><a href='http://blog.crowdflower.com/wp-content/uploads/2010/02/country_motivation.png' title='country_motivation.png'><img src='http://blog.crowdflower.com/wp-content/uploads/2010/02/country_motivation.png' alt='country_motivation.png' /></a></p>
<p>Two things to note:<br />
1) Money is a big motivation for everyone<br />
2) Money aside, people from India are there to learn; people from the US are there to have fun</p>
<p>Although the India/US differences are consistent with the different-countries/different-motivations hypothesis, the most relevant fact is the unconditional importance of money.    </p>
<p>While these findings seem reasonable, I feel compelled to make the standard reliability critique of self-reported data. Our learning/fun AMT workers might also be there for the money, but feel sheepish about saying so. Though this could go the other way as well I suppose: if, for example, a worker has an intrinsic love of image captioning but finds this passion shameful, they might report that they are in it for the money. But this seems less likely than the other scenario of downplaying financial motivations.  </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2010/02/why-people-participate-on-mechanical-turk-now-as-a-mosaic-plot/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

