<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The CrowdFlower Blog &#187; Miscellaneous</title>
	<atom:link href="http://blog.crowdflower.com/topics/misc/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.crowdflower.com</link>
	<description></description>
	<lastBuildDate>Tue, 10 Jan 2012 20:00:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>2011 Retrospective: Good Begets Good</title>
		<link>http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/</link>
		<comments>http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 20:00:35 +0000</pubDate>
		<dc:creator>Vaughn Hester and Lukas Biewald</dc:creator>
				<category><![CDATA[Disaster Relief]]></category>
		<category><![CDATA[Health]]></category>
		<category><![CDATA[Holidays]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Al Jazeera]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[disaster relief]]></category>
		<category><![CDATA[Pakreport]]></category>
		<category><![CDATA[ushahidi]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4640</guid>
		<description><![CDATA[One of the most exciting things about working at CrowdFlower is the ongoing discovery of the wide range of crowdsourcing applications. While our core focus is enterprise solutions, we&#8217;re also involved in a number of social innovation projects. At recent meetups and in recent blog posts, we&#8217;ve described CrowdFlower implementations that help create unprecedented social impact. Many [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/" data-text="2011 Retrospective: Good Begets Good" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/"></g:plusone></div></div><p>One of the most exciting things about working at CrowdFlower is the ongoing discovery of the wide range of crowdsourcing applications. While our core focus is enterprise solutions, we&#8217;re also involved in a number of social innovation projects. At recent meetups and in recent <a href="http://blog.crowdflower.com/2011/11/scientific-research/">blog posts</a>, we&#8217;ve described CrowdFlower implementations that help create unprecedented social impact. Many of these projects involve processing data in support of crisis relief or <a href="http://poptech.org/popcasts/fortune_and_biewald_crowdsourcing_tb_cell_annotation">public health research</a>.<a title="" href="#_ftn1">[1]</a> What we continue to see over time is that there are truly inspirational ripple effects emerging from these efforts.</p>
<p><a href="http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/crowdflower-job-67330-preview/" rel="attachment wp-att-4654"><img class="alignright size-full wp-image-4654" title="CrowdFlower/Somalia Speaks Task" src="http://blog.crowdflower.com/wp-content/uploads/2012/01/CrowdFlower-Job-67330-Preview.jpg" alt="" width="401" height="432" /></a></p>
<p>&nbsp;</p>
<div>
<p>In the fall of 2011, the Nexus for ICTs, Climate Change and Development (NICCD) project at the University of Manchester released a <a href="http://www.niccd.org/casestudies.htm">series of case studies</a> on innovative uses of technology for development. <a href="http://www.niccd.org/NICCD_Disasters_Case_Study_Pakreport.pdf">Pakreport was featured</a> as a tool for reporting among flood-affected communities; but it was also highlighted for its contributions to climate change awareness and natural disaster monitoring in a country at very high risk.</p>
<p><span id="more-4640"></span></p>
<p><em>&#8220;Crowdsourcing was critical to the success of this disaster response system,&#8221; the report says. &#8220;It was integral to the data input model, which would otherwise have relied on much more limited inputs from individual relief agency workers.&#8221;</em></p>
<p>As a result of our involvement with Pakreport, our partners in Pakistan signed up to be a CrowdFlower contributor channel. Since April 2011, we have sent 313,030 microtasks to a pool of 500 Pakistani contributors from underserved communities. Our partners in Pakistan also recently launched <a href="http://pakreport.org/dowevote/">DoWeVote</a>, a map-based effort to improve future civic engagement by visualizing 2008 voter turnout data from across Pakistan. Pakreport continues to be an engine for social change, and its core structure is relatively simple to replicate in any setting or modify for similar projects.</p>
<p>After the <a href="http://poptech.org/world_rebalancing">2011 PopTech conference</a>, our partners at <a href="http://www.ushahidi.com">Ushahidi</a> reached out regarding a project to collect reports from the ground in Somalia. In conjunction with Al Jazeera, Souktel, and the African Diaspora Institute, the <a href="http://www.aljazeera.com/indepth/spotlight/somaliaconflict/somaliaspeaks.html">Somalia Speaks project</a> &#8220;seeks to echo the voices of ordinary Somalis in Somalia so they can be heard in the international media.&#8221; Text messages from the ground are collected, translated, categorized and mapped. The translated messages and maps are shared on Al Jazeera. It is a very powerful experience to read the words of a refugee or survivor on one of the largest news websites in the world; examples of mainstream media including the voices of these populations are few and far between.</p>
<div>
<div>
<div>
<p>Beyond all the do-gooder self-congratulation, however, it is important to note  that these efforts create new challenges and move our colleagues and us into new ethical territory. While CrowdFlower maintains rigorous confidentiality and security measures as part of our standard enterprise engagements, the recent collaborations described here are more creative, collaborative and high-profile. As more data flows through open source software and multistep workflows involving collaboration among multiple organizations, critical questions arise as to the confidentiality of the data involved as it is shared by wider audiences. For example, how can you reconcile the flow of sensitive or personal information with the use of software that emphasizes transparency above all else? How can we be certain that, particularly in conflict situations, these workflows do not create a risk of exposure or vulnerability for the people submitting reports from the ground? Who defines the standards for privacy as you amplify voices from vulnerable groups? A key lesson learned through these projects is that confidentiality is essential when dealing with personal information, but that it can be a challenge to protect confidentiality at every step of these multistep workflows. Finally, as we see more demand for these types of crowdsourcing projects, how can we reduce the start up time and the learning curve for organizations who wish to replicate these projects?</p>
<p>These are small but important achievements. The examples from 2010 directly inspired the examples we&#8217;ve seen in 2011. The successful implementation of these projects and the evolution of the discourse surrounding these disruptive tools are the result of substantial collaborative efforts, often by teams comprised entirely of volunteers located around the world. We feel incredibly privileged to work with the clients, partners, researchers, and visionaries who are redefining the ways that technology benefits society. To all of our partners and supporters, thank you for helping us discover new ways in which data can change the world. We can&#8217;t wait to see what 2012 will bring.</p>
<div>
<p><em>The entire team at CrowdFlower wishes you a joyful and peaceful 2012.</em></p>
</div>
</div>
</div>
</div>
</div>
<div>
<p>&nbsp;</p>
<hr align="left" size="1" width="33%" />
<div>
<p><a title="" href="#_ftnref1">[1]</a> In early 2010, we were part of the <a href="http://www.mission4636.org/">Mission 4636</a> collaboration to translate, categorize, and map SMS messages sent from survivors of the first earthquakes in Haiti. We repurposed the Mission 4636 workflow for another deployment of an Ushahidi instance with the <a href="http://www.pakreport.org">Pakreport</a> group in the wake of heavy flooding in Pakistan in the summer of 2010.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing Scientific Research: Leveraging the Crowd for Scientific Discovery</title>
		<link>http://blog.crowdflower.com/2011/11/scientific-research/</link>
		<comments>http://blog.crowdflower.com/2011/11/scientific-research/#comments</comments>
		<pubDate>Fri, 04 Nov 2011 22:24:24 +0000</pubDate>
		<dc:creator>Dave Oleson</dc:creator>
				<category><![CDATA[Health]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[contributors]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[Harvard]]></category>
		<category><![CDATA[TB]]></category>
		<category><![CDATA[tuberculosis]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3978</guid>
		<description><![CDATA[Lab scientists spend countless hours manually reviewing and annotating cells. What if we could give these hours back, and replace the tedious parts of science with a hands-off, fast, cheap, and scalable solution? That’s exactly what we did when we used the crowd to count neurons, an activity that computer vision can’t yet solve. Building [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/11/scientific-research/" data-text="Crowdsourcing Scientific Research: Leveraging the Crowd for Scientific Discovery" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/11/scientific-research/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/11/scientific-research/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/11/scientific-research/"></g:plusone></div></div><div id="attachment_4185" class="wp-caption alignright" style="width: 381px"><a href="http://blog.crowdflower.com/2011/11/scientific-research/all-sizes-cell-counts-flickr-photo-sharing/" rel="attachment wp-att-4185"><img class="size-full wp-image-4185" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/All-sizes-Cell-counts-Flickr-Photo-Sharing.jpg" alt="Cell counts | Flickr - Photo Sharing!" width="371" height="310" /></a><p class="wp-caption-text">Cell counts | Flickr - Photo Sharing!</p></div>
<p>Lab scientists spend countless hours manually reviewing and annotating cells. What if we could give these hours back, and replace the tedious parts of science with a hands-off, fast, cheap, and scalable solution?</p>
<p>That’s exactly what we did when we used the crowd to count neurons, an activity that computer vision can’t yet solve. Building on the work we recently did with the <a title="Harvard Tuberculosis lab" href="http://www.forbes.com/sites/techonomy/2011/10/26/crowdsourcing-scientific-progress-how-crowdflowers-hordes-help-harvard-researchers-study-tb/" target="_blank">Harvard Tuberculosis lab</a>, we were able to take untrained people all over the world (people who might never have learned that DNA Helicase unzips genes…), turn them into image analysts with our task design and quality control, and get results comparable to those provided by trained lab workers.</p>
<h3>Here’s how:</h3>
<p>We took cortex slide images from mice provided by a neuroscience lab at Harvard University. We cut each image into smaller pieces, so they’d be easier for people to work on.</p>
<p>After a brief set of instructions, contributors were instructed to count the neurons in the slide by clicking on each individual neuron.<span id="more-3978"></span></p>
<div id="attachment_3985" class="wp-caption aligncenter" style="width: 639px"><a href="http://blog.crowdflower.com/2011/11/scientific-research/cell1/" rel="attachment wp-att-3985"><img class="size-full wp-image-3985" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/cell1.png" alt="Fig 1: example slide" width="629" height="264" /></a><p class="wp-caption-text">Fig 1: example slide</p></div>
<p>Contributors were given examples of edge cases (i.e. Is this a cell or not? Is this two or three cells?). Contributors identified and clicked on every cell, which placed a green marker on top of each cell. An automated counter kept track of the number of clicks.</p>
<p>We controlled quality using Gold Standard (“Gold”) units, which were images with known cell counts that we added to the task. The benefits here are threefold. First, Gold questions provide training and feedback to our contributors, so that they can get better at the task over time. Second, contributors don’t know which questions are Gold, forcing them to honestly answer all questions. Finally, if a contributor fails to answer enough Gold correctly, we remove them from the job.</p>
<div id="attachment_4208" class="wp-caption aligncenter" style="width: 710px"><a href="http://blog.crowdflower.com/2011/11/scientific-research/world-map-contributors/" rel="attachment wp-att-4208"><img class="size-full wp-image-4208" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/world-map-contributors.png" alt="Fig 2: Map of all contributors on this task. Step it up Iceland!" width="700" height="350" /></a><p class="wp-caption-text">Fig 2: Map of all contributors on this task. Step it up Iceland!</p></div>
<p style="text-align: center;"><a title="batchgeo.com TB Cell Count Contributors" href="http://batchgeo.com/map/faa07e6c77756d3685529b17e9a14a5d" target="_blank">Visit the batchgeo.com navigable map.</a></p>
<p>After removing all of our “untrusted” contributors, we are left with our “trusted” contributors. Each image had four trusted contributors count the neurons. We took the average count less any outliers in order to get the most accurate results.</p>
<p>How did this experiment actually turn out? How did our results compare to those of trained lab workers? In short, we performed extremely well. As you can see in our results below, the average difference between the Crowd and professional lab counts is 2.0%, which can be chalked up to ambiguity in certain clusters of cells.</p>
<table style="width: 364px; margin-left: auto; margin-right: auto; text-align: center;" cellspacing="0px" cellpadding="0px">
<tbody>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Image</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">Lab Count</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">Avg. Crowd Count</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">Difference</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">% Difference</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap">Sox6ko2a-1</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">239</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">229</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-10</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(4.0%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko2a-3</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">157</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">153</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(2.4%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko2a-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">161</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">160</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-1</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(0.6%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko2b-1</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">250</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">240</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-11</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(4.2%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko2b-2</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">179</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">173</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-6</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(3.2%)</td>
</tr>
<tr style="border: 1px solid black;">
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko2b-3</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">134</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">130</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(3.0%)</td>
</tr>
<tr style="border: 1px solid black;">
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko2b-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">153</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">152</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-2</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(1.0%)</td>
</tr>
<tr style="border: 1px solid black;">
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko3a-1</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">209</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">209</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">0</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(0.1%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko3a-2</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">147</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">149</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">2</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">1.2%</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko3a-3</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">134</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">129</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-5</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(3.9%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko3a-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">138</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">136</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-2</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(1.6%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko3b-1</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">213</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">212</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-1</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(0.5%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">Sox6ko3b-3</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">78</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">75</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">(4.5%)</td>
</tr>
<tr>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="67">sox6ko3b-4</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="58">54</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="101">54</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="63">0</td>
<td style="border: 1px solid black;" valign="bottom" nowrap="nowrap" width="75">0.0</td>
</tr>
</tbody>
</table>
<p>Future iterations of this will include breaking the images into smaller pieces, as well as using automated solutions to tag the high confidence units, leaving only the edge case cells that require human eyes.</p>
<p>We hope that results like these will encourage more scientists, labs, and biotech firms to crowdsource pieces of their research. We believe this could free up their time for more complicated work, decrease the latency of results for experiments, and quicken the pace of scientific discovery.</p>
<p>Yesterday it was <a title="TB Cells" href="http://www.forbes.com/sites/techonomy/2011/10/26/crowdsourcing-scientific-progress-how-crowdflowers-hordes-help-harvard-researchers-study-tb/" target="_blank">TB Cells</a>, today its neuron cells; tomorrow: cancer cells? We’re issuing an open call for any university lab, biotechnology, or pharmaceutical company with large image data sets: contact us to utilize the crowd to shorten the life cycle from idea to major scientific advancement.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/11/scientific-research/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>258 Guys in a Garage! Crowdsourcing an Entire Startup</title>
		<link>http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/</link>
		<comments>http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/#comments</comments>
		<pubDate>Thu, 27 Oct 2011 17:14:49 +0000</pubDate>
		<dc:creator>Philip Rosedale</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[crowdconf]]></category>
		<category><![CDATA[philip rosedale]]></category>
		<category><![CDATA[silicon valley]]></category>
		<category><![CDATA[startup]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3944</guid>
		<description><![CDATA[About the author: Philip Rosedale is the creator of Second Life and a Co-Founder of LoveMachine, Inc. My co-founder Ryan and I are having so much fun pulling together data and thoughts for my upcoming keynote at CrowdConf next week. It&#8217;s a great opportunity to try and summarize much of what we&#8217;ve learned over the [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/" data-text="258 Guys in a Garage! Crowdsourcing an Entire Startup" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/"></g:plusone></div></div><div id="attachment_3947" class="wp-caption alignright" style="width: 151px"><a href="http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/philip-rosedale3/" rel="attachment wp-att-3947"><img class="size-full wp-image-3947    " title="philip-rosedale" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/philip-rosedale3.jpg" alt="philip-rosedale" width="141" height="212" /></a><p class="wp-caption-text">via: FastCompany</p></div>
<p><em>About the author: Philip Rosedale is the creator of Second Life and a Co-Founder of LoveMachine, Inc.</em></p>
<p>My co-founder Ryan and I are having so much fun pulling together data and thoughts for my upcoming keynote at CrowdConf next week. It&#8217;s a great opportunity to try and summarize much of what we&#8217;ve learned over the last few years about whether and how crowdsourcing can be taken to the next logical (we think) level: to replace a bunch of what we&#8217;ve come to think of as the nature of &#8220;work&#8221; and &#8220;company”.</p>
<p>The Silicon Valley startup formula is now a well-recognized and time-honored strategy, which I think we&#8217;ve all worn into a bit of a rut: 3 or 4 very smart people (usually guys) hunker down in someone&#8217;s garage, work a bleary-eyed 80 hours a week producing a prototype, getting funding, hiring those first handful of key engineers, etc.</p>
<p><span id="more-3944"></span>But what would happen if you never hired anyone at all? What about doing a &#8216;real&#8217; Silicon Valley startup &#8211; meaning a new, risky, design-sensitive idea backed by venture capital &#8211; but without the garage and the crunch time and team of &#8216;A players&#8217;? What if you built a system that instead allowed people all over the world to be paid small amounts of money to help you prototype, build, and then release a high quality new website and software?</p>
<p>Well that&#8217;s what we did, and there is a lot to talk about!</p>
<p>-Philip</p>
<p>&nbsp;</p>
<p><em>Come hear Philip and many other experts in the crowdsourcing field speak at <a title="CrowdConf 2011" href="http://www.crowdconf.com">CrowdConf 2011</a>. It is a great opportunity to discover what is happening in the world of crowdsourcing. It starts next week, November 1-2 at the Mission Bay Conference Center in San Francisco. You can buy your tickets <a title="CrowdConf 2011 Tickets" href="http://crowdconf2011.eventbrite.com/">here</a>. Hope to see you there!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/10/258-guys-in-a-garage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing and Retention: From First-Timers to Seasoned Veterans</title>
		<link>http://blog.crowdflower.com/2011/10/seasonedveterans/</link>
		<comments>http://blog.crowdflower.com/2011/10/seasonedveterans/#comments</comments>
		<pubDate>Tue, 25 Oct 2011 20:10:31 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[engagement]]></category>
		<category><![CDATA[gold]]></category>
		<category><![CDATA[insights]]></category>
		<category><![CDATA[judgment]]></category>
		<category><![CDATA[loyal]]></category>
		<category><![CDATA[retention]]></category>
		<category><![CDATA[veterans]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2589</guid>
		<description><![CDATA[Millions of people have participated in our tasks over the last few years, and tens of thousands of people are active at any given moment. However, crowdsourcing is not a traditional engagement model. Tasks are elective, which means people are free to come and go as they please. It&#8217;s a fair question, then, to ask whether they [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/10/seasonedveterans/" data-text="Crowdsourcing and Retention: From First-Timers to Seasoned Veterans" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/10/seasonedveterans/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/10/seasonedveterans/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/10/seasonedveterans/"></g:plusone></div></div><p>Millions of people have participated in our tasks over the last few years, and tens of thousands of people are active at any given moment. However, crowdsourcing is not a traditional engagement model. Tasks are elective, which means people are free to come and go as they please. It&#8217;s a fair question, then, to ask whether they keep coming back.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/10/seasonedveterans/comebacksoon/" rel="attachment wp-att-3929"><img class="aligncenter size-full wp-image-3929" title="Come Back Soon" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/comebacksoon.jpg" alt="crowdsourcing " width="486" height="334" /></a></p>
<p><em>Do people perform tasks only fleetingly, or has crowdsourcing become more of a long-term engagement? </em><em>Furthermore, just how important is contributor retention in the world of crowdsourcing?</em></p>
<p>While a majority of people fall into the &#8220;one-and-done&#8221; camp, many of the most productive contributors tend to have participated in previous jobs. Within any single job, these seasoned veteran contributors also provide far more work than their less experienced counterparts.</p>
<p><span id="more-2589"></span></p>
<p>Over a period of two months, we ran a series of five very similar jobs, retrieving ratings information for businesses throughout North America. In total, we collected over half a million judgments from a total of 2,901 unique contributors,<sup><a href="#footnote-1">1</a></sup> representing multiple labor channels and 101 countries. As a first test to whether seasoned veterans are common, we looked at how many people participated in more than one job. In total, 2,389 people participated in one job only, meaning that First-Timers accounted for over 82 percent of all contributors.</p>
<p>But this may not be the best number to look at. We&#8217;re really interested in whether certain people recognize and seek out specific types of tasks after having worked on them before. To look for this behavior, we analyzed the most recent job, counting how many people participated in at least one prior iteration of the task.</p>
<p>For a recent job, a total of 906 people participated, 247 of whom had done the task previously. By this measure, Seasoned Vets constitute approximately 27% of the workforce. While the impact of returning contributors is greater under this methodology, the fact remains that these &#8220;loyal&#8221; contributors are firmly in the minority on this series of jobs.</p>
<div id="attachment_3869" class="wp-caption aligncenter" style="width: 631px"><a href="http://blog.crowdflower.com/2011/10/seasonedveterans/contributor_count/" rel="attachment wp-att-3869"><img class="size-full wp-image-3869 " title="contributor_count" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/contributor_count.jpg" alt="crowdsourcing" width="621" height="395" /></a><p class="wp-caption-text">Count of Contributors by Profile</p></div>
<p>&nbsp;</p>
<p>However, <a title="crowdsourcing" href="http://blog.crowdflower.com/2010/12/good-work-knows-no-boundaries/" target="_blank">as we&#8217;ve seen before</a>, the individual impact of contributors varies widely, with a minority of people often providing the vast majority of work. With this in mind, we looked at the contributions of First-Timers and Seasoned Vets and found some striking differences.</p>
<div id="attachment_3870" class="wp-caption aligncenter" style="width: 598px"><a href="http://blog.crowdflower.com/2011/10/seasonedveterans/contributor_share/" rel="attachment wp-att-3870"><img class="size-full wp-image-3870 " title="contributor_share" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/contributor_share.jpg" alt="crowdsourcing" width="588" height="396" /></a><p class="wp-caption-text">Share of Contributions by Profile</p></div>
<p>&nbsp;</p>
<p><strong>Seasoned Vets, while constituting only 27% of the workforce, provided 47% of the total work completed.</strong> <strong>On average, each Seasoned Veteran provided 2.5 times more judgments than their less experienced counterparts.</strong> It&#8217;s also interesting to note that there was no significant difference between the quality of work provided by First-Timers and Seasoned Vets, no doubt due to our suite of <a title="Enterprise Crowdsourcing or: How I learned to stop worrying and trust the crowd" href="http://blog.crowdflower.com/2011/10/stopworrying/">quality control measures</a>.</p>
<p>Given that the people who stick around tend to be far more productive, improving retention is a useful consideration (for these jobs, at least). We&#8217;re now interested in how best to attract people to return to the types of jobs they&#8217;ve already seen. That, however, is a work in progress and a story for another day.</p>
<hr />
<p id="footnote-1" style="text-align: -webkit-auto;">1. Note that this analysis only considers Trusted workers.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/10/seasonedveterans/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enterprise Crowdsourcing or: How I learned to stop worrying and trust the crowd</title>
		<link>http://blog.crowdflower.com/2011/10/stopworrying/</link>
		<comments>http://blog.crowdflower.com/2011/10/stopworrying/#comments</comments>
		<pubDate>Wed, 05 Oct 2011 23:53:33 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[confidence]]></category>
		<category><![CDATA[contributors]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[demographic]]></category>
		<category><![CDATA[filtering]]></category>
		<category><![CDATA[gold]]></category>
		<category><![CDATA[judgment]]></category>
		<category><![CDATA[location]]></category>
		<category><![CDATA[redundancy]]></category>
		<category><![CDATA[review]]></category>
		<category><![CDATA[training]]></category>
		<category><![CDATA[trust]]></category>
		<category><![CDATA[workflow]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3692</guid>
		<description><![CDATA[Our recent post about confidence bias, where we showed that most contributors vastly overestimate their own ability to complete tasks correctly, raised a lot of questions about how we manage quality at CrowdFlower. You might remember these themes from such classic posts as: AMT is Fast, Cheap and Good or the Wisdom of Small Crowds series [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/10/stopworrying/" data-text="Enterprise Crowdsourcing or: How I learned to stop worrying and trust the crowd" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/10/stopworrying/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/10/stopworrying/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/10/stopworrying/"></g:plusone></div></div><p>Our recent post about confidence bias, where we showed that <a title="Confidence Bias: Evidence from Crowdsourcing" href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/" target="_blank">most contributors vastly overestimate their own ability to complete tasks correctly</a>, raised a lot of questions about how we manage quality at CrowdFlower. You might remember these themes from such classic posts as: <a title="AMT is fast, cheap, and good for machine learning data" href="http://blog.crowdflower.com/2008/09/amt-fast-cheap-good-machine-learning/">AMT is Fast, Cheap and Good</a> or the Wisdom of Small Crowds series <a title="Wisdom of Small Crowds Part 1" href="http://blog.crowdflower.com/2008/06/aggregate-turker-judgments-threshold-calibration/" target="_blank">[1]</a> <a title="Wisdom of Small Crowds Part 2" href="http://blog.crowdflower.com/2008/08/wisdom-of-small-crowds-part-2-individual-workloads-and-rates/" target="_blank">[2]</a> <a title="Wisdom of Small Crowds Part 3" href="http://blog.crowdflower.com/2008/08/wisdom-of-small-crowds-part-3-another-worker-visualization/" target="_blank">[3]</a>.</p>
<div id="attachment_3786" class="wp-caption alignright" style="width: 240px"><a href="http://blog.crowdflower.com/2011/10/stopworrying/prospector/" rel="attachment wp-att-3786"><img class="size-full wp-image-3786    " title="prospector" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/prospector.jpeg" alt="crowdsourcing" width="230" height="226" /></a><p class="wp-caption-text">via: reddead.wikia.com/</p></div>
<p>The standard CrowdFlower model is agnostic towards the quality of any individual contributor. Typically, we let anyone attempt a task, using our technology to filter out low-quality contributors and score the responses. Without further ado, what follows is quick review of the steps we take to do that filtering.</p>
<p><span id="more-3692"></span><span class="Apple-style-span" style="font-size: 20px; font-weight: bold;">Gold (What is it Good For?)</span></p>
<p>In almost every job, we take a subset of the data to be processed and manually score the correct response. This manually-scored set, which we refer to as Gold Standard Data, is at the core of managing quality in the context of enterprise crowdsourcing:</p>
<ul>
<li><strong>Filtering</strong>: We use Gold to create an up-front test, creating a barrier to entry such that only workers who understand and successfully complete a task are allowed to participate. This allows us to prevent unsavory characters from entering jobs and contaminating results.</li>
<li><strong>On-going Training</strong>: We also use Gold to conduct on-going training, offering corrections for units that are answered incorrectly. This allows us to continually instruct and improve highly prolific contributors.</li>
<li><strong>Dynamic Trust Score</strong>: We use each contributor’s performance on Gold as a basis to determine their overall accuracy within a task. Each contributor must exceed our minimum trust thresholds to continue working on a task. If at any point a contributor falls below the trust threshold, we&#8217;ll exclude his work.</li>
</ul>
<p>Because creating Gold is labor-intensive, we&#8217;ve created <a title="Programmatic Gold: Targeted and Scalable Quality Assurance in Crowdsourcing" href="http://crowdflower.com/images/marketing/papers/HCOMP2011-philosopher-stone.pdf" target="_blank">an automated process to generate Gold</a> using units that have already been completed. This has significantly reduced the time needed for setup and ongoing job creation, without sacrificing our ability to differentiate contributors.</p>
<p>Of course, <a title="Ensuring quality in crowdsourced search relevance evaluation: The effects of training question distribution" href="http://crowdflower.com/images/marketing/papers/SIGIRpaper.pdf" target="_blank">the amount and distribution of Gold</a> is critical. Often, a uniform distribution of Gold across response types is ideal, though in certain situations we&#8217;ll use a skewed Gold set. For example, in an experiment on crowdsourced document review, we used a skewed Gold set to avoid missing relevant documents (reduced &#8220;false negatives,&#8221; if you prefer).</p>
<p style="text-align: center;"><img class="aligncenter" title="eDiscovery Results" src="http://blog.crowdflower.com/wp-content/uploads/2011/04/ediscover_stats.jpg" alt="crowdsourcing" width="835" height="264" /></p>
<h2><span class="Apple-style-span" style="font-size: 26px;">Department of Judgment Redundancy Department</span></h2>
<p>If the purpose of Gold is to manage the quality of individual contributors, we use multiple judgments per unit to improve the accuracy of completed units. The basic premise is simple enough. We look for agreement among trusted workers to indicate correct responses at the unit level. For example, if we ask four people to verify a phone number for a business, the answer is more likely to be correct if all four agree. In fact, every unit processed by CrowdFlower is annotated with a response as well as a Confidence Score (based on agreement weighted by Trust, plus some secret sauce).</p>
<p>More generally, assume we set the trust threshold for a given job at 70 percent (meaning that anyone who doesn&#8217;t answer at least 70 percent of Gold correctly gets booted) and that contributors are uniformly distributed in terms of ability (not true, but convenient). We can easily model the effect of additional judgments on estimated accuracy, showing that the probability that the majority response is correct increases with the number of judgments collected:</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/10/stopworrying/redundancy/" rel="attachment wp-att-3700"><img class="aligncenter size-full wp-image-3700" title="Judgment Redundancy and Accuracy" src="http://blog.crowdflower.com/wp-content/uploads/2011/09/redundancy.jpg" alt="crowdsourcing" width="471" height="308" /></a></p>
<p>Of course, while collecting 10 judgments per unit yields highly accurate results, it may not be the most efficient way to structure a job. Imagine that the first 2, 4 or even 6 contributors agree on how a unit should be classified. At some point, the marginal impact of an additional judgment is not worth the additional cost. We&#8217;ve automated a process to vary the number of judgment each unit receives based on agreement thresholds, so that we can reach accuracy targets more efficiently.</p>
<p>The following shows actual results from a sample job, where we set a minimum confidence threshold of 0.7:</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/10/stopworrying/variable-judgments-viz/" rel="attachment wp-att-3722"><img class="aligncenter size-full wp-image-3722" title="Variable Judgments Viz" src="http://blog.crowdflower.com/wp-content/uploads/2011/09/Variable-Judgments-Viz.jpg" alt="crowdsourcing" width="694" height="454" /></a></p>
<p>Approximately 50 percent of units completed with just 2 judgments and 75 percent completed with 4 or fewer. In any case, each unit received only as many judgments as necessary to reach the confidence threshold. For any job, some subset of units will be ambiguous enough that they won&#8217;t reach a confidence threshold, so we use also set maximum judgments cap to &#8220;stop the bleeding.&#8221; Depending on the specific circumstances, we may reroute those ambiguous units to a parallel process with different structure, contributors, etc. for another round of judgments.</p>
<h2>One More Trick</h2>
<p>For complex tasks, we have developed a <strong>workflow management system</strong> to link together multiple jobs. For example, we might ask one pool of contributors to write a product description, verify the spelling and accuracy with a second pool and rank the subjective quality with a third pool. Alternatively, we might take a business listing and break out each attribute for independent collection and verification, with a separate job for name, address and phone number, or cuisine type, cash-only, types of credit cards accepted, on-site parking, or any other attribute that can be verified online. In general, <strong>peer review</strong> means that we can always give data a second pass to improve accuracy.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/10/stopworrying/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing a Map, to Eat</title>
		<link>http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/</link>
		<comments>http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/#comments</comments>
		<pubDate>Fri, 16 Sep 2011 17:15:26 +0000</pubDate>
		<dc:creator>Aron Hegyi</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[Balsamic]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[Campania]]></category>
		<category><![CDATA[city]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[directions]]></category>
		<category><![CDATA[food]]></category>
		<category><![CDATA[geocoding]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Geocoding]]></category>
		<category><![CDATA[Google Refine]]></category>
		<category><![CDATA[guidebook]]></category>
		<category><![CDATA[Italy]]></category>
		<category><![CDATA[Italy Pummel]]></category>
		<category><![CDATA[KML]]></category>
		<category><![CDATA[La Vecchia Dispensa]]></category>
		<category><![CDATA[map]]></category>
		<category><![CDATA[PDF]]></category>
		<category><![CDATA[region]]></category>
		<category><![CDATA[restaurant]]></category>
		<category><![CDATA[self-service]]></category>
		<category><![CDATA[slow food]]></category>
		<category><![CDATA[solutions]]></category>
		<category><![CDATA[Tuscany]]></category>
		<category><![CDATA[Zingerman’s]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3637</guid>
		<description><![CDATA[Last May, I took a trip to Italy for two weeks. A little bit of history: my friend Jessica and I are both Italophiles, and when her mom sent us a link to a video contest where the prize was a round trip flight to Italy, we knew we had to enter. After a week [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/" data-text="Crowdsourcing a Map, to Eat" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/"></g:plusone></div></div><div class="wp-caption alignright" style="width: 343px"><a href="http://www.flickr.com/photos/aronh/5800985627/in/set-72157626896293314/" target="_blank"><img class="  " src="http://farm4.static.flickr.com/3098/5800985627_40ed3a7cfb.jpg" alt="A parmigiano reggiano aging room" width="333" height="500" /></a><p class="wp-caption-text">Expanse of Cheese: Caseificio di San Silvestro&#39;s Aging Room in Castelvetro (MO)</p></div>
<p>Last May, <span class="s1"><a title="I took a trip to Italy" href="http://www.youtube.com/watch?v=n66L-HplnKI" target="_blank">I took a trip to Italy</a></span> for two weeks. A little bit of history: my friend Jessica and I are both Italophiles, and when her mom sent us a link to <a title="A Video Contest" href="http://www.zingermans.com/BalsamicVideos.aspx" target="_blank">a video contest</a> where the prize was a round trip flight to Italy, we knew we had to enter. After a week of writing and editing lyrics in a Google Doc — half in Italian, half in English — <span class="s1"><a title="the resulting music video" href="http://www.youtube.com/watch?v=ot-ixjYolKM" target="_blank">the resulting music video</a> </span>ended up winning us a trip to the holy land of olive oil, vino, and other delectable edibles.</p>
<p class="p1">Apart from being a passionate eater, I&#8217;m a passionate supporter of the <span class="s1"><a title="Slow Food" href="http://slowfoodusa.org/" target="_blank">Slow Food</a></span> movement, an organization which promotes good, clean, and fair food around the world. Each year, they publish a guidebook to restaurants in Italy that adhere to their principles. In Italy, this usually means each restaurant is handpicked to showcase the traditional food of a particular region; each restaurant supports artisanal methods and products that otherwise might go extinct (were eaters not eating them), and where the food is most likely naturally organic and local anyway.</p>
<p class="p1"><span id="more-3637"></span>But, a problem: it&#8217;s 2011, and I&#8217;m more apt to travel through interactive maps than with old-fashioned guidebooks. More importantly, I don&#8217;t plan, and I needed to know what edible options were around me at any moment in time on my trip. What I really needed was a version of the guidebook, in map form, that I could use on my mobile. No such option existed, of course, and I was left with two options: magically visualize restaurants around me by poring through the guidebook, or, crowdsource it.</p>
<p class="p1">What I needed to do (and, what much of our work at CrowdFlower comprises), was structure unstructured data. To create the map from the book, I went through the following steps:</p>
<ul class="ul1">
<li class="li3"><span class="s2">Obtained <span class="s3"><a title="a PDF version of the book" href="http://ultimabooks.simplicissimus.it/catalog/product/view/id/1874/s/osterie-d-italia-2011/" target="_blank">a PDF version of the book</a></span></span></li>
<li class="li1">Split the book into pages, and uploaded each page as a separate PDF (thanks to <span class="s1"><a title="pdftk" href="http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/" target="_blank">pdftk</a></span>)</li>
<li class="li1">Created a CSV (comma separated values file) with each page&#8217;s PDF link and page number</li>
<li class="li1">Created a crowdsourcing task to structure the data, using the previously uploaded individual PDF pages</li>
<li class="li1">Geocoded the structured data</li>
<li class="li1">Output the geocoded data in KML (<span class="s1"><a title="Keyhole Markup Language" href="http://en.wikipedia.org/wiki/Keyhole_Markup_Language" target="_blank">Keyhole Markup Language</a></span>) form</li>
<li class="li1">Uploaded the KML file to a mapping site (e.g. Google Maps)</li>
</ul>
<p class="p1">Conveniently, each page that outlined a restaurant in the PDF was formatted nearly the same. This made it easy to give instructions to workers, as seen below:</p>
<div class="wp-caption aligncenter" style="width: 582px"><a href="http://publicassets.s3.amazonaws.com/pdf_transcription_test/capture_areas_example.gif"><img class=" " src="http://publicassets.s3.amazonaws.com/pdf_transcription_test/capture_areas_example.gif" alt="crowdsourcing" width="572" height="414" /></a><p class="wp-caption-text">Instructions</p></div>
<p class="p1">For each area on the page, workers were asked to copy and paste specific sections into the task. Each page, then, was split up into corresponding parts (Region, City, Directions, Restaurant Name, and the two Capture Areas). This is the essential concept here: structuring the unstructured data such that I could later geocode it properly, and display it in the way that I needed.</p>
<div class="wp-caption alignnone" style="width: 510px"><a href="http://www.flickr.com/photos/aronh/5801664934/in/set-72157626896293314/" target="_blank"><img class=" " src="http://farm6.static.flickr.com/5199/5801664934_027586411b.jpg" alt="crowdsourcing Pasta" width="500" height="333" /></a><p class="wp-caption-text">The tonnarelli cacio e pepe at Da Felice a Testaccio in Rome (used in the Geocode example)</p></div>
<p class="p1">Once the task finished, I downloaded the resulting CSV file, and whipped out <a title="Google Refine" href="http://code.google.com/p/google-refine/" target="_blank">Google Refine</a> (a.k.a. Excel on crack), which has a feature that allows you to enter a template API call that changes based on specific values in each row. Using the <a title="Google Geocoding API" href="http://code.google.com/apis/maps/documentation/geocoding/" target="_blank">Google Geocoding API</a> (any will do), I constructed the following API call, using the address value in each row as the “address&#8221; parameter for each API call:</p>
<pre style="color: #000; padding: 17px 20px; border: 1px solid #E6DB55; background: lightYellow;">http://maps.googleapis.com/maps/api/geocode/json?address=via+Mastro+Giorgio%2C+26+Roma+Lazio&amp;sensor=false&amp;region=it&amp;language=it</pre>
<p class="p1">After slicing and dicing the rest of the data into bits that I wanted to display on a map, I used Google Refine&#8217;s &#8220;templating&#8221; feature to export each row as a Placemark in KML format. Finally, I uploaded the resulting KML file into several different maps, each representing one region in Italy.</p>
<div id="attachment_3683" class="wp-caption alignnone" style="width: 477px"><a href="http://maps.google.com/maps/ms?msa=0&amp;msid=209651542845453072441.0004a388168c61ef8f9ec" target="_blank"><img class="size-full wp-image-3683  " src="http://blog.crowdflower.com/wp-content/uploads/2011/09/Screen-shot-2011-09-15-at-4.02.58-PM1.png" alt="crowdsource food map" width="467" height="304" /></a><p class="wp-caption-text">A portion of the Campania map - tons of places to explore around here!</p></div>
<p class="p1">Try it out! Check out the map for <a title="Tuscany" href="http://maps.google.com/maps/ms?msa=0&amp;msid=209651542845453072441.0004a2ac2f900cba0b2f3" target="_blank">Tuscany</a>, and the map for <a title="Campania" href="http://maps.google.com/maps/ms?msa=0&amp;msid=209651542845453072441.0004a388168c61ef8f9ec" target="_blank">Campania</a>.</p>
<p class="p1">If you want to give crowdsourcing a spin, head on over to our <a title="Crowdsource Self-Service" href="http://crowdflower.com/solutions/self-service" target="_blank">self-service product</a> and click &#8220;sign up&#8221;. Or, if you&#8217;d prefer a hands-off &#8220;we-do-everything&#8221; approach, <a title="solutions@crowdflower.com" href="mailto:solutions@crowdflower.com" target="_blank">contact us</a> to get started.</p>
<p>Watch the video that started it all: <a href="http://www.youtube.com/watch?v=ot-ixjYolKM?rel=0" target="_blank"><em>Io Sono Balsamico</em> by Balsamico</a></p>
<p><em>Aron is a Crowdsourcing Project Manager at CrowdFlower, and is the resident agriculturalist-eater. Follow him on Twitter (<a href="http://twitter.com/aron" target="_blank">@aron</a>) for more sage bits of agricultural-eating learnings.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Confidence Bias: Evidence from Crowdsourcing</title>
		<link>http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/</link>
		<comments>http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/#comments</comments>
		<pubDate>Wed, 07 Sep 2011 23:02:04 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[confidence]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[experiment]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3080</guid>
		<description><![CDATA[Evidence in experimental psychology suggests that most people overestimate their own ability to complete objective tasks accurately. This phenomenon, often called confidence bias, refers to &#8220;a systematic error of judgment made by individuals when they assess the correctness of their responses to questions related to intellectual or perceptual problems.&#8221; 1 But does this hold up in crowdsourcing? We ran an experiment to [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/" data-text="Confidence Bias: Evidence from Crowdsourcing" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/" data-counter="top"></script></div></div><div id="attachment_3593" class="wp-caption alignleft" style="width: 164px"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/overconfidence/" rel="attachment wp-att-3593"><img class="size-full wp-image-3593    " title="psychologytoday.com" src="http://blog.crowdflower.com/wp-content/uploads/2011/09/Overconfidence.gif" alt="crowdsourcing" width="154" height="132" /></a><p class="wp-caption-text">psychologytoday.com</p></div>
<p>Evidence in experimental psychology suggests that most people overestimate their own ability to complete objective tasks accurately. This phenomenon, often called <em>confidence bias, </em>refers to &#8220;a systematic error of judgment made by individuals when they assess the correctness of their responses to questions related to intellectual or perceptual problems.&#8221; <sup><a href="#footnote-1">1</a></sup> But does this hold up in crowdsourcing?</p>
<p>We ran an experiment to test for a persistent difference between people&#8217;s perceptions of their own accuracy and their actual objective accuracy. We used a set of standardized questions, focusing on the Verbal and Math sections of a common standardized test. For the 829 individuals who answered more than 10 of these questions, we asked for the correct answer as well as an indication of how confident they were of the answer they supplied.</p>
<p><span id="more-3080"></span>We didn&#8217;t use any Gold in this experiment. Instead, we incentivized performance by rewarding those finishing in the top 10%, based on objective accuracy.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/sample_problem/" rel="attachment wp-att-3427"><img class="aligncenter size-full wp-image-3427" title="sample_problem" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/sample_problem.jpg" alt="crowdsourcing" width="713" height="520" /></a></p>
<h2>Does Bias Exist?<em> </em></h2>
<p>To estimate confidence bias, we looked at the difference between the average of how confident an individual was of his/her answers and how many he/she answered correctly. If the difference is positive, the individual overestimated how well they did. <strong>Amazingly, over 75% of contributors overestimated their ability to answer multiple choice questions correctly.</strong></p>
<h2><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/histogram_res/" rel="attachment wp-att-3278"><img class="aligncenter size-full wp-image-3278" title="histogram_res" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/histogram_res.jpg" alt="crowdsourcing" width="599" height="341" /></a></h2>
<h2>Are Individuals Consistently Biased?</h2>
<p>Because our dataset consisted of Math and Verbal questions, we looked at each individual contributor&#8217;s confidence bias for both types of questions. In aggregate, people tended to have more trouble with the Verbal questions (average accuracy of 28%, compared to 41% for Math), though the average confidence score was nearly identical (63% +/-1).</p>
<h2><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/scatterplot/" rel="attachment wp-att-3279"><img class="aligncenter size-full wp-image-3279" title="scatterplot" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/scatterplot.jpg" alt="crowdsourcing" width="639" height="380" /></a></h2>
<p>The vast majority of contributors fall into the &#8220;overconfident on both&#8221; quadrant (top right), while only a handful of contributors were overconfident for one question type and underconfident for the other (top left and bottom right quadrants). Overall, there is certainly a correlation between bias scores on the two problem types, suggesting that many individuals are consistently biased on different types of problems. However, this explains only a portion of the variation.</p>
<h2>Does Bias Vary Across Groups?</h2>
<p>Given that overconfidence seems to be a consistent trait, we were curious how this trait varies across the different groups making up our contributor pool. We sliced and diced our contributors into a number of different sub-groups, which are summarized below.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/summary-table/" rel="attachment wp-att-3280"><img class="aligncenter size-full wp-image-3280" title="summary table" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/summary-table.jpg" alt="crowdsourcing" width="703" height="460" /></a></p>
<p>There are a lot of interesting things going on here. To highlight a few, accuracy increases consistently as the contributor&#8217;s education level advances from High School to College, but so does confidence, leaving the bias score nearly unchanged. There&#8217;s a similar pattern with Age, with older contributors tending to be both more accurate and more confident.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/splits/" rel="attachment wp-att-3408"><img class="aligncenter size-full wp-image-3408" title="splits" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/splits.jpg" alt="crowdsourcing splits" width="768" height="252" /></a></p>
<p>Gender and Location also have an effect on confidence bias. Taking the two countries that supplied the most people, contributors from the US were much more accurate and slightly more confident than the average, while those from India were average in terms of accuracy but much more confident. As such, the bias score for contributors from India is nearly double that of contributors from the US. With respect to gender, confidence didn&#8217;t vary much, but women were more accurate and thus less biased than men. Moving on.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/splits2/" rel="attachment wp-att-3405"><img class="aligncenter size-full wp-image-3405" title="splits2" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/splits2.jpg" alt="crowdsourcing" width="773" height="251" /></a></p>
<h2>Further Research</h2>
<p>In the context of experimentation, we decided against using Gold to minimize any selection bias among contributors. However, this makes it difficult to apply these results to enterprise crowdsourcing, at least as practiced by CrowdFlower. In the future, it would be interesting to look at confidence bias among trusted workers only, and particularly among trusted workers with repeated experience in specific job types. We would expect these workers to have a better sense of whether their answers are correct, though it is possible (and perhaps likely) that confidence would increase along with accuracy.</p>
<p>&nbsp;</p>
<hr />
<p id="footnote-1" style="text-align: -webkit-auto;">1. Pallier, G., Wilkinson, R., Danthir, V., Kleitman, S., Knezevic, G., Stankov, L., &amp; Roberts, R. D. (2002). The role of individual differences in the accuracy of conﬁdence judgments. Journal of General Psychology, 129,257–299</p>
<p style="text-align: center;">
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>From Sea to Shining Sea</title>
		<link>http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/</link>
		<comments>http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/#comments</comments>
		<pubDate>Fri, 08 Jul 2011 23:59:38 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2698</guid>
		<description><![CDATA[With Independence Day just behind us, I thought it might make sense to reflect on the people who do CrowdFlower tasks throughout the Union. Sure enough, we do have people doing CrowdFlower tasks in all 50 states. California, Texas, New York, and Florida are well represented among people working on our tasks, which isn&#8217;t especially [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/" data-text="From Sea to Shining Sea" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/" data-counter="top"></script></div></div><p>With Independence Day just behind us, I thought it might make sense to reflect on the people who do CrowdFlower tasks throughout the Union.</p>
<p><a href="http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/states/" rel="attachment wp-att-2699"><img class="aligncenter size-full wp-image-2699" title="states" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/states.jpg" alt="" width="440" height="239" /></a></p>
<p>Sure enough, we do have people doing CrowdFlower tasks in all 50 states. California, Texas, New York, and Florida are well represented among people working on our tasks, which isn&#8217;t especially surprising given that these are the most populous states. A more interesting question would be how the states stack up relative to their populations:</p>
<p><a href="http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/ranked_states/" rel="attachment wp-att-2930"><img class="aligncenter size-full wp-image-2930" title="ranked_states" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/ranked_states.jpg" alt="" width="810" height="606" /></a></p>
<p>There are some cold weather states, like Vermont and South Dakota, that are punching above their weight class in terms of CrowdFlower judgments submitted, although Maine and North Dakota are lagging behind so that doesn&#8217;t really work. Connecticut and New Jersey, two of the wealthier states in terms of average income, are underperforming relative to their population, but Massachusetts and New York are doing just fine. Mississippi and West Virginia, coming from the opposite end of the spectrum in terms of average income, show up strongly, but Louisiana and Montana don&#8217;t.</p>
<p>Any thoughts about why some states are more represented? Leave &#8216;em in the comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/07/from-sea-to-shining-sea/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Designing Incentives for Crowdsourcing Workers</title>
		<link>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/</link>
		<comments>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/#comments</comments>
		<pubDate>Tue, 24 May 2011 19:19:45 +0000</pubDate>
		<dc:creator>Aaron Shaw</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Human Behavior]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[behavior]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[incentives]]></category>
		<category><![CDATA[motivation]]></category>
		<category><![CDATA[social science]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2572</guid>
		<description><![CDATA[In a recent paper, presented at the ACM Conference on Computer Supported Cooperative Work (CSCW), John Horton, Daniel Chen and I used a large-scale experiment to test the effect of different incentive schemes on the quality of crowdsourcing work. The results surprised us. They suggest that workers perform most accurately when the task design credibly [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/" data-text="Designing Incentives for Crowdsourcing Workers" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/" data-counter="top"></script></div></div><p>In a <a title="Designing Incentives for Inexpert Human Raters, Berkman Center" href="http://cyber.law.harvard.edu/publications/2011/Designing_Incentives_Inexpert_Human_Raters">recent paper</a>, presented at the ACM Conference on Computer Supported Cooperative Work (CSCW), <a title="John Horton, oDesk" href="https://sites.google.com/site/johnjosephhorton/">John Horton</a>, <a title="Daniel Chen, Duke Law School" href="http://www.law.duke.edu/fac/chen">Daniel Chen</a> and <a title="Aaron Shaw, UC Berkeley &amp; Harvard" href="http://aaronshaw.org">I</a> used a large-scale experiment to test the effect of different incentive schemes on the quality of crowdsourcing work.</p>
<p>The results surprised us. They suggest that workers perform most accurately when the task design credibly links payoffs to a worker&#8217;s ability to think about the answers that their peers are likely to provide.</p>
<p style="text-align: center;">
<div id="attachment_2577" class="wp-caption aligncenter" style="width: 549px"><a href="http://www.flickr.com/photos/iyoupapa/"><img class="size-full wp-image-2577 " title="Horserace!" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/3757438159_horserace-iyoupapa-altered.jpg" alt="Horserace!" width="539" height="264" /></a><p class="wp-caption-text">a horserace experiment! (photo cc-by-sa by iyoupapa)</p></div>
<p><span id="more-2572"></span></p>
<p>The idea for this study came out of our sense that, as social scientists, we had something unique to offer the existing research on human computation. <a title="AMT is fast, cheap, and good for machine learning data" href="http://blog.crowdflower.com/2008/09/amt-fast-cheap-good-machine-learning/">Early</a> and <a title="&quot;Get Another Label?&quot; Ipeirotis et al. 2008" href="http://archive.nyu.edu/handle/2451/25882">influential</a> crowdsourcing research has focused on how to filter the judgments of the crowd to find the best answers. We wanted to know whether simple task-design changes could improve the quality of data coming into a crowdsourcing system in the first place.</p>
<p>To test this idea, we chose 14 different incentive schemes and framing techniques developed and validated across the social sciences and set up a horse race experiment to see which schemes/techniques would work best.</p>
<p>Consistent with our personal biases (John and Daniel are both economists, and I&#8217;m a sociologist), some of the schemes were financially oriented, some were social or psychological, and some were hybrids combining social and financial incentives. The details of all the schemes are included <a title="Designing Incentives for Inexpert Human Raters" href="http://cyber.law.harvard.edu/publications/2011/Designing_Incentives_Inexpert_Human_Raters">in the paper</a> (it&#8217;s a long list, and some of them are kind of involved), but it&#8217;s worth giving some examples.</p>
<p>On the financial end of the incentives spectrum, we had one condition we called &#8220;reward-accuracy,&#8221; which was pretty much what you&#8217;d expect: we told workers, &#8220;we&#8217;ll pay you a bonus if you get the answers right.&#8221; We also had one called &#8220;punishment-accuracy,&#8221; the gist of which you can deduce. On the purely social-psychological side, we had one we called &#8220;trust,&#8221; in which we told workers, &#8220;we&#8217;ll pay you for this job no matter how bad your performance, we trust that you&#8217;ll still make your best effort.&#8221;</p>
<p>One of the weirdest schemes turns out to be important, so I need to explain that one. Called &#8220;Bayesian Truth Serum&#8221; (BTS), it incorporates a design from the work of <a title="Drazen Prelec" href="http://econ-www.mit.edu/faculty/dprelec">Drazen Prelec</a>, a behavioral economist at MIT, who realized that research subjects could probably provide useful information regarding the expected distribution for subjective, qualitative questions (<em>nb</em>, the mechanics of how he does this are arcane in a way that is almost sure to delight the geeks among you, so I encourage you to <a title="Bayesian Truth Serum" href="http://econ-www.mit.edu/files/1966">read his paper</a>). Few of the details of <em>real</em> BTS are important, except that we incorporated the piece about asking workers to answer the questions themselves <em>and predict the distribution of other workers&#8217; responses</em>. We also told them we&#8217;d give them a bonus if their predictions were correct.</p>
<p>We then created a task that asked workers to answer five questions. In this case, the questions were drawn from another study examining participatory features of websites, for which we already possessed validated data collected by research assistants.</p>
<p>All workers answered the same five questions about the same website (<a href="http://www.kiva.org">www.kiva.org</a>) while being exposed to one and only one of the 14 incentive schemes (or a control condition of no scheme). Roughly 2,000 individuals participated in the study, resulting in over 100 subjects in each of the experimental conditions. (The statistics and science nerds out there will be pleased to know that both the drop-out rate and demographic covariates were distributed evenly across conditions.)</p>
<p>To measure worker performance, we used the research assistant responses as correct answers to the questions and then calculated the total number of matching answers (out of five) provided by each worker. The results (aggregated across all treatments) are plotted in a histogram below and show that the average worker answered just over two questions out of five correctly.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/aggperf/" rel="attachment wp-att-2578"><img class="aligncenter size-full wp-image-2578" title="Inexpert raters - Aggregate Performance" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/AggPerf.png" alt="Aggregate performance histogram" width="280" height="280" /></a></p>
<p>&nbsp;</p>
<p>Then, in order to see how the treatments compared against each other relative to the control group, we calculated the mean correct response rate for each condition and conducted difference of means tests to see which of these means were significantly greater than the control group. The results of this comparison appear below (in a new plot that doesn&#8217;t even appear in the paper!):</p>
<p><a href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/inexpert-itt/" rel="attachment wp-att-2579"><img class="aligncenter size-full wp-image-2579" title="inexpert raters - ITT estimates" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/inexpert-ITT.png" alt="ITT estimates per treatment" width="500" height="500" /></a></p>
<p>The orange dots show the value of the mean in each condition, and the blue bars illustrate the 95% confidence interval around that mean. The treatments are sorted by the size of the difference in means from the control. (More hard-core nerd stuff: the means are adjusted using Intent-To-Treat estimators).</p>
<p>From these results, we concluded that our horse race had two clear front-runners: the &#8220;Bayesian Truth Serum&#8221; (BTS) and &#8220;Punishment &#8211; disagreement&#8221; conditions, each of which improved average worker performance by almost half of a correct answer above the 2.08 correct answers in the control group. A few of the other financial and hybrid incentives had fairly large point estimates, but were not significantly different from control once we adjusted the test statistics and corresponding p-values to account for the fact that we were making so many comparisons at once (apologies if this doesn&#8217;t make sense — it&#8217;s yet another precautionary measure to avoid upsetting the stats nerds among you). In a tough turn for the sociologists and psychologists, none of the purely social/psychological treatments had any signficant effects at all.</p>
<p>Why do BTS and punishing workers for disagreement succeed in improving performance significantly where so many of the other incentive schemes failed? The answer hinges on the fact that both conditions tied workers&#8217; payoffs to their ability to think about their peers&#8217; likely responses. (We elaborate on the argument in more detail in the paper.)</p>
<p>Does this mean that we should give up on simple financial or social-psychological incentives? Probably not. The fact that we conducted the experiment on MTurk means that the deck may have been stacked against incentives like the &#8220;trust&#8221; condition I described earlier. Because requesters on MTurk have little oversight, workers are more likely to respond to financial incentives than stated promises. In this sense, the marketplace has structured the interaction between workers and requesters in a way that may limit the opportunities to harness motivations that are not linked to money in some explicit way.</p>
<p>You can <a title="Designing Incentives for Inexpert Human Raters" href="http://cyber.law.harvard.edu/sites/cyber.law.harvard.edu/files/Shaw-Horton-Chen_Designing_Incentives_Inexpert_Human_Raters_2011.pdf">download the full paper</a> to read more.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Announcing the TREC 2011 Crowdsourcing Track, co-sponsored by CrowdFlower</title>
		<link>http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/</link>
		<comments>http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/#comments</comments>
		<pubDate>Tue, 26 Apr 2011 19:46:39 +0000</pubDate>
		<dc:creator>Vaughn Hester</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2411</guid>
		<description><![CDATA[We&#8217;re proud to announce CrowdFlower&#8217;s participation in one of the most prestigious academic conferences worldwide. The 20th Annual Text REtrieval Conference (TREC 2011), organized by the National Institute of Standards (NIST), will take place Nov 15-18, 2011, at NIST&#8217;s campus in Gaithersburg, MD. TREC 2011 will feature a new Crowdsourcing Track that will investigate: How [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/" data-text="Announcing the TREC 2011 Crowdsourcing Track, co-sponsored by CrowdFlower" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/"></g:plusone></div></div><p>We&#8217;re proud to announce CrowdFlower&#8217;s participation in one of the most prestigious academic conferences worldwide.</p>
<p>The 20th Annual <a href="http://trec.nist.gov/">Text REtrieval Conference (TREC 2011)</a>, organized by the National Institute of Standards (NIST), will take place Nov 15-18, 2011, at NIST&#8217;s campus in Gaithersburg, MD. TREC 2011 will feature a new <a href="https://sites.google.com/site/treccrowd2011/">Crowdsourcing Track</a> that will investigate:</p>
<ul>
<li>How to obtain high-quality relevance judgments from individual crowd workers;</li>
<li>How to effectively compute consensus judgments from individual judgments;</li>
<li>Interaction between these (i.e., worker accuracy vs. subsequent consensus accuracy achieved).</li>
</ul>
<p><span id="more-2411"></span></p>
<p>Teams can choose to participate in either or both tasks: (1) collecting judgments and (2) computing consensus.  We plan to evaluate using both ranking and classification metrics, giving teams the flexibility to focus on either one or both.</p>
<p>We welcome participation from those who have never before participated in TREC; note there is an agreement form to submit in advance; the <a href="http://ir.nist.gov/trecsubmit.open/application.html">form</a> is available from the track website.</p>
<p>View the guidelines <a href="https://sites.google.com/site/treccrowd2011/home/TREC2011CrowdsourcingTrack.pdf?attredirects=0&#038;d=1">here</a>.</p>
<p><strong>Tentative Schedule</strong></p>
<p><strong>May 6, 2011:</strong> Initial deadline for participating teams to email organizers with team name and<br />
intent to participate (will affect the partition of data distributed to all participants)</p>
<p><strong>May 13:</strong> Finalized guidelines released, Stage 1 training sets distributed to participants</p>
<p><strong>May 20:</strong> Stage 2 training set distributed to participants</p>
<p><strong>August 1:</strong> Release of Stage 1 test data; submission system opens immediately</p>
<p><strong>August 8:</strong> Stage 1 submissions due</p>
<p><strong>August 15:</strong> Stage 2 test topics distributed</p>
<p><strong>August 22:</strong> Stage 2 submissions due</p>
<p><strong>~September 15:</strong> Preliminary results announced to participants</p>
<p><strong>~October 15:</strong> Team papers due to NIST for inclusion in TREC conference<br />
notebook</p>
<p><strong>November 15-18:</strong> Text REtrieval Conference (TREC 2011) in Gaithersburg, MD</p>
<p><strong>~February 1, 2012:</strong> Final team papers due to NIST for inclusion in TREC conference proceedings</p>
<p>For more information, please contact Track Organizers <a href="http://www.gabriella-kazai.com/">Gabriella Kazai</a>  (Microsoft Research) or <a href="http://www.ischool.utexas.edu/~ml/">Matthew Lease</a> (University of Texas at Austin).</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing Meetup at Stanford Institute of Design</title>
		<link>http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/</link>
		<comments>http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/#comments</comments>
		<pubDate>Fri, 22 Apr 2011 22:49:02 +0000</pubDate>
		<dc:creator>Mollie Allick</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2396</guid>
		<description><![CDATA[Join us at the Stanford Institute of Design on Tuesday, May 3, for a night of networking, interactive presentations, and thought-provoking conversations. The following speakers will discuss developments in modern crowdsourcing work, how this impacts the world of design, and demonstrate their latest findings: Speakers: Ed Chi, Research Scientist, Google Gagan Palrecha, CEO and Co-Founder, [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/" data-text="Crowdsourcing Meetup at Stanford Institute of Design" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/"></g:plusone></div></div><div id="attachment_1357" class="wp-caption alignnone" style="width: 522px"><a rel="attachment wp-att-1357" href="http://blog.crowdflower.com/2010/09/crowdsourcing-work-meetup-south-bay-edition/crowd/"><img class="size-full wp-image-1357" title="crowd" src="http://blog.crowdflower.com/wp-content/uploads/2010/09/crowd.jpg" alt="a crowd, courtesy of PictFactory" width="512" height="341.5" /></a><p class="wp-caption-text">Photo of crowd, courtesy of PictFactory</p></div>
<p>Join us at the Stanford Institute of Design on Tuesday, May 3, for a night of networking, interactive presentations, and thought-provoking conversations. </p>
<p>The following speakers will discuss developments in modern crowdsourcing work, how this impacts the world of design, and demonstrate their latest findings:</p>
<p><span id="more-2396"></span><br />
Speakers:</p>
<p><a href="https://profiles.google.com/ed.h.chi/about">Ed Chi</a>, Research Scientist, Google</p>
<p><a href="http://www.linkedin.com/in/gaganpalrecha?goback=.cps_1247149767187_1">Gagan Palrecha</a>, CEO and Co-Founder, <a href="http://www.chirply.com/">Chirply</a></p>
<p><a href="http://www.stanford.edu/~spdow/">Steven Dow</a>, Postdoctoral Scholar, Stanford</p>
<p><a href="http://www.adobe.com/technology/people/sanfrancisco/dontcheva.html">Mira Dontcheva</a>, Research Scientist, Adobe</p>
<p><a href="http://bjoern.org/">Bjoern Hartmann</a>, Professor, UC Berkeley</p>
<p><a href="http://mattmireles.com/">Matt Mireles</a>, CEO, <a href="http://speakertext.com/">SpeakerText</a> </p>
<p><strong>Where:</strong><br />
Stanford <a href="http://dschool.stanford.edu/">d.school</a><br />
Building 550<br />
416, Escaondido Mall<br />
Stanford, CA</p>
<p>There are no valet restrictions after 4pm, so parking will be readily available. The nearest parking structure is at the corner of <a href="http://maps.google.com/maps?client=safari&#038;q=via+ortega+and+panama+street&#038;oe=UTF-8&#038;ie=UTF8&#038;hq=&#038;hnear=Panama+St+%26+V%C3%ADa+Ortega,+Stanford,+Santa+Clara,+California+94305&#038;gl=us&#038;z=16">Via Ortega and Panama Street</a>.</p>
<p>When:<br />
May from 6:00 &mdash; 8:30 p.m. Speakers start at 6:30 p.m.</p>
<p>Please <a href="http://www.meetup.com/Distributed-Work/events/17391066/">RSVP</a> to guarantee your space. </p>
<p>Interested in hosting the next meetup? Have suggestions for speakers? Please <a href="mailto:mollie@crowdflower.com">email me</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/04/crowdsourcing-meetup-at-stanford-institute-of-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CrowdFlower and Citizen Effect: So Much Madness</title>
		<link>http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/</link>
		<comments>http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/#comments</comments>
		<pubDate>Fri, 08 Apr 2011 17:47:02 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Charity]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Office Life]]></category>
		<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2333</guid>
		<description><![CDATA[After hearing that our friends at Citizen Effect were sponsoring Brackets with Benefits, a cool new March Madness pool where the proceeds from all the gambling go to one of the many awesome community development projects they support, we thought it would be fun to get involved. Considering how most people** watched their brackets go [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/" data-text="CrowdFlower and Citizen Effect: So Much Madness" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/"></g:plusone></div></div><p>After hearing that our friends at <a href="http://www.citizeneffect.org" target="_blank">Citizen Effect</a> were sponsoring Brackets with Benefits, a cool new March Madness pool where the proceeds from all the gambling go to one of the many awesome community development projects they support, we thought it would be fun to get involved. Considering how most people** watched their brackets go up in flames by the time the Sweet Sixteen rolled around, we thought it would be fun to give everyone a second chance to play.</p>
<p>We teamed up with Citizen Effect to invite all Brackets with Benefits participants, as well as the general public, to pick the results of the Sweet Sixteen and Final Four, a total of 10 games. We had one clear winner, who correctly predicted the outcome of 8 games (and missed Duke/Arizona and Florida/BYU, for those keeping track at home) and will be going home with a fancy new camcorder. For what it&#8217;s worth, the aggregated response of the crowd correctly picked 7 games.</p>
<p><a rel="attachment wp-att-2334" href="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/bwb_picks/"><img class="aligncenter size-full wp-image-2334" title="bwb_picks" src="http://blog.crowdflower.com/wp-content/uploads/2011/04/bwb_picks.jpg" alt="" width="530" height="387" /></a></p>
<p>In a tournament full of upsets, there were two games that surprised our participants most. Not surprisingly, these were the games where the two #1 seeds (Duke and Ohio State) lost in the Sweet Sixteen, which just under 10 percent of people picked. There was one person from Glenolden, PA (perhaps an embittered Villanova fan?) who picked both Duke and Ohio State to lose, but ended up picking only three other games correctly.</p>
<p>All in all, a fun little experiment. We want to send a big thanks to our friends over at Citizen Effect for putting together the pool and inviting us to participate.</p>
<hr />
**This doesn&#8217;t include our VP of Engineering, a proud Butler alum who took his alma mater all the way to the championship. Sorry, Brian, we all wanted your boys to win, too.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>March Meetup Video</title>
		<link>http://blog.crowdflower.com/2011/03/march-meetup-video/</link>
		<comments>http://blog.crowdflower.com/2011/03/march-meetup-video/#comments</comments>
		<pubDate>Thu, 31 Mar 2011 01:44:09 +0000</pubDate>
		<dc:creator>Mollie Allick</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Meetup]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Office Life]]></category>
		<category><![CDATA[Translation]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2316</guid>
		<description><![CDATA[Couldn&#8217;t make the March Meetup at CrowdFlower? Check out the video highlights below. If you like what you see above, join our Crowdsourcing Work Meetup Group for news on upcoming Meetups.]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/03/march-meetup-video/" data-text="March Meetup Video" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/03/march-meetup-video/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/03/march-meetup-video/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/03/march-meetup-video/"></g:plusone></div></div><p>Couldn&#8217;t make the March Meetup at CrowdFlower? Check out the video highlights below.</p>
<p><iframe src="http://player.vimeo.com/video/21709430" width="400" height="300" frameborder="200"></iframe></p>
<p>If you like what you see above, join our <a href="http://www.meetup.com/Distributed-Work/">Crowdsourcing Work Meetup Group</a> for news on upcoming Meetups.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/03/march-meetup-video/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing Meetup &#8212; March edition</title>
		<link>http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/</link>
		<comments>http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/#comments</comments>
		<pubDate>Tue, 22 Mar 2011 16:32:02 +0000</pubDate>
		<dc:creator>Mollie Allick</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Meetup]]></category>
		<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2296</guid>
		<description><![CDATA[It&#8217;s meetup time again here at CrowdFlower. We will have a great lineup of speakers here this Wednesday, March 23. Come by to see: Wyatt Nordstrom, CEO at&#160;Maven Research Jacob Colker, CEO and Co-Founder at&#160;Sparked.com Ray Solnik, Customer Development In Action Nathan Waterhouse, Co-Founder at&#160;OpenIDEO Omar Alonso, Technical Lead at&#160;Bing, Microsoft Brian McConnell, Co-Founder at&#160;Der [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/" data-text="Crowdsourcing Meetup &#8212; March edition" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/"></g:plusone></div></div><div id="attachment_1357" class="wp-caption alignnone" style="width: 522px"><a rel="attachment wp-att-1357" href="http://blog.crowdflower.com/2010/09/crowdsourcing-work-meetup-south-bay-edition/crowd/"><img class="size-full wp-image-1357" title="crowd" src="http://blog.crowdflower.com/wp-content/uploads/2010/09/crowd.jpg" alt="a crowd, courtesy of PictFactory" width="512" height="341.5" /></a><p class="wp-caption-text">Photo of crowd, courtesy of PictFactory</p></div>
<p>It&#8217;s meetup time again here at CrowdFlower.  We will have a great lineup of speakers here this Wednesday, March 23. Come by to see:</p>
<p><span id="more-2296"></span></p>
<p>  <a href="http://www.linkedin.com/in/wyattnordstrom">Wyatt Nordstrom</a>, CEO at&nbsp;<a href="http://www.mavenresearch.com/">Maven Research</a></p>
<p>  <a href="http://www.linkedin.com/in/jacobcolker">Jacob Colker</a>, CEO and Co-Founder at&nbsp;<a href="http://www.sparked.com/">Sparked.com</a></p>
<p>  <a href="http://www.linkedin.com/in/raysolnik">Ray Solnik</a>, Customer Development In Action</p>
<p>  <a href="http://www.linkedin.com/in/nathanwaterhouse">Nathan Waterhouse</a>, Co-Founder at&nbsp;<a href="http://openideo.com/">OpenIDEO</a></p>
<p>  <a href="http://www.linkedin.com/profile/view?id=569789&amp;authType=name&amp;authToken=gGID&amp;pvs=pp&amp;trk=ppro_viewmore">Omar Alons</a><a href="http://wwwcsif.cs.ucdavis.edu/~alonsoom/">o</a>, Technical Lead at&nbsp;<a href="http://www.bing.com/">Bing</a>, Microsoft</p>
<p>  <a href="http://www.linkedin.com/in/briansmcconnell">Brian McConnell</a>, Co-Founder at&nbsp;<a href="http://www.dermundo.com/">Der Mundo</a></p>
<p>Where:<br />
<a href="http://crowdflower.com/"> CrowdFlower</a><br />
2111 Mission Street, Suite 302<br />
San Francisco, CA 94110</p>
<p>The office is one block South of the 16th Street Mission BART Station.<br />
For a train schedule, please see: <a href="http://www.bart.gov/schedules/bystation.aspx  ">http://www.bart.gov/schedules/bystation.aspx</a></p>
<p>When:<br />
March 23 from 5:30–8:30 p.m.<br />
Speakers start at 6:30 p.m.</p>
<p>Please <a href="http://www.meetup.com/Distributed-Work/events/16878378/">RSVP</a> to guarantee your space.</p>
<p>Hope you can make it,</p>
<p>Mollie Allick<br />
CrowdFlower</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/03/crowdsourcing-meetup-march-edition/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Oscar Fever: The Sequel!</title>
		<link>http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/</link>
		<comments>http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/#comments</comments>
		<pubDate>Fri, 04 Mar 2011 22:51:03 +0000</pubDate>
		<dc:creator>Patrick Philips and Joseph Childress</dc:creator>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Wisdom of Small Crowds]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2190</guid>
		<description><![CDATA[The votes are in from our Oscar crowdsourcing experiment, and the crowd successfully picked the winners of 14 of the academy awards. For reference, Roger Ebert got 15 predictions correct so we&#8217;d have to conclude that the crowd performed reasonably well at predicting the winners of this glorified popularity contest. One fascinating thing about aggregating responses [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/" data-text="Oscar Fever: The Sequel!" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/"></g:plusone></div></div><p>The votes are in from <a href="http://blog.crowdflower.com/2011/02/oscar-fever/" target="_blank">our Oscar crowdsourcing experiment</a>, and the crowd successfully picked the winners of 14 of the academy awards. For reference, <a href="http://rogerebert.suntimes.com/apps/pbcs.dll/article?AID=/20110210/OSCARS/110219999" target="_blank">Roger Ebert got 15 predictions correct</a> so we&#8217;d have to conclude that the crowd performed reasonably well at predicting the winners of this glorified popularity contest.</p>
<div id="attachment_2191" class="wp-caption aligncenter" style="width: 808px"><a rel="attachment wp-att-2191" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/actual_results/"><img class="size-full wp-image-2191" title="actual_results" src="http://blog.crowdflower.com/wp-content/uploads/2011/03/actual_results.jpg" alt="movie picks" width="798" height="414" /></a><p class="wp-caption-text">Predicted and Actual Winners of the 2011 Academy Awards</p></div>
<p><span id="more-2190"></span><br />
One fascinating thing about aggregating responses is that the crowd as a whole will often outperform the average worker. In this case, among the 500 people we polled, the majority of respondents picked fewer than 10 awards correctly (mean of 9.6 and median of 9). And yet, by aggregating all the responses, such that the nominee with the most &#8220;votes&#8221; is predicted to win, the crowd as a whole correctly picked 14 awards. While the &#8220;wisdom of crowds&#8221; doesn&#8217;t come as much of a surprise, it&#8217;s always reassuring to see it confirmed in new applications.</p>
<p><a rel="attachment wp-att-2199" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/correct_histogram1/"><img class="aligncenter size-full wp-image-2199" title="correct_histogram1" src="http://blog.crowdflower.com/wp-content/uploads/2011/03/correct_histogram1.jpg" alt="" width="614" height="445" /></a></p>
<p>As we noted in<a href="http://blog.crowdflower.com/2011/02/oscar-fever/"> our earlier post</a>, though, the more interesting question was whether workers who indicated higher confidence in their responses would outperform workers with lower confidence. Looking at the results, however, we saw no significant correlation between a worker&#8217;s predicted accuracy and  actual performance.</p>
<div id="attachment_2222" class="wp-caption aligncenter" style="width: 823px"><a rel="attachment wp-att-2222" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/scatts/"><img class="size-full wp-image-2222" title="scatts" src="http://blog.crowdflower.com/wp-content/uploads/2011/03/scatts.jpg" alt="" width="813" height="547" /></a><p class="wp-caption-text">&quot;Squint all you want, but there&#39;s no pattern&quot;</p></div>
<p>While it&#8217;s certainly possible that we didn&#8217;t offer enough of an incentive for workers to estimate their own accuracy, the more likely explanation is that predicting the winners of the Oscars is not something that a person can do with any degree of certainty. Confident or not, the people we polled did not see the &#8220;Inside Job&#8221; coming.</p>
<p>As a final exercise, we ran a regression on every explanatory variable we could find, including what state workers came from, what day they made their predictions, whether they made their predictions during the day or at night, how long they spent making their predictions and even their historical accuracy on other CrowdFlower tasks. The only variable with any significance turned out to be how long they spent on making their predictions, and while it was significant (at p=0.001), no model we could come up with explained more than 5 percent of the total variation in accuracy.</p>
<p>While the wisdom of crowds seems to extend to picking Oscar winners, the more interesting experiment of having workers self-select as trustworthy is ongoing. In the future, it would be worthwhile to repeat this experiment with questions that can be answered objectively and without uncertainty (solving algebra problems seems like a good candidate), to see if any correlation emerges between predicted and actual accuracy.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

