<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The CrowdFlower Blog &#187; Experiments</title>
	<atom:link href="http://blog.crowdflower.com/topics/experiments/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.crowdflower.com</link>
	<description></description>
	<lastBuildDate>Tue, 10 Jan 2012 20:00:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Did you say &#8220;Great!&#8221;, or &#8220;Oh Great!&#8221;?</title>
		<link>http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/</link>
		<comments>http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/#comments</comments>
		<pubDate>Mon, 14 Nov 2011 19:24:39 +0000</pubDate>
		<dc:creator>Jodie Ellis</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[automated sentiment analysis]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsorcing]]></category>
		<category><![CDATA[crowdsource]]></category>
		<category><![CDATA[crowdsourced]]></category>
		<category><![CDATA[herman cain]]></category>
		<category><![CDATA[sentiment analysis]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4240</guid>
		<description><![CDATA[Being tapped to write a blog post here at CrowdFlower is usually left to the experts. So with that, let me begin by making the disclaimer that I am neither a political analyst nor a data scientist. But I do have a personal fervor for politics and access to some impressive tools, thanks to my job [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/" data-text="Did you say &#8220;Great!&#8221;, or &#8220;Oh Great!&#8221;?" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/"></g:plusone></div></div><div class="wp-caption alignright" style="width: 281px"><a href="http://www.empowernewsmag.com/userfiles/cain.jpg"><img class=" " title="Cain" src="http://www.empowernewsmag.com/userfiles/cain.jpg" alt="" width="271" height="361" /></a><p class="wp-caption-text">Image by Sarah Butrymowicz</p></div>
<p>Being tapped to write a blog post here at CrowdFlower is usually left to the experts. So with that, let me begin by making the disclaimer that I am neither a political analyst nor a data scientist. But I do have a personal fervor for politics and access to some impressive tools, thanks to my job here at <a title="the leader in enterprise crowdsourcing" href="http://crowdflower.com/" target="_blank">CrowdFlower</a>.</p>
<p>For those who aren&#8217;t familiar with CrowdFlower, we specialize in tapping human contributors worldwide to do massive amounts of simple, repetitive tasks (especially tasks that are hard for computers to do by themselves). Here&#8217;s a <a title="How It Works!" href="http://vimeo.com/26878855">quick how-it-works animation</a>.</p>
<p>I had been reading some old blog posts on the CrowdFlower blog when I came across an interesting <a title="crowdsourcing media bias" href="http://blog.crowdflower.com/2008/03/crowdsourcing-to-find-media-bias-hillary-vs-obama/" target="_blank">2008 post on election media bias</a>.</p>
<p>I determined that this could be a great opportunity to revisit sentiment analysis, and specifically set out to see if automated sentiment detection tools vs. human assessments could yield any blog-worthy findings.</p>
<p>To see how far the automated sentiment tools have come, I began by using an enterprise-grade social media monitoring tool that provides sentiment analysis.</p>
<p>I ran a few quick monitoring searches of my own to see how the current Republican Primary election was tracking — it seemed a topical place that would be chock full of good commentary.</p>
<p>The instant access to well-organized data from blogs, news sources, and a variety of social media sources was outstanding.</p>
<p>However, I was surprised to find that for each search I conducted, <strong>the automated sentiment detection tool consistently returned an overwhelming proportion of &#8220;Neutral&#8221; ratings (frequently exceeding 90%)</strong>. This seemed funny to me, given the typically emotive nature of politics.<span id="more-4240"></span></p>
<p><strong>It&#8217;s important to note that this particular tool uses a default value of &#8220;Neutral&#8221; for any post it cannot interpret.</strong></p>
<p>A particularly interesting subset of the data was several thousand tweets about Herman Cain immediately following the news of alleged sexual harassment by Cain during his time as leader of the National Restaurant Association. Surely this would yield some sentiment-rich commentary that even machines couldn&#8217;t resist tagging.</p>
<p>For the posts about Herman Cain on Oct 31st, here is what the machine detected on just under 3,000 posts:</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/wp-content/uploads/2011/11/cainauto.png"><img class="aligncenter size-full wp-image-4629" title="cainauto" src="http://blog.crowdflower.com/wp-content/uploads/2011/11/cainauto.png" alt="" width="513" height="306" /></a></p>
<p>Naturally, I took to the CrowdFlower platform and decided I would run the same data through a simple sentiment analysis workflow.  With the help of our team of crowdsourcing gurus, I utilized some simple, but effective best practices to control for quality (you can get a good overview <a title="crowdsourcing quality control" href="http://blog.crowdflower.com/2011/10/stopworrying/" target="_blank">here</a>). Here is what the CrowdFlower contributors detected:</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/wp-content/uploads/2011/11/caincf.png"><img class="aligncenter size-full wp-image-4630" title="caincf" src="http://blog.crowdflower.com/wp-content/uploads/2011/11/caincf.png" alt="" width="527" height="324" /></a></p>
<p>Here are just a couple of posts marked &#8220;Neutral&#8221; by the machine and &#8220;Negative&#8221; and &#8220;Positive&#8221;, respectively, by CrowdFlower contributors:</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/wp-content/uploads/2011/11/caintweet1.png"><img class="aligncenter size-full wp-image-4631" title="caintweet1" src="http://blog.crowdflower.com/wp-content/uploads/2011/11/caintweet1.png" alt="" width="505" height="204" /></a><a href="http://blog.crowdflower.com/wp-content/uploads/2011/11/caintweet2.png"><img class="aligncenter size-full wp-image-4632" title="caintweet2" src="http://blog.crowdflower.com/wp-content/uploads/2011/11/caintweet2.png" alt="" width="478" height="142" /></a></p>
<h2>Takeaways</h2>
<p>A spot check of the results on the automated set confirmed that when the machine actually tagged a post as positive or negative, it was usually very accurate (good precision).</p>
<p>However, <strong>the large amount of data that the machine was unable to make a determination on suggests that the pervasive problem of &#8216;recall&#8217; is still the big challenge with automated sentiment detection.</strong></p>
<p><strong></strong>This graph illustrates the recall difference a bit more clearly. The need for human analysis when dealing with the subtleties of language could not be more apparent.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/wp-content/uploads/2011/11/caintweets.png"><img class="aligncenter size-full wp-image-4633" title="caintweets" src="http://blog.crowdflower.com/wp-content/uploads/2011/11/caintweets.png" alt="" width="650" height="254" /></a></p>
<p><strong>Automated Tool</strong>: Good precision. Poor recall.</p>
<p><strong>CrowdFlower Tool</strong>: Good precision. Good recall.</p>
<h2>Sentiment Analysis is Insightful AND Entertaining</h2>
<p>In addition to the Herman Cain Twitter data, I looked at headlines, blogs, and a broad swath of social media commentary on all the candidates. The conclusion I can draw from my effort is that sentiment detection, is indeed, still a very challenging problem to solve through automation.</p>
<p>This is consistent with what I see here at CrowdFlower daily — in today&#8217;s data-wealthy world, there are countless tasks that require human attention (good to know if my blogging career never gets off the ground).</p>
<p>Hopefully I&#8217;ll get the chance to continue exploring the sentiment about topical news as it breaks, and will look forward to sharing future findings.</p>
<p>Have experience monitoring sentiment? Let us know if this is consistent with what you&#8217;ve seen. Leave a comment.</p>
<p style="text-align: center;">***</p>
<p>To find out more about how CrowdFlower technology is used for sentiment analysis and a wide range of other human powered projects, visit the <a title="enterprise crowdsourcing products" href="http://crowdflower.com/products" target="_blank">CrowdFlower products page</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/11/crowdsourcing-sentiment-analysis-herman-cain/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing and Retention: From First-Timers to Seasoned Veterans</title>
		<link>http://blog.crowdflower.com/2011/10/seasonedveterans/</link>
		<comments>http://blog.crowdflower.com/2011/10/seasonedveterans/#comments</comments>
		<pubDate>Tue, 25 Oct 2011 20:10:31 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[engagement]]></category>
		<category><![CDATA[gold]]></category>
		<category><![CDATA[insights]]></category>
		<category><![CDATA[judgment]]></category>
		<category><![CDATA[loyal]]></category>
		<category><![CDATA[retention]]></category>
		<category><![CDATA[veterans]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2589</guid>
		<description><![CDATA[Millions of people have participated in our tasks over the last few years, and tens of thousands of people are active at any given moment. However, crowdsourcing is not a traditional engagement model. Tasks are elective, which means people are free to come and go as they please. It&#8217;s a fair question, then, to ask whether they [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/10/seasonedveterans/" data-text="Crowdsourcing and Retention: From First-Timers to Seasoned Veterans" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/10/seasonedveterans/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/10/seasonedveterans/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/10/seasonedveterans/"></g:plusone></div></div><p>Millions of people have participated in our tasks over the last few years, and tens of thousands of people are active at any given moment. However, crowdsourcing is not a traditional engagement model. Tasks are elective, which means people are free to come and go as they please. It&#8217;s a fair question, then, to ask whether they keep coming back.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/10/seasonedveterans/comebacksoon/" rel="attachment wp-att-3929"><img class="aligncenter size-full wp-image-3929" title="Come Back Soon" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/comebacksoon.jpg" alt="crowdsourcing " width="486" height="334" /></a></p>
<p><em>Do people perform tasks only fleetingly, or has crowdsourcing become more of a long-term engagement? </em><em>Furthermore, just how important is contributor retention in the world of crowdsourcing?</em></p>
<p>While a majority of people fall into the &#8220;one-and-done&#8221; camp, many of the most productive contributors tend to have participated in previous jobs. Within any single job, these seasoned veteran contributors also provide far more work than their less experienced counterparts.</p>
<p><span id="more-2589"></span></p>
<p>Over a period of two months, we ran a series of five very similar jobs, retrieving ratings information for businesses throughout North America. In total, we collected over half a million judgments from a total of 2,901 unique contributors,<sup><a href="#footnote-1">1</a></sup> representing multiple labor channels and 101 countries. As a first test to whether seasoned veterans are common, we looked at how many people participated in more than one job. In total, 2,389 people participated in one job only, meaning that First-Timers accounted for over 82 percent of all contributors.</p>
<p>But this may not be the best number to look at. We&#8217;re really interested in whether certain people recognize and seek out specific types of tasks after having worked on them before. To look for this behavior, we analyzed the most recent job, counting how many people participated in at least one prior iteration of the task.</p>
<p>For a recent job, a total of 906 people participated, 247 of whom had done the task previously. By this measure, Seasoned Vets constitute approximately 27% of the workforce. While the impact of returning contributors is greater under this methodology, the fact remains that these &#8220;loyal&#8221; contributors are firmly in the minority on this series of jobs.</p>
<div id="attachment_3869" class="wp-caption aligncenter" style="width: 631px"><a href="http://blog.crowdflower.com/2011/10/seasonedveterans/contributor_count/" rel="attachment wp-att-3869"><img class="size-full wp-image-3869 " title="contributor_count" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/contributor_count.jpg" alt="crowdsourcing" width="621" height="395" /></a><p class="wp-caption-text">Count of Contributors by Profile</p></div>
<p>&nbsp;</p>
<p>However, <a title="crowdsourcing" href="http://blog.crowdflower.com/2010/12/good-work-knows-no-boundaries/" target="_blank">as we&#8217;ve seen before</a>, the individual impact of contributors varies widely, with a minority of people often providing the vast majority of work. With this in mind, we looked at the contributions of First-Timers and Seasoned Vets and found some striking differences.</p>
<div id="attachment_3870" class="wp-caption aligncenter" style="width: 598px"><a href="http://blog.crowdflower.com/2011/10/seasonedveterans/contributor_share/" rel="attachment wp-att-3870"><img class="size-full wp-image-3870 " title="contributor_share" src="http://blog.crowdflower.com/wp-content/uploads/2011/10/contributor_share.jpg" alt="crowdsourcing" width="588" height="396" /></a><p class="wp-caption-text">Share of Contributions by Profile</p></div>
<p>&nbsp;</p>
<p><strong>Seasoned Vets, while constituting only 27% of the workforce, provided 47% of the total work completed.</strong> <strong>On average, each Seasoned Veteran provided 2.5 times more judgments than their less experienced counterparts.</strong> It&#8217;s also interesting to note that there was no significant difference between the quality of work provided by First-Timers and Seasoned Vets, no doubt due to our suite of <a title="Enterprise Crowdsourcing or: How I learned to stop worrying and trust the crowd" href="http://blog.crowdflower.com/2011/10/stopworrying/">quality control measures</a>.</p>
<p>Given that the people who stick around tend to be far more productive, improving retention is a useful consideration (for these jobs, at least). We&#8217;re now interested in how best to attract people to return to the types of jobs they&#8217;ve already seen. That, however, is a work in progress and a story for another day.</p>
<hr />
<p id="footnote-1" style="text-align: -webkit-auto;">1. Note that this analysis only considers Trusted workers.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/10/seasonedveterans/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing a Map, to Eat</title>
		<link>http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/</link>
		<comments>http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/#comments</comments>
		<pubDate>Fri, 16 Sep 2011 17:15:26 +0000</pubDate>
		<dc:creator>Aron Hegyi</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[Balsamic]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[Campania]]></category>
		<category><![CDATA[city]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[directions]]></category>
		<category><![CDATA[food]]></category>
		<category><![CDATA[geocoding]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Geocoding]]></category>
		<category><![CDATA[Google Refine]]></category>
		<category><![CDATA[guidebook]]></category>
		<category><![CDATA[Italy]]></category>
		<category><![CDATA[Italy Pummel]]></category>
		<category><![CDATA[KML]]></category>
		<category><![CDATA[La Vecchia Dispensa]]></category>
		<category><![CDATA[map]]></category>
		<category><![CDATA[PDF]]></category>
		<category><![CDATA[region]]></category>
		<category><![CDATA[restaurant]]></category>
		<category><![CDATA[self-service]]></category>
		<category><![CDATA[slow food]]></category>
		<category><![CDATA[solutions]]></category>
		<category><![CDATA[Tuscany]]></category>
		<category><![CDATA[Zingerman’s]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3637</guid>
		<description><![CDATA[Last May, I took a trip to Italy for two weeks. A little bit of history: my friend Jessica and I are both Italophiles, and when her mom sent us a link to a video contest where the prize was a round trip flight to Italy, we knew we had to enter. After a week [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/" data-text="Crowdsourcing a Map, to Eat" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/"></g:plusone></div></div><div class="wp-caption alignright" style="width: 343px"><a href="http://www.flickr.com/photos/aronh/5800985627/in/set-72157626896293314/" target="_blank"><img class="  " src="http://farm4.static.flickr.com/3098/5800985627_40ed3a7cfb.jpg" alt="A parmigiano reggiano aging room" width="333" height="500" /></a><p class="wp-caption-text">Expanse of Cheese: Caseificio di San Silvestro&#39;s Aging Room in Castelvetro (MO)</p></div>
<p>Last May, <span class="s1"><a title="I took a trip to Italy" href="http://www.youtube.com/watch?v=n66L-HplnKI" target="_blank">I took a trip to Italy</a></span> for two weeks. A little bit of history: my friend Jessica and I are both Italophiles, and when her mom sent us a link to <a title="A Video Contest" href="http://www.zingermans.com/BalsamicVideos.aspx" target="_blank">a video contest</a> where the prize was a round trip flight to Italy, we knew we had to enter. After a week of writing and editing lyrics in a Google Doc — half in Italian, half in English — <span class="s1"><a title="the resulting music video" href="http://www.youtube.com/watch?v=ot-ixjYolKM" target="_blank">the resulting music video</a> </span>ended up winning us a trip to the holy land of olive oil, vino, and other delectable edibles.</p>
<p class="p1">Apart from being a passionate eater, I&#8217;m a passionate supporter of the <span class="s1"><a title="Slow Food" href="http://slowfoodusa.org/" target="_blank">Slow Food</a></span> movement, an organization which promotes good, clean, and fair food around the world. Each year, they publish a guidebook to restaurants in Italy that adhere to their principles. In Italy, this usually means each restaurant is handpicked to showcase the traditional food of a particular region; each restaurant supports artisanal methods and products that otherwise might go extinct (were eaters not eating them), and where the food is most likely naturally organic and local anyway.</p>
<p class="p1"><span id="more-3637"></span>But, a problem: it&#8217;s 2011, and I&#8217;m more apt to travel through interactive maps than with old-fashioned guidebooks. More importantly, I don&#8217;t plan, and I needed to know what edible options were around me at any moment in time on my trip. What I really needed was a version of the guidebook, in map form, that I could use on my mobile. No such option existed, of course, and I was left with two options: magically visualize restaurants around me by poring through the guidebook, or, crowdsource it.</p>
<p class="p1">What I needed to do (and, what much of our work at CrowdFlower comprises), was structure unstructured data. To create the map from the book, I went through the following steps:</p>
<ul class="ul1">
<li class="li3"><span class="s2">Obtained <span class="s3"><a title="a PDF version of the book" href="http://ultimabooks.simplicissimus.it/catalog/product/view/id/1874/s/osterie-d-italia-2011/" target="_blank">a PDF version of the book</a></span></span></li>
<li class="li1">Split the book into pages, and uploaded each page as a separate PDF (thanks to <span class="s1"><a title="pdftk" href="http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/" target="_blank">pdftk</a></span>)</li>
<li class="li1">Created a CSV (comma separated values file) with each page&#8217;s PDF link and page number</li>
<li class="li1">Created a crowdsourcing task to structure the data, using the previously uploaded individual PDF pages</li>
<li class="li1">Geocoded the structured data</li>
<li class="li1">Output the geocoded data in KML (<span class="s1"><a title="Keyhole Markup Language" href="http://en.wikipedia.org/wiki/Keyhole_Markup_Language" target="_blank">Keyhole Markup Language</a></span>) form</li>
<li class="li1">Uploaded the KML file to a mapping site (e.g. Google Maps)</li>
</ul>
<p class="p1">Conveniently, each page that outlined a restaurant in the PDF was formatted nearly the same. This made it easy to give instructions to workers, as seen below:</p>
<div class="wp-caption aligncenter" style="width: 582px"><a href="http://publicassets.s3.amazonaws.com/pdf_transcription_test/capture_areas_example.gif"><img class=" " src="http://publicassets.s3.amazonaws.com/pdf_transcription_test/capture_areas_example.gif" alt="crowdsourcing" width="572" height="414" /></a><p class="wp-caption-text">Instructions</p></div>
<p class="p1">For each area on the page, workers were asked to copy and paste specific sections into the task. Each page, then, was split up into corresponding parts (Region, City, Directions, Restaurant Name, and the two Capture Areas). This is the essential concept here: structuring the unstructured data such that I could later geocode it properly, and display it in the way that I needed.</p>
<div class="wp-caption alignnone" style="width: 510px"><a href="http://www.flickr.com/photos/aronh/5801664934/in/set-72157626896293314/" target="_blank"><img class=" " src="http://farm6.static.flickr.com/5199/5801664934_027586411b.jpg" alt="crowdsourcing Pasta" width="500" height="333" /></a><p class="wp-caption-text">The tonnarelli cacio e pepe at Da Felice a Testaccio in Rome (used in the Geocode example)</p></div>
<p class="p1">Once the task finished, I downloaded the resulting CSV file, and whipped out <a title="Google Refine" href="http://code.google.com/p/google-refine/" target="_blank">Google Refine</a> (a.k.a. Excel on crack), which has a feature that allows you to enter a template API call that changes based on specific values in each row. Using the <a title="Google Geocoding API" href="http://code.google.com/apis/maps/documentation/geocoding/" target="_blank">Google Geocoding API</a> (any will do), I constructed the following API call, using the address value in each row as the “address&#8221; parameter for each API call:</p>
<pre style="color: #000; padding: 17px 20px; border: 1px solid #E6DB55; background: lightYellow;">http://maps.googleapis.com/maps/api/geocode/json?address=via+Mastro+Giorgio%2C+26+Roma+Lazio&amp;sensor=false&amp;region=it&amp;language=it</pre>
<p class="p1">After slicing and dicing the rest of the data into bits that I wanted to display on a map, I used Google Refine&#8217;s &#8220;templating&#8221; feature to export each row as a Placemark in KML format. Finally, I uploaded the resulting KML file into several different maps, each representing one region in Italy.</p>
<div id="attachment_3683" class="wp-caption alignnone" style="width: 477px"><a href="http://maps.google.com/maps/ms?msa=0&amp;msid=209651542845453072441.0004a388168c61ef8f9ec" target="_blank"><img class="size-full wp-image-3683  " src="http://blog.crowdflower.com/wp-content/uploads/2011/09/Screen-shot-2011-09-15-at-4.02.58-PM1.png" alt="crowdsource food map" width="467" height="304" /></a><p class="wp-caption-text">A portion of the Campania map - tons of places to explore around here!</p></div>
<p class="p1">Try it out! Check out the map for <a title="Tuscany" href="http://maps.google.com/maps/ms?msa=0&amp;msid=209651542845453072441.0004a2ac2f900cba0b2f3" target="_blank">Tuscany</a>, and the map for <a title="Campania" href="http://maps.google.com/maps/ms?msa=0&amp;msid=209651542845453072441.0004a388168c61ef8f9ec" target="_blank">Campania</a>.</p>
<p class="p1">If you want to give crowdsourcing a spin, head on over to our <a title="Crowdsource Self-Service" href="http://crowdflower.com/solutions/self-service" target="_blank">self-service product</a> and click &#8220;sign up&#8221;. Or, if you&#8217;d prefer a hands-off &#8220;we-do-everything&#8221; approach, <a title="solutions@crowdflower.com" href="mailto:solutions@crowdflower.com" target="_blank">contact us</a> to get started.</p>
<p>Watch the video that started it all: <a href="http://www.youtube.com/watch?v=ot-ixjYolKM?rel=0" target="_blank"><em>Io Sono Balsamico</em> by Balsamico</a></p>
<p><em>Aron is a Crowdsourcing Project Manager at CrowdFlower, and is the resident agriculturalist-eater. Follow him on Twitter (<a href="http://twitter.com/aron" target="_blank">@aron</a>) for more sage bits of agricultural-eating learnings.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/09/crowdsourcing-a-map-to-eat/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Confidence Bias: Evidence from Crowdsourcing</title>
		<link>http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/</link>
		<comments>http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/#comments</comments>
		<pubDate>Wed, 07 Sep 2011 23:02:04 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[confidence]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[experiment]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3080</guid>
		<description><![CDATA[Evidence in experimental psychology suggests that most people overestimate their own ability to complete objective tasks accurately. This phenomenon, often called confidence bias, refers to &#8220;a systematic error of judgment made by individuals when they assess the correctness of their responses to questions related to intellectual or perceptual problems.&#8221; 1 But does this hold up in crowdsourcing? We ran an experiment to [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/" data-text="Confidence Bias: Evidence from Crowdsourcing" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/" data-counter="top"></script></div></div><div id="attachment_3593" class="wp-caption alignleft" style="width: 164px"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/overconfidence/" rel="attachment wp-att-3593"><img class="size-full wp-image-3593    " title="psychologytoday.com" src="http://blog.crowdflower.com/wp-content/uploads/2011/09/Overconfidence.gif" alt="crowdsourcing" width="154" height="132" /></a><p class="wp-caption-text">psychologytoday.com</p></div>
<p>Evidence in experimental psychology suggests that most people overestimate their own ability to complete objective tasks accurately. This phenomenon, often called <em>confidence bias, </em>refers to &#8220;a systematic error of judgment made by individuals when they assess the correctness of their responses to questions related to intellectual or perceptual problems.&#8221; <sup><a href="#footnote-1">1</a></sup> But does this hold up in crowdsourcing?</p>
<p>We ran an experiment to test for a persistent difference between people&#8217;s perceptions of their own accuracy and their actual objective accuracy. We used a set of standardized questions, focusing on the Verbal and Math sections of a common standardized test. For the 829 individuals who answered more than 10 of these questions, we asked for the correct answer as well as an indication of how confident they were of the answer they supplied.</p>
<p><span id="more-3080"></span>We didn&#8217;t use any Gold in this experiment. Instead, we incentivized performance by rewarding those finishing in the top 10%, based on objective accuracy.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/sample_problem/" rel="attachment wp-att-3427"><img class="aligncenter size-full wp-image-3427" title="sample_problem" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/sample_problem.jpg" alt="crowdsourcing" width="713" height="520" /></a></p>
<h2>Does Bias Exist?<em> </em></h2>
<p>To estimate confidence bias, we looked at the difference between the average of how confident an individual was of his/her answers and how many he/she answered correctly. If the difference is positive, the individual overestimated how well they did. <strong>Amazingly, over 75% of contributors overestimated their ability to answer multiple choice questions correctly.</strong></p>
<h2><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/histogram_res/" rel="attachment wp-att-3278"><img class="aligncenter size-full wp-image-3278" title="histogram_res" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/histogram_res.jpg" alt="crowdsourcing" width="599" height="341" /></a></h2>
<h2>Are Individuals Consistently Biased?</h2>
<p>Because our dataset consisted of Math and Verbal questions, we looked at each individual contributor&#8217;s confidence bias for both types of questions. In aggregate, people tended to have more trouble with the Verbal questions (average accuracy of 28%, compared to 41% for Math), though the average confidence score was nearly identical (63% +/-1).</p>
<h2><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/scatterplot/" rel="attachment wp-att-3279"><img class="aligncenter size-full wp-image-3279" title="scatterplot" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/scatterplot.jpg" alt="crowdsourcing" width="639" height="380" /></a></h2>
<p>The vast majority of contributors fall into the &#8220;overconfident on both&#8221; quadrant (top right), while only a handful of contributors were overconfident for one question type and underconfident for the other (top left and bottom right quadrants). Overall, there is certainly a correlation between bias scores on the two problem types, suggesting that many individuals are consistently biased on different types of problems. However, this explains only a portion of the variation.</p>
<h2>Does Bias Vary Across Groups?</h2>
<p>Given that overconfidence seems to be a consistent trait, we were curious how this trait varies across the different groups making up our contributor pool. We sliced and diced our contributors into a number of different sub-groups, which are summarized below.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/summary-table/" rel="attachment wp-att-3280"><img class="aligncenter size-full wp-image-3280" title="summary table" src="http://blog.crowdflower.com/wp-content/uploads/2011/07/summary-table.jpg" alt="crowdsourcing" width="703" height="460" /></a></p>
<p>There are a lot of interesting things going on here. To highlight a few, accuracy increases consistently as the contributor&#8217;s education level advances from High School to College, but so does confidence, leaving the bias score nearly unchanged. There&#8217;s a similar pattern with Age, with older contributors tending to be both more accurate and more confident.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/splits/" rel="attachment wp-att-3408"><img class="aligncenter size-full wp-image-3408" title="splits" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/splits.jpg" alt="crowdsourcing splits" width="768" height="252" /></a></p>
<p>Gender and Location also have an effect on confidence bias. Taking the two countries that supplied the most people, contributors from the US were much more accurate and slightly more confident than the average, while those from India were average in terms of accuracy but much more confident. As such, the bias score for contributors from India is nearly double that of contributors from the US. With respect to gender, confidence didn&#8217;t vary much, but women were more accurate and thus less biased than men. Moving on.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/splits2/" rel="attachment wp-att-3405"><img class="aligncenter size-full wp-image-3405" title="splits2" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/splits2.jpg" alt="crowdsourcing" width="773" height="251" /></a></p>
<h2>Further Research</h2>
<p>In the context of experimentation, we decided against using Gold to minimize any selection bias among contributors. However, this makes it difficult to apply these results to enterprise crowdsourcing, at least as practiced by CrowdFlower. In the future, it would be interesting to look at confidence bias among trusted workers only, and particularly among trusted workers with repeated experience in specific job types. We would expect these workers to have a better sense of whether their answers are correct, though it is possible (and perhaps likely) that confidence would increase along with accuracy.</p>
<p>&nbsp;</p>
<hr />
<p id="footnote-1" style="text-align: -webkit-auto;">1. Pallier, G., Wilkinson, R., Danthir, V., Kleitman, S., Knezevic, G., Stankov, L., &amp; Roberts, R. D. (2002). The role of individual differences in the accuracy of conﬁdence judgments. Journal of General Psychology, 129,257–299</p>
<p style="text-align: center;">
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/09/confidence-bias-evidence-from-crowdsourcing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing and SEM (now with even more cat pics)</title>
		<link>http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/</link>
		<comments>http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/#comments</comments>
		<pubDate>Fri, 26 Aug 2011 18:58:20 +0000</pubDate>
		<dc:creator>Zoe Vance</dc:creator>
				<category><![CDATA[Challenges]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[cats]]></category>
		<category><![CDATA[challenge]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[customers]]></category>
		<category><![CDATA[gold]]></category>
		<category><![CDATA[keyword]]></category>
		<category><![CDATA[ninja]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[SEM]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=3450</guid>
		<description><![CDATA[Every modern business wrestles with the elusive lady that is the search engine and the potential she offers to connect with customers. Google and Bing make it easy for anyone to buy keywords and drive customers to a website, but what keywords are our customers searching for? Would a sales manager frustrated with the average [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/" data-text="Crowdsourcing and SEM (now with even more cat pics)" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/" data-counter="top"></script></div></div><p>Every modern business wrestles with the elusive lady that is the search engine and the potential she offers to connect with customers. Google and Bing make it easy for anyone to buy keywords and drive customers to a website, but what keywords are our customers searching for? Would a sales manager frustrated with the average 70-80% accuracy of business listings bought from data providers search for &#8220;crowdsourcing,&#8221; &#8220;address checking,&#8221; or something else entirely? Since we&#8217;re a crowdsourcing company, we had to try crowdsourcing the solution &#8230;</p>
<p>In the last two weeks of my summer internship at CrowdFlower, the marketing team challenged me to generate the widest range of search engine seed terms that could be used in SEM keyword tools to generate &#8220;hot&#8221; search phrases. For those of you who’ve dealt with SEM, you know that thinking of seed phrases to plug into these tools can be a painfully frustrating and surprisingly difficult task. (For those of you who haven’t and don’t believe me, try right now to describe what your company does — or anything for that matter — in 10 significantly different ways.)</p>
<p>Keyword tools are based on your thought process, which takes care of the customers who are thinking in the same way you are, but what about all the people of a different mindset who are trying to find your solution? For example, if I were looking for pet grooming services, depending on my thought process, vocabulary range, and amount of sleep the night before, I could search anything from “pet grooming salon” to “quality feline hair cuts” to “kitty bad hair day.” The challenge was to understand the full breadth of how the crowd approaches a certain problem, essentially the perfect task for the crowd.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/pet-grooming-salon-google-search/" rel="attachment wp-att-3453"><img class="aligncenter size-full wp-image-3453" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/pet-grooming-salon-Google-Search.jpg" alt="crowdsourcing seo" width="753" height="167" /></a> <a href="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/quality-feline-haircuts-google-search/" rel="attachment wp-att-3454"><img class="aligncenter size-full wp-image-3454" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/quality-feline-haircuts-Google-Search.jpg" alt="crowd sourcing seo" width="756" height="170" /></a></p>
<div id="attachment_3452" class="wp-caption aligncenter" style="width: 766px"><a href="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/kitty-bad-hair-day-google-search/" rel="attachment wp-att-3452"><img class="size-full wp-image-3452 " src="http://blog.crowdflower.com/wp-content/uploads/2011/08/kitty-bad-hair-day-Google-Search.jpg" alt="crowdsourcing sem" width="756" height="294" /></a><p class="wp-caption-text">Looks like PetSmart forgot to buy an ad for &quot;kitty bad hair day&quot;</p></div>
<p><span id="more-3450"></span><br />
I set up the job to have the contributor imagine working for a business that could use CrowdFlower’s services (whether or not they know of them) and then have the contributors write search queries they would use to find a solution to their problem on the Internet. I created numerous versions for each scenario, varying both the industry jargon and the background we gave the contributor, so that I could fully mimic the knowledge and background of a real customer, and thus elicit as wide a range of responses as possible. I then ran a second job to rate those queries to identify quality results worth pursuing. Finally, I took those quality terms and fed them back into a keyword generator tool, making sure that we had the full range of potential search phrases optimized to what people would search for the most.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/crowdflower-job-58631-preview/" rel="attachment wp-att-3457"><img class="aligncenter size-full wp-image-3457" src="http://blog.crowdflower.com/wp-content/uploads/2011/08/CrowdFlower-Job-58631-Preview.jpg" alt="crowd sourcing sem" width="620" height="312" /></a> An unspoken secondary challenge was &#8220;Can you become a full-fledged crowdsourcing ninja before you leave?&#8221; This final job I ran was by far the most challenging because crowdsourcing content generation requires a seemingly daunting use of CrowdFlower’s automated workflow system, which uses gold units and an active, real-time peer review system. I successfully passed this final test, with the help of our in-house content generation team.</p>
<div id="attachment_3533" class="wp-caption alignnone" style="width: 575px"><a href="http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/job-sample-results1-2/" rel="attachment wp-att-3533"><img class="size-full wp-image-3533 " src="http://blog.crowdflower.com/wp-content/uploads/2011/08/Job-Sample-Results11.gif" alt="crowdsourcing sem" width="565" height="274" /></a><p class="wp-caption-text">Sample Generated Keyword Seeds</p></div>
<p>The keyword gen job yielded really interesting results. There were some genuinely clever and alternative thought process seeds, some that we were already using, and some that were borderline ridiculous. My personal favorite keyword seed though, as the office intern, had to be &#8220;how to find an intern&#8221; — the obvious solution to all life’s problems.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/08/crowdsourcing-and-sem-now-with-even-more-cat-pics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Designing Incentives for Crowdsourcing Workers</title>
		<link>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/</link>
		<comments>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/#comments</comments>
		<pubDate>Tue, 24 May 2011 19:19:45 +0000</pubDate>
		<dc:creator>Aaron Shaw</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Human Behavior]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[behavior]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[incentives]]></category>
		<category><![CDATA[motivation]]></category>
		<category><![CDATA[social science]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2572</guid>
		<description><![CDATA[In a recent paper, presented at the ACM Conference on Computer Supported Cooperative Work (CSCW), John Horton, Daniel Chen and I used a large-scale experiment to test the effect of different incentive schemes on the quality of crowdsourcing work. The results surprised us. They suggest that workers perform most accurately when the task design credibly [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/" data-text="Designing Incentives for Crowdsourcing Workers" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/"></g:plusone></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/" data-counter="top"></script></div></div><p>In a <a title="Designing Incentives for Inexpert Human Raters, Berkman Center" href="http://cyber.law.harvard.edu/publications/2011/Designing_Incentives_Inexpert_Human_Raters">recent paper</a>, presented at the ACM Conference on Computer Supported Cooperative Work (CSCW), <a title="John Horton, oDesk" href="https://sites.google.com/site/johnjosephhorton/">John Horton</a>, <a title="Daniel Chen, Duke Law School" href="http://www.law.duke.edu/fac/chen">Daniel Chen</a> and <a title="Aaron Shaw, UC Berkeley &amp; Harvard" href="http://aaronshaw.org">I</a> used a large-scale experiment to test the effect of different incentive schemes on the quality of crowdsourcing work.</p>
<p>The results surprised us. They suggest that workers perform most accurately when the task design credibly links payoffs to a worker&#8217;s ability to think about the answers that their peers are likely to provide.</p>
<p style="text-align: center;">
<div id="attachment_2577" class="wp-caption aligncenter" style="width: 549px"><a href="http://www.flickr.com/photos/iyoupapa/"><img class="size-full wp-image-2577 " title="Horserace!" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/3757438159_horserace-iyoupapa-altered.jpg" alt="Horserace!" width="539" height="264" /></a><p class="wp-caption-text">a horserace experiment! (photo cc-by-sa by iyoupapa)</p></div>
<p><span id="more-2572"></span></p>
<p>The idea for this study came out of our sense that, as social scientists, we had something unique to offer the existing research on human computation. <a title="AMT is fast, cheap, and good for machine learning data" href="http://blog.crowdflower.com/2008/09/amt-fast-cheap-good-machine-learning/">Early</a> and <a title="&quot;Get Another Label?&quot; Ipeirotis et al. 2008" href="http://archive.nyu.edu/handle/2451/25882">influential</a> crowdsourcing research has focused on how to filter the judgments of the crowd to find the best answers. We wanted to know whether simple task-design changes could improve the quality of data coming into a crowdsourcing system in the first place.</p>
<p>To test this idea, we chose 14 different incentive schemes and framing techniques developed and validated across the social sciences and set up a horse race experiment to see which schemes/techniques would work best.</p>
<p>Consistent with our personal biases (John and Daniel are both economists, and I&#8217;m a sociologist), some of the schemes were financially oriented, some were social or psychological, and some were hybrids combining social and financial incentives. The details of all the schemes are included <a title="Designing Incentives for Inexpert Human Raters" href="http://cyber.law.harvard.edu/publications/2011/Designing_Incentives_Inexpert_Human_Raters">in the paper</a> (it&#8217;s a long list, and some of them are kind of involved), but it&#8217;s worth giving some examples.</p>
<p>On the financial end of the incentives spectrum, we had one condition we called &#8220;reward-accuracy,&#8221; which was pretty much what you&#8217;d expect: we told workers, &#8220;we&#8217;ll pay you a bonus if you get the answers right.&#8221; We also had one called &#8220;punishment-accuracy,&#8221; the gist of which you can deduce. On the purely social-psychological side, we had one we called &#8220;trust,&#8221; in which we told workers, &#8220;we&#8217;ll pay you for this job no matter how bad your performance, we trust that you&#8217;ll still make your best effort.&#8221;</p>
<p>One of the weirdest schemes turns out to be important, so I need to explain that one. Called &#8220;Bayesian Truth Serum&#8221; (BTS), it incorporates a design from the work of <a title="Drazen Prelec" href="http://econ-www.mit.edu/faculty/dprelec">Drazen Prelec</a>, a behavioral economist at MIT, who realized that research subjects could probably provide useful information regarding the expected distribution for subjective, qualitative questions (<em>nb</em>, the mechanics of how he does this are arcane in a way that is almost sure to delight the geeks among you, so I encourage you to <a title="Bayesian Truth Serum" href="http://econ-www.mit.edu/files/1966">read his paper</a>). Few of the details of <em>real</em> BTS are important, except that we incorporated the piece about asking workers to answer the questions themselves <em>and predict the distribution of other workers&#8217; responses</em>. We also told them we&#8217;d give them a bonus if their predictions were correct.</p>
<p>We then created a task that asked workers to answer five questions. In this case, the questions were drawn from another study examining participatory features of websites, for which we already possessed validated data collected by research assistants.</p>
<p>All workers answered the same five questions about the same website (<a href="http://www.kiva.org">www.kiva.org</a>) while being exposed to one and only one of the 14 incentive schemes (or a control condition of no scheme). Roughly 2,000 individuals participated in the study, resulting in over 100 subjects in each of the experimental conditions. (The statistics and science nerds out there will be pleased to know that both the drop-out rate and demographic covariates were distributed evenly across conditions.)</p>
<p>To measure worker performance, we used the research assistant responses as correct answers to the questions and then calculated the total number of matching answers (out of five) provided by each worker. The results (aggregated across all treatments) are plotted in a histogram below and show that the average worker answered just over two questions out of five correctly.</p>
<p style="text-align: center;"><a href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/aggperf/" rel="attachment wp-att-2578"><img class="aligncenter size-full wp-image-2578" title="Inexpert raters - Aggregate Performance" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/AggPerf.png" alt="Aggregate performance histogram" width="280" height="280" /></a></p>
<p>&nbsp;</p>
<p>Then, in order to see how the treatments compared against each other relative to the control group, we calculated the mean correct response rate for each condition and conducted difference of means tests to see which of these means were significantly greater than the control group. The results of this comparison appear below (in a new plot that doesn&#8217;t even appear in the paper!):</p>
<p><a href="http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/inexpert-itt/" rel="attachment wp-att-2579"><img class="aligncenter size-full wp-image-2579" title="inexpert raters - ITT estimates" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/inexpert-ITT.png" alt="ITT estimates per treatment" width="500" height="500" /></a></p>
<p>The orange dots show the value of the mean in each condition, and the blue bars illustrate the 95% confidence interval around that mean. The treatments are sorted by the size of the difference in means from the control. (More hard-core nerd stuff: the means are adjusted using Intent-To-Treat estimators).</p>
<p>From these results, we concluded that our horse race had two clear front-runners: the &#8220;Bayesian Truth Serum&#8221; (BTS) and &#8220;Punishment &#8211; disagreement&#8221; conditions, each of which improved average worker performance by almost half of a correct answer above the 2.08 correct answers in the control group. A few of the other financial and hybrid incentives had fairly large point estimates, but were not significantly different from control once we adjusted the test statistics and corresponding p-values to account for the fact that we were making so many comparisons at once (apologies if this doesn&#8217;t make sense — it&#8217;s yet another precautionary measure to avoid upsetting the stats nerds among you). In a tough turn for the sociologists and psychologists, none of the purely social/psychological treatments had any signficant effects at all.</p>
<p>Why do BTS and punishing workers for disagreement succeed in improving performance significantly where so many of the other incentive schemes failed? The answer hinges on the fact that both conditions tied workers&#8217; payoffs to their ability to think about their peers&#8217; likely responses. (We elaborate on the argument in more detail in the paper.)</p>
<p>Does this mean that we should give up on simple financial or social-psychological incentives? Probably not. The fact that we conducted the experiment on MTurk means that the deck may have been stacked against incentives like the &#8220;trust&#8221; condition I described earlier. Because requesters on MTurk have little oversight, workers are more likely to respond to financial incentives than stated promises. In this sense, the marketplace has structured the interaction between workers and requesters in a way that may limit the opportunities to harness motivations that are not linked to money in some explicit way.</p>
<p>You can <a title="Designing Incentives for Inexpert Human Raters" href="http://cyber.law.harvard.edu/sites/cyber.law.harvard.edu/files/Shaw-Horton-Chen_Designing_Incentives_Inexpert_Human_Raters_2011.pdf">download the full paper</a> to read more.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/05/designing-incentives-for-crowdsourcing-workers/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>eDiscovery, meet Crowd</title>
		<link>http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/</link>
		<comments>http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/#comments</comments>
		<pubDate>Fri, 06 May 2011 19:58:01 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Law]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[document review]]></category>
		<category><![CDATA[eDiscovery]]></category>
		<category><![CDATA[legal]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2441</guid>
		<description><![CDATA[Once upon a time, I had a job that included looking through boxes of documents that were supposedly related to environmental litigation, but were generally (a) unrelated, (b) dusty and (c) mind-numbingly dull. Earlier this year, as I looked back on those dark days, it seemed to me that crowdsourcing would be a great tool [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/" data-text="eDiscovery, meet Crowd" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div></div><p>Once upon a time, I had a job that included looking through boxes of documents that were supposedly related to environmental litigation, but were generally (a) unrelated, (b) dusty and (c) mind-numbingly dull. Earlier this year, as I looked back on those dark days, it seemed to me that crowdsourcing would be a great tool for a first pass through documents, helping a legal team focus its efforts away from documents that are obviously not responsive to a given request.</p>
<p><span id="more-2441"></span></p>
<p>To test this suspicion, we used a dataset of ~2,700 documents that were pre-coded with relevance assessments by a team of legal experts associated with the TREC 2010 Legal Learning task.<sup><a href="#footnote-1">1</a></sup> We asked multiple workers whether each document, emails made public during the course of the Enron investigation, was responsive to a request for Residential Real Estate (full instructions <a href="http://plg1.uwaterloo.ca/~gvcormac/treclegal09/topic.txt">here</a>):</p>
<p><img class="aligncenter size-full wp-image-2442" title="ediscovery_ui" src="http://blog.crowdflower.com/wp-content/uploads/2011/04/ediscovery_ui.jpg" alt="" width="647" height="425" /></p>
<p style="text-align: -webkit-auto;"><strong><span style="font-size: 26px;">&#8220;So a Team of Lawyers walks into a Conference Room&#8230;&#8221;</span></strong></p>
<p style="text-align: -webkit-auto;">We ran two iterations of the document review task. In the first, we used a Gold <sup><a href="#footnote-2">2</a></sup> distribution of 50% Relevant and 50% Not Relevant. For the second iteration, we increased the proportion of Relevant documents in our Gold set to 60%. Our thinking was that, at least in the context of litigation, returning an irrelevant document (false positive) was preferable to missing a Relevant document (false negative).</p>
<p style="text-align: center;"><a rel="attachment wp-att-2448" href="http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/ediscover_stats/"><img class="aligncenter size-full wp-image-2448" title="ediscover_stats" src="http://blog.crowdflower.com/wp-content/uploads/2011/04/ediscover_stats.jpg" alt="" width="835" height="264" /></a></p>
<p style="text-align: -webkit-auto;">In the chart above, <strong>Recall</strong> is the percentage of all responsive documents that were returned; it measures how thorough the search is. <strong>Precision</strong> is the percentage of returned documents that are responsive; it measures how accurate the process is. <strong>F1</strong> is the harmonic mean, a simple summary measure that rewards high values in both. We used a simple majority among workers to determine a document&#8217;s relevance.</p>
<p style="text-align: -webkit-auto;">Before looking at how this performance compares with manual and automated document review, it&#8217;s worth noting that changing the Gold distribution had very little effect on the overall accuracy, but it had a large effect on the distribution of errors. By increasing the frequency of Relevant Gold units, we cut the number of false negative errors by nearly 50%. In a context where one type of error is relatively more &#8220;expensive&#8221; than another, this is a useful tool to be aware of.</p>
<p style="text-align: -webkit-auto;">Without running the same dataset through crowdsourced, automated, and manual document review, it&#8217;s difficult to compare performance across methods. Nevertheless, Grossman and Cormack (2011) discuss manual and automated document review, finding that average recall for manual review of documents can be as low as 20-50%, though typically with much higher precision. For automated review on a dataset similar to the one we used, recall averaged 77%, though with average precision of 85%.<sup><a href="#footnote-3">3</a></p>
<p></sup></p>
<p style="text-align: -webkit-auto;"><strong><span style="font-size: 26px;">Living in a (non)Binary World</span></strong></p>
<p>Every document in our test was also graded with a probabilistic measure of Relevance by default. Because we asked multiple reviewers whether a given document was Relevant, we used inter-coder agreement to suggest the likelihood that a document is responsive. Further, because we tracked each individual worker&#8217;s performance on Gold, we weighted each worker&#8217;s contribution to the agreement by his/her estimated accuracy.</p>
<p>For this exercise, we remapped our confidence scores such that a document that was the least likely to be Relevant received a Relevance Score of 0.01, while the documents most likely to be Relevant received a Relevance Score of 0.99. <sup><a href="#footnote-4">4</a></sup> The distribution of documents by Relevance Score is included below.</p>
<p><a rel="attachment wp-att-2453" href="http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/ediscover_counts/"><img class="aligncenter size-full wp-image-2453" title="ediscover_counts" src="http://blog.crowdflower.com/wp-content/uploads/2011/04/ediscover_counts.jpg" alt="" width="519" height="389" /></a></p>
<p>Note that because most documents in this test received judgments from three different workers, there isn&#8217;t much variation in the middle of the distribution. Most units had either unanimous agreement or a 2/1 split. Nevertheless, the Relevance Score makes it possible to set a threshold on what should be considered Relevant.</p>
<p>By changing the threshold, we can include any document that received at least one judgment of Relevant (increased Recall) or to include only documents that did not receive any judgment of Not Relevant (increased Precision). As shown below, different thresholds dramatically influence the number and characteristics of the documents returned as Relevant.</p>
<p><a rel="attachment wp-att-2539" href="http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/pr_flip/"><img class="aligncenter size-full wp-image-2539" title="PR_flip" src="http://blog.crowdflower.com/wp-content/uploads/2011/05/PR_flip.jpg" alt="" width="640" height="642" /></a></p>
<p style="text-align: -webkit-auto;">While there is no substitute for trained legal experts, these results show that crowdsourcing is an effective complement to eDiscovery document review. The promise of putting multiple pairs of eyes on every document dramatically decreases the likelihood of missing a relevant document. And consider that in less than 24 hours, we collected over 15,000 unique relevance judgments on nearly 3,000 documents, and for much less than the billing rate of your average attorney.</p>
<hr />
<p id="footnote-1" style="text-align: -webkit-auto;">1 <a href="http://plg1.uwaterloo.ca/~gvcormac/treclegal09/">http://plg1.uwaterloo.ca/~gvcormac/treclegal09/</a></p>
<p id="footnote-2" style="text-align: -webkit-auto;">2 One of the ways that we control for quality is by randomly inserting a subset of units for which we already know the answers. We refer to this data as Gold. We track worker performance on these Gold units as a proxy for overall accuracy. Additional documentation is <a href="http://crowdflower.com/docs/gold">here.</a></p>
<p id="footnote-3" style="text-align: -webkit-auto;">3 Maura R. Grossman &amp; Gordon V. Cormack, Technology-Assisted Review in E-Discovery Can Be More Effective and More Efficient Than Exhaustive Manual Review, XVII  RICH. J.L. &amp; TECH. 11 (2011), <a href="http://jolt.richmond.edu/v17i3/article11.pdf">http://jolt.richmond.edu/v17i3/article11.pdf</a></p>
<p id="footnote-4" style="text-align: -webkit-auto;">4 For &#8220;Relevant&#8221; documents, P(Relevance)=0.99*(Confidence). For &#8220;Not Relevant&#8221; documents, P(Relevance)=1-0.99(Confidence)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/05/ediscovery-meet-crowd/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Announcing the TREC 2011 Crowdsourcing Track, co-sponsored by CrowdFlower</title>
		<link>http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/</link>
		<comments>http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/#comments</comments>
		<pubDate>Tue, 26 Apr 2011 19:46:39 +0000</pubDate>
		<dc:creator>Vaughn Hester</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2411</guid>
		<description><![CDATA[We&#8217;re proud to announce CrowdFlower&#8217;s participation in one of the most prestigious academic conferences worldwide. The 20th Annual Text REtrieval Conference (TREC 2011), organized by the National Institute of Standards (NIST), will take place Nov 15-18, 2011, at NIST&#8217;s campus in Gaithersburg, MD. TREC 2011 will feature a new Crowdsourcing Track that will investigate: How [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/" data-text="Announcing the TREC 2011 Crowdsourcing Track, co-sponsored by CrowdFlower" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/"></g:plusone></div></div><p>We&#8217;re proud to announce CrowdFlower&#8217;s participation in one of the most prestigious academic conferences worldwide.</p>
<p>The 20th Annual <a href="http://trec.nist.gov/">Text REtrieval Conference (TREC 2011)</a>, organized by the National Institute of Standards (NIST), will take place Nov 15-18, 2011, at NIST&#8217;s campus in Gaithersburg, MD. TREC 2011 will feature a new <a href="https://sites.google.com/site/treccrowd2011/">Crowdsourcing Track</a> that will investigate:</p>
<ul>
<li>How to obtain high-quality relevance judgments from individual crowd workers;</li>
<li>How to effectively compute consensus judgments from individual judgments;</li>
<li>Interaction between these (i.e., worker accuracy vs. subsequent consensus accuracy achieved).</li>
</ul>
<p><span id="more-2411"></span></p>
<p>Teams can choose to participate in either or both tasks: (1) collecting judgments and (2) computing consensus.  We plan to evaluate using both ranking and classification metrics, giving teams the flexibility to focus on either one or both.</p>
<p>We welcome participation from those who have never before participated in TREC; note there is an agreement form to submit in advance; the <a href="http://ir.nist.gov/trecsubmit.open/application.html">form</a> is available from the track website.</p>
<p>View the guidelines <a href="https://sites.google.com/site/treccrowd2011/home/TREC2011CrowdsourcingTrack.pdf?attredirects=0&#038;d=1">here</a>.</p>
<p><strong>Tentative Schedule</strong></p>
<p><strong>May 6, 2011:</strong> Initial deadline for participating teams to email organizers with team name and<br />
intent to participate (will affect the partition of data distributed to all participants)</p>
<p><strong>May 13:</strong> Finalized guidelines released, Stage 1 training sets distributed to participants</p>
<p><strong>May 20:</strong> Stage 2 training set distributed to participants</p>
<p><strong>August 1:</strong> Release of Stage 1 test data; submission system opens immediately</p>
<p><strong>August 8:</strong> Stage 1 submissions due</p>
<p><strong>August 15:</strong> Stage 2 test topics distributed</p>
<p><strong>August 22:</strong> Stage 2 submissions due</p>
<p><strong>~September 15:</strong> Preliminary results announced to participants</p>
<p><strong>~October 15:</strong> Team papers due to NIST for inclusion in TREC conference<br />
notebook</p>
<p><strong>November 15-18:</strong> Text REtrieval Conference (TREC 2011) in Gaithersburg, MD</p>
<p><strong>~February 1, 2012:</strong> Final team papers due to NIST for inclusion in TREC conference proceedings</p>
<p>For more information, please contact Track Organizers <a href="http://www.gabriella-kazai.com/">Gabriella Kazai</a>  (Microsoft Research) or <a href="http://www.ischool.utexas.edu/~ml/">Matthew Lease</a> (University of Texas at Austin).</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/04/announcing-the-trec-2011-crowdsourcing-track-co-sponsored-by-crowdflower/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CrowdFlower and Citizen Effect: So Much Madness</title>
		<link>http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/</link>
		<comments>http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/#comments</comments>
		<pubDate>Fri, 08 Apr 2011 17:47:02 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Charity]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Office Life]]></category>
		<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2333</guid>
		<description><![CDATA[After hearing that our friends at Citizen Effect were sponsoring Brackets with Benefits, a cool new March Madness pool where the proceeds from all the gambling go to one of the many awesome community development projects they support, we thought it would be fun to get involved. Considering how most people** watched their brackets go [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/" data-text="CrowdFlower and Citizen Effect: So Much Madness" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/"></g:plusone></div></div><p>After hearing that our friends at <a href="http://www.citizeneffect.org" target="_blank">Citizen Effect</a> were sponsoring Brackets with Benefits, a cool new March Madness pool where the proceeds from all the gambling go to one of the many awesome community development projects they support, we thought it would be fun to get involved. Considering how most people** watched their brackets go up in flames by the time the Sweet Sixteen rolled around, we thought it would be fun to give everyone a second chance to play.</p>
<p>We teamed up with Citizen Effect to invite all Brackets with Benefits participants, as well as the general public, to pick the results of the Sweet Sixteen and Final Four, a total of 10 games. We had one clear winner, who correctly predicted the outcome of 8 games (and missed Duke/Arizona and Florida/BYU, for those keeping track at home) and will be going home with a fancy new camcorder. For what it&#8217;s worth, the aggregated response of the crowd correctly picked 7 games.</p>
<p><a rel="attachment wp-att-2334" href="http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/bwb_picks/"><img class="aligncenter size-full wp-image-2334" title="bwb_picks" src="http://blog.crowdflower.com/wp-content/uploads/2011/04/bwb_picks.jpg" alt="" width="530" height="387" /></a></p>
<p>In a tournament full of upsets, there were two games that surprised our participants most. Not surprisingly, these were the games where the two #1 seeds (Duke and Ohio State) lost in the Sweet Sixteen, which just under 10 percent of people picked. There was one person from Glenolden, PA (perhaps an embittered Villanova fan?) who picked both Duke and Ohio State to lose, but ended up picking only three other games correctly.</p>
<p>All in all, a fun little experiment. We want to send a big thanks to our friends over at Citizen Effect for putting together the pool and inviting us to participate.</p>
<hr />
**This doesn&#8217;t include our VP of Engineering, a proud Butler alum who took his alma mater all the way to the championship. Sorry, Brian, we all wanted your boys to win, too.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/04/crowdflower-and-citizen-effect-so-much-madness/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>March Meetup Video</title>
		<link>http://blog.crowdflower.com/2011/03/march-meetup-video/</link>
		<comments>http://blog.crowdflower.com/2011/03/march-meetup-video/#comments</comments>
		<pubDate>Thu, 31 Mar 2011 01:44:09 +0000</pubDate>
		<dc:creator>Mollie Allick</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Meetup]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Office Life]]></category>
		<category><![CDATA[Translation]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2316</guid>
		<description><![CDATA[Couldn&#8217;t make the March Meetup at CrowdFlower? Check out the video highlights below. If you like what you see above, join our Crowdsourcing Work Meetup Group for news on upcoming Meetups.]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/03/march-meetup-video/" data-text="March Meetup Video" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/03/march-meetup-video/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/03/march-meetup-video/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/03/march-meetup-video/"></g:plusone></div></div><p>Couldn&#8217;t make the March Meetup at CrowdFlower? Check out the video highlights below.</p>
<p><iframe src="http://player.vimeo.com/video/21709430" width="400" height="300" frameborder="200"></iframe></p>
<p>If you like what you see above, join our <a href="http://www.meetup.com/Distributed-Work/">Crowdsourcing Work Meetup Group</a> for news on upcoming Meetups.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/03/march-meetup-video/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Oscar Fever: The Sequel!</title>
		<link>http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/</link>
		<comments>http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/#comments</comments>
		<pubDate>Fri, 04 Mar 2011 22:51:03 +0000</pubDate>
		<dc:creator>Patrick Philips and Joseph Childress</dc:creator>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Wisdom of Small Crowds]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2190</guid>
		<description><![CDATA[The votes are in from our Oscar crowdsourcing experiment, and the crowd successfully picked the winners of 14 of the academy awards. For reference, Roger Ebert got 15 predictions correct so we&#8217;d have to conclude that the crowd performed reasonably well at predicting the winners of this glorified popularity contest. One fascinating thing about aggregating responses [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/" data-text="Oscar Fever: The Sequel!" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/"></g:plusone></div></div><p>The votes are in from <a href="http://blog.crowdflower.com/2011/02/oscar-fever/" target="_blank">our Oscar crowdsourcing experiment</a>, and the crowd successfully picked the winners of 14 of the academy awards. For reference, <a href="http://rogerebert.suntimes.com/apps/pbcs.dll/article?AID=/20110210/OSCARS/110219999" target="_blank">Roger Ebert got 15 predictions correct</a> so we&#8217;d have to conclude that the crowd performed reasonably well at predicting the winners of this glorified popularity contest.</p>
<div id="attachment_2191" class="wp-caption aligncenter" style="width: 808px"><a rel="attachment wp-att-2191" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/actual_results/"><img class="size-full wp-image-2191" title="actual_results" src="http://blog.crowdflower.com/wp-content/uploads/2011/03/actual_results.jpg" alt="movie picks" width="798" height="414" /></a><p class="wp-caption-text">Predicted and Actual Winners of the 2011 Academy Awards</p></div>
<p><span id="more-2190"></span><br />
One fascinating thing about aggregating responses is that the crowd as a whole will often outperform the average worker. In this case, among the 500 people we polled, the majority of respondents picked fewer than 10 awards correctly (mean of 9.6 and median of 9). And yet, by aggregating all the responses, such that the nominee with the most &#8220;votes&#8221; is predicted to win, the crowd as a whole correctly picked 14 awards. While the &#8220;wisdom of crowds&#8221; doesn&#8217;t come as much of a surprise, it&#8217;s always reassuring to see it confirmed in new applications.</p>
<p><a rel="attachment wp-att-2199" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/correct_histogram1/"><img class="aligncenter size-full wp-image-2199" title="correct_histogram1" src="http://blog.crowdflower.com/wp-content/uploads/2011/03/correct_histogram1.jpg" alt="" width="614" height="445" /></a></p>
<p>As we noted in<a href="http://blog.crowdflower.com/2011/02/oscar-fever/"> our earlier post</a>, though, the more interesting question was whether workers who indicated higher confidence in their responses would outperform workers with lower confidence. Looking at the results, however, we saw no significant correlation between a worker&#8217;s predicted accuracy and  actual performance.</p>
<div id="attachment_2222" class="wp-caption aligncenter" style="width: 823px"><a rel="attachment wp-att-2222" href="http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/scatts/"><img class="size-full wp-image-2222" title="scatts" src="http://blog.crowdflower.com/wp-content/uploads/2011/03/scatts.jpg" alt="" width="813" height="547" /></a><p class="wp-caption-text">&quot;Squint all you want, but there&#39;s no pattern&quot;</p></div>
<p>While it&#8217;s certainly possible that we didn&#8217;t offer enough of an incentive for workers to estimate their own accuracy, the more likely explanation is that predicting the winners of the Oscars is not something that a person can do with any degree of certainty. Confident or not, the people we polled did not see the &#8220;Inside Job&#8221; coming.</p>
<p>As a final exercise, we ran a regression on every explanatory variable we could find, including what state workers came from, what day they made their predictions, whether they made their predictions during the day or at night, how long they spent making their predictions and even their historical accuracy on other CrowdFlower tasks. The only variable with any significance turned out to be how long they spent on making their predictions, and while it was significant (at p=0.001), no model we could come up with explained more than 5 percent of the total variation in accuracy.</p>
<p>While the wisdom of crowds seems to extend to picking Oscar winners, the more interesting experiment of having workers self-select as trustworthy is ongoing. In the future, it would be worthwhile to repeat this experiment with questions that can be answered objectively and without uncertainty (solving algebra problems seems like a good candidate), to see if any correlation emerges between predicted and actual accuracy.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/03/oscar-fever-the-sequel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Oscar Fever</title>
		<link>http://blog.crowdflower.com/2011/02/oscar-fever/</link>
		<comments>http://blog.crowdflower.com/2011/02/oscar-fever/#comments</comments>
		<pubDate>Sat, 26 Feb 2011 18:48:16 +0000</pubDate>
		<dc:creator>Patrick Philips and Joseph Childress</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=2142</guid>
		<description><![CDATA[With the most glamorous award ceremony of the year just around the corner, we asked 500 people from across the United States to help predict who the big winners are going to be. Below are their predictions, sorted in descending order of agreement. Based on these early results, Pixar looks like a lock for yet another [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/02/oscar-fever/" data-text="Oscar Fever" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/02/oscar-fever/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/02/oscar-fever/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/02/oscar-fever/"></g:plusone></div></div><p>With the most glamorous award ceremony of the year just around the corner, we asked 500 people from across the United States to help predict who the big winners are going to be. Below are their predictions, sorted in descending order of agreement.</p>
<div id="attachment_2185" class="wp-caption aligncenter" style="width: 676px"><a rel="attachment wp-att-2185" href="http://blog.crowdflower.com/2011/02/oscar-fever/predictions/"><img class="size-full wp-image-2185" title="predictions" src="http://blog.crowdflower.com/wp-content/uploads/2011/02/prediction_table.jpg" alt="" width="666" height="449" /></a><p class="wp-caption-text">&quot;I&#39;d like to thank all the little people...&quot;</p></div>
<p>Based on these early results, Pixar looks like a lock for yet another Best Animated Picture award and Natalie Portman had better start polishing her acceptance speech.</p>
<p><span id="more-2142"></span></p>
<p>Anyone can make anonymous predictions, so we made things interesting by rewarding workers for answering correctly. We structured the job such that everyone receives 5 cents for completing the survey, plus an additional 2 cents for each correct answer.</p>
<p>To spice things up even more, we asked workers to guess how many predictions they would answer correctly (their &#8220;Magic Number,&#8221; with a minimum of 5), with the stipulation that we will pay bonuses only to those workers who answer at least their Magic Number of predictions correctly.</p>
<p>As you can see below, while quite a few workers played it safe by selecting the minimum number of correct responses (&#8220;5&#8243;), the most frequent choice was a Magic Number of 10. Also, more than a few brave souls thought they would get <em>every</em> prediction correct.</p>
<p><a rel="attachment wp-att-2144" href="http://blog.crowdflower.com/2011/02/oscar-fever/magic_numbers/"><img class="aligncenter size-full wp-image-2144" title="magic_numbers" src="http://blog.crowdflower.com/wp-content/uploads/2011/02/magic_numbers.jpg" alt="" width="680" height="449" /></a></p>
<p>But the big question, apart from whether Darren Aronofsky (Black Swan) can edge out David Fincher (The Social Network) for Best Director, is whether any correlation exists between worker confidence and actual performance. When workers have a cash incentive to estimate their own accuracy correctly, do the self-labeled &#8220;experts&#8221; perform any better  than the &#8220;novices&#8221; at predicting Oscar winners?</p>
<p>Make some popcorn, grab your Kleenex and stay tuned. After they&#8217;ve rolled up the red carpet, we&#8217;ll come back and see who did a better job of predicting the winners.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/02/oscar-fever/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>And the crowd goes wild (revisited)</title>
		<link>http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/</link>
		<comments>http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/#comments</comments>
		<pubDate>Thu, 03 Feb 2011 15:45:51 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=1947</guid>
		<description><![CDATA[A few months ago, we created a crowdsourced ranking of NFL players according to their fantasy value. For those just tuning in, we took the Top 75 players in ESPN&#8217;s preseason preview as the experimental sample, paired them against each other, then had workers through the CrowdFlower platform select the better of the two. Now [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/" data-text="And the crowd goes wild (revisited)" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/"></g:plusone></div></div><p>A few months ago, we <a href="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/">created a crowdsourced ranking of NFL players</a> according to their fantasy value. For those just tuning in, we took the Top 75 players in ESPN&#8217;s preseason preview as the experimental sample, paired them against each other, then had workers through the CrowdFlower platform select the better of the two.</p>
<p><a rel="attachment wp-att-1949" href="http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/fastasy_ui/"><img class="aligncenter size-full wp-image-1949" src="http://blog.crowdflower.com/wp-content/uploads/2011/01/fastasy_UI.jpg" alt="" width="757" height="369" /></a> Now that the regular season has finished, let&#8217;s see who did a better job of predicting the final ranking of this sample. Of the 75 players in the sample, only 28 finished in the Top 50. In other words, ESPN&#8217;s team of experts failed to predict nearly half of the most valuable players in 2010.</p>
<p><span id="more-1947"></span></p>
<p><a rel="attachment wp-att-2000" href="http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/final_ranks/"><img class="aligncenter size-full wp-image-2000" src="http://blog.crowdflower.com/wp-content/uploads/2011/01/final_ranks.jpg" alt="" width="521" height="586" /></a></p>
<p>We brushed off this halting start by Team ESPN and looked at those players in the preseason sample who did finish in the Top 50. Did Team ESPN or Team CrowdFlower do a better job of ranking them? To find out, we looked at which of the two ranking systems contained more of the top players at a number of intervals.</p>
<p>For example, of the seven most valuable players, five were in the preseason sample. Team CrowdFlower correctly picked four of the five, which we&#8217;ll call a Precision Score at 7 of 0.8 (that is 4 out of 5 players, or 80 percent). Team ESPN, on the other hand, picked only two of the five players to finish in the Top 7, corresponding with a Precision Score at 7 of 0.4.</p>
<p style="text-align: center">
<p><a rel="attachment wp-att-1998" href="http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/precisionatk1/"><img class="aligncenter size-full wp-image-1998" src="http://blog.crowdflower.com/wp-content/uploads/2011/01/PrecisionAtK1.jpg" alt="" width="656" height="462" /></a></p>
<p>As you can see in the graph above, crowdsourced workers were at least as good as ESPN&#8217;s experts. For the Top 5 Rated players, Team CrowdFlower predicted all three of the players in the preseason sample, while Team ESPN picked zero. For the Top 20 Rated players, Team CrowdFlower predicted 9 of the 13 players in the preseason sample (Precision Score at 20 of 0.69) while Team ESPN predicted only 6 of the 13 players (Precision Score at 20 of 0.46).</p>
<p>Across the board, crowdsourced workers never performed worse than a professional team of experts at predicting the value of Fantasy Football players. Approximately 80 percent of the time, crowdsourced workers were better. Most interesting, the biggest difference between crowdsourced workers and experts was for the most valuable players.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2011/02/and-the-crowd-goes-wild-revisited/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Visions and revisions</title>
		<link>http://blog.crowdflower.com/2010/10/visions-and-revisions/</link>
		<comments>http://blog.crowdflower.com/2010/10/visions-and-revisions/#comments</comments>
		<pubDate>Sat, 30 Oct 2010 00:46:42 +0000</pubDate>
		<dc:creator>Josh Eveleth</dc:creator>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Wisdom of Small Crowds]]></category>
		<category><![CDATA[art]]></category>
		<category><![CDATA[Hobbes]]></category>
		<category><![CDATA[Sandburg]]></category>
		<category><![CDATA[Shakespeare]]></category>
		<category><![CDATA[Writing]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=1734</guid>
		<description><![CDATA[Writing is easy. Just sit in front of a typewriter, open up a vein and bleed it out drop by drop. &#8211; Red Smith When I was in college, a professor I respected said that one of the best ways to demystify writing is to write like people you admire. Specifically, to find passages that [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2010/10/visions-and-revisions/" data-text="Visions and revisions" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2010/10/visions-and-revisions/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2010/10/visions-and-revisions/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2010/10/visions-and-revisions/"></g:plusone></div></div><blockquote><p>
Writing is easy. Just sit in front of a typewriter, open up a vein and bleed it out drop by drop.<br />
<span style="font-size: 13.3333px;">&#8211; Red Smith</span></p></blockquote>
<div id="attachment_1434" class="wp-caption alignnone" style="width: 330px"><img src="http://blog.crowdflower.com/wp-content/uploads/2010/10/Underwoodfive.jpg" width="320" height="240" /></a><p class="wp-caption-text">Underwood Five Typewriter</p></div>
<p>When I was in college, a professor I respected said that one of the best ways to demystify writing is to write like people you admire. Specifically, to find passages that you love, and try to revise them in your own words. This exercise proved invaluable. It allowed me to walk in their literary footsteps, shedding light on why they chose &#8212; or avoided &#8212; certain words, punctuation, and syntax.</p>
<p>With this in mind, I recently wondered whether you can crowdsource writing, specifically, revising.</p>
<p><span id="more-1734"></span></p>
<p>I posted a task through CrowdFlower that asked the crowd to rewrite four famous quotations, pithily, while preserving their meaning.</p>
<p><img src="http://blog.crowdflower.com/wp-content/uploads/2010/10/Screen-shot-2010-10-29-at-11.59.44-AM.png"></p>
<p>In one evening, I was able to get 20 revisions of each quotation from people across the country. I won&#8217;t summarize them all here, but I will pull a few highlights.</p>
<p><strong>Original Quotation 1 (from <em><a href="http://www.bartleby.com/100/138.31.118.html">Macbeth</a></em>, William Shakespeare):</strong></p>
<blockquote><p>
To-morrow, and to-morrow, and to-morrow,/ Creeps in this petty pace from day to day,/ To the last syllable of recorded time;/ And all our yesterdays have lighted fools/ The way to dusty death. Out, out, brief candle!/ Life&#8217;s but a walking shadow, a poor player/ That struts and frets his hour upon the stage/ And then is heard no more. It is a tale/ Told by an idiot, full of sound and fury/ Signifying nothing.</p></blockquote>
<p><strong>Revision 1:</strong></p>
<blockquote><p>
Time marches on, and everyone dies; life is meaningless.<br />
<span style="font-size: 13.3333px;">&#8211; Hatboro, PA</span></p></blockquote>
<blockquote><p>
Life creeps along and ends suddenly like the end of a bad play. The play is dramatic and had poor acting, and has no point or moral in the end.<br />
<span style="font-size: 13.3333px;">&#8211;Salt Lake City, UT</span></p></blockquote>
<p><strong>Original Quotation 2 (from &#8220;<a href="http://www.bartleby.com/100/160.2.html">The Leviathan</a>,&#8221; Thomas Hobbes):</strong></p>
<blockquote><p>
No arts, no letters, no society, and which is worst of all, continual fear and danger of violent death, and the life of man solitary, poor, nasty, brutish, and short.</p></blockquote>
<p><strong>Revision 2:</strong></p>
<blockquote><p>
The life of a man on his own is barbaric and degrading.<br />
<span style="font-size: 13.3333px;">&#8211; East Aurora, NY</span></p></blockquote>
<blockquote><p>
No art, letters or society. Worst of all, living in fear of being alone, poor and short.<br />
<span style="font-size: 13.3333px;">&#8211; Arlington, TX</span></p></blockquote>
<blockquote><p>
Life is all but a mere scam.<br />
<span style="font-size: 13.3333px;">&#8211;Overland Park, KS</span></p></blockquote>
<p><strong>Original Quotation 3 (&#8220;<a href="http://www.bartleby.com/124/pres31.html">First Inaugural Address</a>,&#8221; Abraham Lincoln):</strong></p>
<blockquote><p>
We are not enemies, but friends. We must not be enemies. Though passion may have strained it must not break our bonds of affection. The mystic chords of memory, stretching from every battlefield and patriot grave to every living heart and hearthstone all over this broad land, will yet swell the chorus of the Union, when again touched, as surely they will be, by the better angels of our nature.</p></blockquote>
<p><strong>Revision 3:</strong></p>
<blockquote><p>
We&#8217;re fools for fighting each other. We should co-operate, and let our example bring everyone together.<br />
<span style="font-size: 13.3333px;">&#8211; Plattsburgh, NY</span></p></blockquote>
<blockquote><p>
We must be friends, we should now forget each other. Our memories will always stay through our rough times and good times.<br />
<span style="font-size: 13.3333px;">&#8211; Tallahassee, FL</span></p></blockquote>
<p><strong>Original Quotation 4 (from &#8220;<a href="http://www.bartleby.com/165/1.html">Chicago</a>,&#8221; Carl Sandburg):</strong></p>
<blockquote><p>
&#8220;They tell me you are wicked and I believe them, for I have seen your painted women under the gas lamps luring the farm boys.&#8221;</p></blockquote>
<p><strong>Revision 4:</strong></p>
<blockquote><p>
Your reputation follows you and it is a bad one, makeup and lust.<br />
<span style="font-size: 13.3333px;">&#8211; Iola, WI</span></p></blockquote>
<blockquote><p>
This city isn&#8217;t a nice place, it&#8217;s full of hookers.<br />
<span style="font-size: 13.3333px;">&#8211; Milledgeville, GA</span></p></blockquote>
<p>The full data is available <a href="http://blog.crowdflower.com/wp-content/uploads/2010/10/f17005-2.csv">here</a>.</p>
<p>What&#8217;s your opinion? Can the revision process be crowdsourced?</p>
<p>I&#8217;d love to hear your thoughts.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2010/10/visions-and-revisions/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>And the crowd goes wild &#8230;</title>
		<link>http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/</link>
		<comments>http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/#comments</comments>
		<pubDate>Mon, 18 Oct 2010 15:00:29 +0000</pubDate>
		<dc:creator>Patrick Philips and Joseph Childress</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[sports]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=1559</guid>
		<description><![CDATA[As the NFL season loomed on the horizon back in August, there was a lot of fantasy football talk around the CrowdFlower office. Naturally, we decided to crowdsource a killer fantasy football team. We asked CrowdFlower workers to help us build a ranked list of the Top 75 players to guide our fantasy football draft. [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:left;"><div class="socialize-in-button socialize-in-button-left"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/" data-text="And the crowd goes wild &#8230;" data-count="vertical" data-via="crowdflower" ><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-left"><script>
			<!-- 
			var fbShare = {
				url: "http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/",
				size: "large",
				google_analytics: "true"
			}
			//-->
			</script>
                        <script src="http://widgets.fbshare.me/files/fbshare.js"></script></div><div class="socialize-in-button socialize-in-button-left"><script type="in/share" data-url="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/" data-counter="top"></script></div><div class="socialize-in-button socialize-in-button-left"><g:plusone size="small" href="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/"></g:plusone></div></div><p>As the NFL season loomed on the horizon back in August, there was a lot of fantasy football talk around the CrowdFlower office. Naturally, we decided to crowdsource a killer fantasy football team. </p>
<div id="attachment_1602" class="wp-caption alignnone" style="width: 210px"><a href="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/fantasyfootballtackle-2/" rel="attachment wp-att-1602"><img class="centered" src="http://blog.crowdflower.com/wp-content/uploads/2010/10/fantasyfootballtackle1.jpg" alt="Fantasy football tackle" title="fantasyfootballtackle"/></a><p class="wp-caption-text">Fantasy football tackle. Photo Credit: dennis</p></div>
<p><span id="more-1559"></span></p>
<p>We asked CrowdFlower workers to help us build a ranked list of the Top 75 players to guide our fantasy football draft. We used pair-wise comparisons to determine rankings. Specifically, we presented workers with two players, asking them to pick which player they thought would be more valuable. </p>
<p>We used the Top 75 players from ESPN’s 2010 Fantasy Football Draft Kit<sup>1</sup>, matching each player with his 74 counterparts, giving us a total of 2,775 player pairs. We also included each player’s position and team, as well as a link to more detailed statistics.</p>
<div id="attachment_1563" class="wp-caption alignnone" style="width: 912px"><a href="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/fantasyfootballtask/" rel="attachment wp-att-1563"><img class="centered" src="http://blog.crowdflower.com/wp-content/uploads/2010/10/fantasyfootballtask.png" alt="Fantasy football task on CrowdFlower" title="fantasyfootballtask"/></a><p class="wp-caption-text">Screenshot of fantasy football task.</p></div>
<p>After the job finished, we ordered the players according to number of head-to-head victories.</p>
<div id="attachment_1565" class="wp-caption alignnone" style="width: 670px"><a href="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/player_rankings/" rel="attachment wp-att-1565"><img class="centered" src="http://blog.crowdflower.com/wp-content/uploads/2010/10/player_rankings.png" alt="player rankings" title="player_rankings" /></a><p class="wp-caption-text">Crowdsourced list of the Top 25 most valuable fantasy football players.</p></div><br />
(Download the full results <a href="http://blog.crowdflower.com/wp-content/uploads/2010/10/initial-ranking.xlsx">here</a>.)</p>
<p>We found that workers choose the player on the left 53 percent of the time, even though each match-up appears twice, once with Player A on the left and again with Player A on the right. With 5,500 data points, this is significant bias toward the player on the left. However, the final ranking doesn’t change even after accounting for this bias.</p>
<p>We settled on two likely explanations for the bias:</p>
<ol>
<li>The anchoring effect of putting something on the left (and as the first response option) may have caused more workers to select the first player. </li>
<li>Our <a href="http://crowdflower.com/docs/gold" "target=_blank">Gold</a> was slightly biased toward the player on the left, which we have previously seen<sup>2</sup> to have an effect on the overall distribution of the answers.</li>
</ol>
<p>As a first comparison of our crowd of football fans with ESPN’s Fantasy Football brain trust, we plotted each player by the difference between his two rankings. </p>
<p><div id="attachment_1561" class="wp-caption alignnone" style="width: 912px"><a href="http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/espn_vs_crowd/" rel="attachment wp-att-1561"><img class="centered" src="http://blog.crowdflower.com/wp-content/uploads/2010/10/espn_vs_crowd.png" alt="crowd rankings vs. espn rankings" title="espn_vs_crowd"/></a><p class="wp-caption-text">Players on the left were ranked higher by ESPN, while players on the right were ranked higher by AMT.</p></div>
<p>Will Chad Ochocinco and Matt Ryan vindicate the crowd? Stay tuned to find out.</p>
<hr />
1. <a href="http://games.espn.go.com/frontpage/ffldraftkit" "target=_blank">http://games.espn.go.com/frontpage/ffldraftkit</a><br />
2. <a href="http://www.ischool.utexas.edu/~cse2010/slides/le.pptx" "target=_blank">http://www.ischool.utexas.edu/~cse2010/slides/le.pptx</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2010/10/and-the-crowd-goes-wild/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

