<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The CrowdFlower Blog &#187; Media</title>
	<atom:link href="http://blog.crowdflower.com/topics/media/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.crowdflower.com</link>
	<description></description>
	<lastBuildDate>Tue, 31 Aug 2010 22:03:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Crowdsourcing the Goldman Sachs Investigation</title>
		<link>http://blog.crowdflower.com/2010/06/crowdsourcing-the-goldman-sachs-investigation/</link>
		<comments>http://blog.crowdflower.com/2010/06/crowdsourcing-the-goldman-sachs-investigation/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 23:31:51 +0000</pubDate>
		<dc:creator>Josh Eveleth</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[Economy]]></category>
		<category><![CDATA[Goldman]]></category>
		<category><![CDATA[Government]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=693</guid>
		<description><![CDATA[When federal investigators asked Goldman Sachs for its transactions with insurance giant AIG, Goldman turned over the information — several hundred billion pages’ worth. John Carney, senior editor at CNBC.com, had an idea for sifting through the data —&#160;crowdsource it. We agree. In fact, CrowdFlower will categorize and tag the first 100,000 documents at no [...]]]></description>
			<content:encoded><![CDATA[<p>When federal investigators asked Goldman Sachs for its transactions with insurance giant AIG, Goldman turned over the information — several hundred billion pages’ worth.</p>
<p>John Carney, senior editor at CNBC.com, had an idea for sifting through the data —&nbsp;<a href="http://www.cnbc.com/id/37619147" TARGET="_blank">crowdsource it</a>. </p>
<p>We agree. In fact, CrowdFlower will categorize and tag the first 100,000 documents at no cost to the government.</p>
<p>If you’re just tuning in, the federal Financial Crisis Inquiry Commission (FCIC) subpoenaed Goldman for its AIG transactions, following accusations that Goldman cooked up a mortgage investment scheme that was rigged to fail.</p>
<p>FCIC has around 50 employees, an $8 million budget, and roughly six months to pore over the five terabytes of data. (Can you say, “Too small to succeed”?)</p>
<p><span id="more-693"></span></p>
<p>Clearly, technology presents a double-edged sword for investigators and other regulators.</p>
<p>On the one hand, companies under investigation can use technology to more efficiently bury investigators in terabytes of data (paging Goldman Sachs). On the other hand, technology provides tools for deftly sifting through the data (enter crowdsourcing).</p>
<p>Crowdsourcing public documents may be relatively new, but it’s not unprecedented. In fact, the British Parliament is under way with a project that uses crowdsourcing to <a href="http://mps-expenses.guardian.co.uk/" TARGET="_blank">investigate MPs’ expenses</a>. </p>
<p>We’ll keep you posted on whether the government takes up our offer.</p>
<p>&#8211; Additional contributions by Anisha Sekar.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2010/06/crowdsourcing-the-goldman-sachs-investigation/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing to find media bias: Hillary vs. Obama</title>
		<link>http://blog.crowdflower.com/2008/03/crowdsourcing-to-find-media-bias-hillary-vs-obama/</link>
		<comments>http://blog.crowdflower.com/2008/03/crowdsourcing-to-find-media-bias-hillary-vs-obama/#comments</comments>
		<pubDate>Thu, 27 Mar 2008 03:41:52 +0000</pubDate>
		<dc:creator>Brendan O'Connor</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.doloreslabs.com/?p=21</guid>
		<description><![CDATA[As anyone who follows political races knows, different sources can report the same event in very different ways. We took nearly six thousand recent articles over the past month about Clinton and Obama and sent them to Mechanical Turk to be classified as favorable or unfavorable for the respective candidates. We scraped the articles from [...]]]></description>
			<content:encoded><![CDATA[<p>As anyone who follows political races knows, different sources can report the same event in very different ways.  We took nearly six thousand recent articles over the past month about Clinton and Obama and sent them to Mechanical Turk to be classified as favorable or unfavorable for the respective candidates.  We scraped the articles from <a href="http://news.google.com/">Google News</a> restricted to several sources, and threw in front page headlines from <a href="http://digg.com/news">Digg</a>.</p>
<p>Here is the graph for favorability scores, aggregated by source.  We found that Digg was far and away the most favorable for Obama.</p>
<p><a href='http://blog.doloreslabs.com/wp-content/uploads/2008/03/obama-hillary-bysource3.png' title='obama-hillary-bysource3.png'><img src='http://blog.doloreslabs.com/wp-content/uploads/2008/03/obama-hillary-bysource3.png' alt='obama-hillary-bysource3.png' class='centered' /></a></p>
<p>
The next graph tracks overall news favorability by date.  To provide some context, we compared it with the change in Obama stock on the <a href="http://intrade.com/">Intrade</a> prediction market.</p>
<p><a href='http://blog.doloreslabs.com/wp-content/uploads/2008/03/obama-hillary-overtime.png' title='obama-hillary-overtime2.png'><img src='http://blog.doloreslabs.com/wp-content/uploads/2008/03/obama-hillary-overtime2.png' alt='obama-hillary-overtime2.png' class='centered' /></a></p>
<p>More details after the jump:</p>
<p><span id="more-21"></span></p>
<p>We created our data set by doing two separate searches, one for &#8220;Barack Obama&#8221; and one for &#8220;Hillary Clinton&#8221;.  This did a pretty good job ensuring that results from Google News or Digg&#8217;s search facility demonstrated how the article was about the given candidate.  For each article, we showed the headline, search result snippet, and link to several Turkers.  They reported whether it was positive, neutral, or negative toward the candidate.</p>
<p>The favorability metric was created by averaging the ratings across articles.  Pro-Obama and anti-Hillary articles were both worth 1 point; anti-Obama and pro-Hillary both worth -1, and neutrals 0.</p>
<p>Therefore, if all articles are either positive towards Obama or negative towards Hillary, the rating is +100%; and vice-versa for -100%.</p>
<p>The data is very noisy.  The question of favorability is extremely tricky: it includes a combination of expectations, sentiment, and the objective events a newspaper chooses to report.  All of these are hard to reliably assess or even define.  (And whether anything you measure constitutes &#8220;media bias&#8221; is another complicated question!)</p>
<p>Despite all this philosophical intractability, the data must be showing something real, because we have a statistically sound result: the difference between Digg and the others was statistically significant (<a href="http://en.wikipedia.org/wiki/Student%27s_t-test">t-test</a>, p&lt;.001).  The differences within the mainstream media were not statistically significant.</p>
<p>-<a href="http://socialscienceplusplus.blogspot.com/">Brendan</a>, <a href="http://vandev.com/">Chris</a>, <a href="http://www.lukasbiewald.com/">Lukas</a>, <a href="http://mike-love.net/">Mike</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2008/03/crowdsourcing-to-find-media-bias-hillary-vs-obama/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Less white people, more football: Sports Illustrated covers since 1954</title>
		<link>http://blog.crowdflower.com/2008/03/sports-and-race-on-sports-illustrated-magazine-covers/</link>
		<comments>http://blog.crowdflower.com/2008/03/sports-and-race-on-sports-illustrated-magazine-covers/#comments</comments>
		<pubDate>Thu, 13 Mar 2008 17:36:33 +0000</pubDate>
		<dc:creator>Brendan O'Connor</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.doloreslabs.com/?p=10</guid>
		<description><![CDATA[Human annotators are great at providing basic information about images. We were wondering if we could find something interesting about magazine covers. Stumbling upon 2800 Sports Illustrated cover images going back to 1954, we sent them to Mechanical Turk, asking people to identify the race and gender of the person featured (if any), and what [...]]]></description>
			<content:encoded><![CDATA[<p>Human annotators are great at providing basic information about images.  We were wondering if we could find something interesting about magazine covers.  Stumbling upon <a href="http://www.coverbrowser.com/covers/sports-illustrated">2800 Sports Illustrated cover images going back to 1954</a>, we sent <a href="http://assets.doloreslabs.com/jobs/si_sample.html" target="_blank">them</a> to Mechanical Turk, asking people to identify the race and gender of the person featured (if any), and what sport was depicted. There are lots of interesting things in this data; this post will touch on just a few we’ve had time to whip together some graphs for.</p>
<p>Here is a historical graph of the frequency of how often people of different races appear on the cover of Sports Illustrated.  The story is simple and striking:</p>
<p style="text-align: center"><a href="http://blog.doloreslabs.com/wp-content/uploads/2008/03/race.png" title="race.png"><img src="http://blog.doloreslabs.com/wp-content/uploads/2008/03/race.png" class='centered' /></a></p>
<p>Next: which sports get featured on the  cover?  Here’s a chart for several sports over that same time.</p>
<p align="center"><a href="http://blog.doloreslabs.com/wp-content/uploads/2008/03/sports.png" title="sports.png"><img src="http://blog.doloreslabs.com/wp-content/uploads/2008/03/sports.png" class='centered' /></a></p>
<p>It might be possible to find links between the careers of famous athletes and rises and falls their sports’ popularity; for example, boxing peaks in the 70’s (Muhammad Ali?), basketball peaks in the 90’s (Michael Jordan?) and golf bounces back in the 90’s after a long decline (Tiger Woods?).</p>
<p>Many other sports appear in the data, too; for this chart, we made sure to pick the three most common, and a few other particularly interesting ones.  Percentages don’t add up to 100% because we didn&#8217;t plot all the other sports, including things like horse racing which used to be much more popular.  If you’re really curious, <a href="http://blog.doloreslabs.com/wp-content/uploads/2008/03/sports21.png">here’s the full chart of all sports we asked about</a>, including many of the smaller ones.</p>
<p>-<a href="http://socialscienceplusplus.blogspot.com">Brendan</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2008/03/sports-and-race-on-sports-illustrated-magazine-covers/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
	</channel>
</rss>
