<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Bing is an Improvement over Live, but Still Not Google Quality: Evaluating Bing With Mechanical Turk</title>
	<atom:link href="http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/</link>
	<description></description>
	<lastBuildDate>Wed, 07 Dec 2011 22:49:55 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: gino</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-2430</link>
		<dc:creator>gino</dc:creator>
		<pubDate>Sat, 27 Feb 2010 00:17:08 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-2430</guid>
		<description>I&#039;d like to know if someone has already carried out a blind test on search engine results.

It could be very interesting to ask users which results are considered best without knowing the name of the search engine that produced that particular SERP. This could avoid the risk that users&#039; opinions could be influenced by brand related &#039;noise&#039;.

Obviously the results could be compared only on a semantic basis, but I think that the statistical reliability could be significantly better.</description>
		<content:encoded><![CDATA[<p>I&#8217;d like to know if someone has already carried out a blind test on search engine results.</p>
<p>It could be very interesting to ask users which results are considered best without knowing the name of the search engine that produced that particular SERP. This could avoid the risk that users&#8217; opinions could be influenced by brand related &#8216;noise&#8217;.</p>
<p>Obviously the results could be compared only on a semantic basis, but I think that the statistical reliability could be significantly better.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jessica</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1504</link>
		<dc:creator>Jessica</dc:creator>
		<pubDate>Mon, 21 Sep 2009 13:35:29 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1504</guid>
		<description>Bing is currently dropping sites and pages like theres no tomorrow (just like MSN always has done)i&#039;ve noticed this with several sites.. they do come back though in the majority of cases.
Bing is a joke.. the all new search engine, yet behaves exactly the same as MSN and returns the same results.</description>
		<content:encoded><![CDATA[<p>Bing is currently dropping sites and pages like theres no tomorrow (just like MSN always has done)i&#8217;ve noticed this with several sites.. they do come back though in the majority of cases.<br />
Bing is a joke.. the all new search engine, yet behaves exactly the same as MSN and returns the same results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Santiago</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1402</link>
		<dc:creator>Santiago</dc:creator>
		<pubDate>Sun, 09 Aug 2009 18:29:06 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1402</guid>
		<description>Hi, I´ve contacted you by email some days ago and did not have any feedback. My email address is in the email box</description>
		<content:encoded><![CDATA[<p>Hi, I´ve contacted you by email some days ago and did not have any feedback. My email address is in the email box</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bset</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1271</link>
		<dc:creator>bset</dc:creator>
		<pubDate>Mon, 22 Jun 2009 16:02:34 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1271</guid>
		<description>Similar to the experiment using 100 random queries, here is another example of the same type, where users can plugin their queries and select the most relevant search engine themselves. &lt;a href=&quot;http://bset.royans,net&quot; rel=&quot;nofollow&quot;&gt;bset.royans.net&lt;/a&gt;

Google and Yahoo both seem to be much better than Bing, though Google is a leader by a long margin. Whats also interesting is that, it looks like different search engines might be better for different types of content ( or could be based on location, language)</description>
		<content:encoded><![CDATA[<p>Similar to the experiment using 100 random queries, here is another example of the same type, where users can plugin their queries and select the most relevant search engine themselves. <a href="http://bset.royans,net" rel="nofollow">bset.royans.net</a></p>
<p>Google and Yahoo both seem to be much better than Bing, though Google is a leader by a long margin. Whats also interesting is that, it looks like different search engines might be better for different types of content ( or could be based on location, language)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: LisaStratus</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1250</link>
		<dc:creator>LisaStratus</dc:creator>
		<pubDate>Sun, 14 Jun 2009 13:00:39 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1250</guid>
		<description>Very much a prompt reply :)</description>
		<content:encoded><![CDATA[<p>Very much a prompt reply :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rourbboob</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1238</link>
		<dc:creator>rourbboob</dc:creator>
		<pubDate>Thu, 11 Jun 2009 22:58:13 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1238</guid>
		<description>visit us!
newsbox.cc
newsbox.us
nbstatus.wordpress.com
NOW!</description>
		<content:encoded><![CDATA[<p>visit us!<br />
newsbox.cc<br />
newsbox.us<br />
nbstatus.wordpress.com<br />
NOW!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1235</link>
		<dc:creator>David</dc:creator>
		<pubDate>Thu, 11 Jun 2009 13:18:36 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1235</guid>
		<description>@Offbeatmammal: The problem with Blind Search is that the search engine used to retrieve each column of results in clearly marked in the page&#039;s source code. Anyone who can use &quot;View Source&quot; can check which column is which and vote accordingly.</description>
		<content:encoded><![CDATA[<p>@Offbeatmammal: The problem with Blind Search is that the search engine used to retrieve each column of results in clearly marked in the page&#8217;s source code. Anyone who can use &#8220;View Source&#8221; can check which column is which and vote accordingly.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vaibhav</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1234</link>
		<dc:creator>Vaibhav</dc:creator>
		<pubDate>Thu, 11 Jun 2009 12:03:58 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1234</guid>
		<description>Hi Anil,

I had to write a post on this topic after reading yours. Here&#039;s a link: http://blog.gadodia.net/bing-vs-google-no-fancy-analytics-pure-personal-experience/</description>
		<content:encoded><![CDATA[<p>Hi Anil,</p>
<p>I had to write a post on this topic after reading yours. Here&#8217;s a link: <a href="http://blog.gadodia.net/bing-vs-google-no-fancy-analytics-pure-personal-experience/" rel="nofollow">http://blog.gadodia.net/bing-vs-google-no-fancy-analytics-pure-personal-experience/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael S.</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1233</link>
		<dc:creator>Michael S.</dc:creator>
		<pubDate>Thu, 11 Jun 2009 08:58:44 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1233</guid>
		<description>Do any of the results vary much by location, or previous search history?  Google&#039;s results seem to vary significantly by location, for example, which is frequently annoying, but I guess it must affect the results for the better if the search engines are doing it.</description>
		<content:encoded><![CDATA[<p>Do any of the results vary much by location, or previous search history?  Google&#8217;s results seem to vary significantly by location, for example, which is frequently annoying, but I guess it must affect the results for the better if the search engines are doing it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vaibhav</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1231</link>
		<dc:creator>Vaibhav</dc:creator>
		<pubDate>Thu, 11 Jun 2009 07:11:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1231</guid>
		<description>Let me put it this way, even though I switched my default search engine to Bing, I consistently have to go back to Google to get what I want (after searching on Bing first).

I would love for Bing to be better, for the simple reason that even Google doesn&#039;t do the best job in search. 

But Bing is just not competing, IMO - look at the difference between these two queries:
http://tinyurl.com/l52fkt and http://tinyurl.com/ltbyl9

It looks to me that Google understands user intent better</description>
		<content:encoded><![CDATA[<p>Let me put it this way, even though I switched my default search engine to Bing, I consistently have to go back to Google to get what I want (after searching on Bing first).</p>
<p>I would love for Bing to be better, for the simple reason that even Google doesn&#8217;t do the best job in search. </p>
<p>But Bing is just not competing, IMO &#8211; look at the difference between these two queries:<br />
<a href="http://tinyurl.com/l52fkt" rel="nofollow">http://tinyurl.com/l52fkt</a> and <a href="http://tinyurl.com/ltbyl9" rel="nofollow">http://tinyurl.com/ltbyl9</a></p>
<p>It looks to me that Google understands user intent better</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: brendano</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1230</link>
		<dc:creator>brendano</dc:creator>
		<pubDate>Wed, 10 Jun 2009 19:08:44 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1230</guid>
		<description>err, i don&#039;t mean differences, i mean the comparison responses (of course)</description>
		<content:encoded><![CDATA[<p>err, i don&#8217;t mean differences, i mean the comparison responses (of course)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: brendano</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1229</link>
		<dc:creator>brendano</dc:creator>
		<pubDate>Wed, 10 Jun 2009 17:19:45 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1229</guid>
		<description>Xianhang: that&#039;s what the null hypothesis tests are for.  The first graph (bing vs google) has about p=0.04, which means that if people were guessing randomly, there&#039;s less than a 4% chance we would have seen the sort of results we saw.  5% is the customary threshold for &quot;statistical significance.&quot;  The second graph, as we stated, could have been due to chance (it&#039;s p-value was higher).

That&#039;s why we say we think google is slightly better than bing, but it&#039;s a little bit of a wash whether bing is better than live.

Panos: i think it was a one-sample t-test of the differences, but i&#039;m not sure</description>
		<content:encoded><![CDATA[<p>Xianhang: that&#8217;s what the null hypothesis tests are for.  The first graph (bing vs google) has about p=0.04, which means that if people were guessing randomly, there&#8217;s less than a 4% chance we would have seen the sort of results we saw.  5% is the customary threshold for &#8220;statistical significance.&#8221;  The second graph, as we stated, could have been due to chance (it&#8217;s p-value was higher).</p>
<p>That&#8217;s why we say we think google is slightly better than bing, but it&#8217;s a little bit of a wash whether bing is better than live.</p>
<p>Panos: i think it was a one-sample t-test of the differences, but i&#8217;m not sure</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Panos Ipeirotis</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1228</link>
		<dc:creator>Panos Ipeirotis</dc:creator>
		<pubDate>Wed, 10 Jun 2009 17:14:39 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1228</guid>
		<description>Lukas, what test did you run to confirm the statistical significance?</description>
		<content:encoded><![CDATA[<p>Lukas, what test did you run to confirm the statistical significance?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1227</link>
		<dc:creator>David</dc:creator>
		<pubDate>Wed, 10 Jun 2009 16:13:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1227</guid>
		<description>Kragen - tinfoil hat much?</description>
		<content:encoded><![CDATA[<p>Kragen &#8211; tinfoil hat much?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Offbeatmammal</title>
		<link>http://blog.crowdflower.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1225</link>
		<dc:creator>Offbeatmammal</dc:creator>
		<pubDate>Wed, 10 Jun 2009 15:12:40 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/06/bing-an-improvement-over-live-but-still-not-google-quality-evaluating-bing-with-mechanical-turk/#comment-1225</guid>
		<description>For a quick and simple way to &quot;taste test&quot; the difference yourself check out the amazing Blind Search - http://blindsearch.fejus.com/

Adds Yahoo to the mix and removes the logos so you don&#039;t know</description>
		<content:encoded><![CDATA[<p>For a quick and simple way to &#8220;taste test&#8221; the difference yourself check out the amazing Blind Search &#8211; <a href="http://blindsearch.fejus.com/" rel="nofollow">http://blindsearch.fejus.com/</a></p>
<p>Adds Yahoo to the mix and removes the logos so you don&#8217;t know</p>
]]></content:encoded>
	</item>
</channel>
</rss>

