<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Age and Gender Stereotypes</title>
	<atom:link href="http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/</link>
	<description></description>
	<lastBuildDate>Wed, 21 Jul 2010 00:42:40 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: Nicholas</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-1058</link>
		<dc:creator>Nicholas</dc:creator>
		<pubDate>Mon, 13 Apr 2009 19:54:31 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-1058</guid>
		<description>Hi,
Variable sized buckets is just like varying the bandwidth. The problem with the binning procedure is that since the weights are equal for every point, as you increase the bucket size, the standard error will decrease. If you want to show the variability inherent in the data, the raw data potted with alpha blending would show that, if you want to show the variability in the means, use simultaneous intervals for the smoother. If all you care about is the population trend than, the bin level intervals don&#039;t matter. 

Futzing with the kernel density estimates is not a big deal, it is just that on the edges they give an erroneous picture. One of these days I will send Deepayan a patch with some more sensible options. It is a well known phenomenon that kernel smoothers have really bad problems at the edges of the observed data.

Glad you liked the ellipses.</description>
		<content:encoded><![CDATA[<p>Hi,<br />
Variable sized buckets is just like varying the bandwidth. The problem with the binning procedure is that since the weights are equal for every point, as you increase the bucket size, the standard error will decrease. If you want to show the variability inherent in the data, the raw data potted with alpha blending would show that, if you want to show the variability in the means, use simultaneous intervals for the smoother. If all you care about is the population trend than, the bin level intervals don&#8217;t matter. </p>
<p>Futzing with the kernel density estimates is not a big deal, it is just that on the edges they give an erroneous picture. One of these days I will send Deepayan a patch with some more sensible options. It is a well known phenomenon that kernel smoothers have really bad problems at the edges of the observed data.</p>
<p>Glad you liked the ellipses.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: brendano</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-1019</link>
		<dc:creator>brendano</dc:creator>
		<pubDate>Thu, 02 Apr 2009 20:53:09 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-1019</guid>
		<description>Nicholas -- we ended hacking around the problem by making a new graph that uses variable-sized buckets, so larger on the ends.  Even with the kind of dumb one-sample independent intervals, the phenomenon for the attractiveness plot emerges.  Here&#039;s the progression of graphs:
http://assets.doloreslabs.com/blog/aag_buckets_all.pdf

Yes, smoothed density plots are problematic.  We&#039;re using whatever smoother that &quot;smoothScatter&quot; uses; Lukas (not me) looked into the detail there.  All kernel smoothers are kinda weird -- i&#039;d have a hard time believing there are any good reasons to futz with them beyond changing the bandwidth/span parameter.  At least, any reasons that are both (1) principled, and (2) whose assumptions apply to real-world data analysis scenarios.

I like the ellipses.

Alex: Ooh, interesting hypothesis about age of viewers.  That&#039;s something to look in to...</description>
		<content:encoded><![CDATA[<p>Nicholas &#8212; we ended hacking around the problem by making a new graph that uses variable-sized buckets, so larger on the ends.  Even with the kind of dumb one-sample independent intervals, the phenomenon for the attractiveness plot emerges.  Here&#8217;s the progression of graphs:<br />
<a href="http://assets.doloreslabs.com/blog/aag_buckets_all.pdf" rel="nofollow">http://assets.doloreslabs.com/blog/aag_buckets_all.pdf</a></p>
<p>Yes, smoothed density plots are problematic.  We&#8217;re using whatever smoother that &#8220;smoothScatter&#8221; uses; Lukas (not me) looked into the detail there.  All kernel smoothers are kinda weird &#8212; i&#8217;d have a hard time believing there are any good reasons to futz with them beyond changing the bandwidth/span parameter.  At least, any reasons that are both (1) principled, and (2) whose assumptions apply to real-world data analysis scenarios.</p>
<p>I like the ellipses.</p>
<p>Alex: Ooh, interesting hypothesis about age of viewers.  That&#8217;s something to look in to&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-1018</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Thu, 02 Apr 2009 19:41:50 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-1018</guid>
		<description>Interesting study!

It&#039;d be interesting to understand how the age of the viewer impacts this. I&#039;m guessing that your population is skewed in some way (perhaps overrepresented around that 20-25 age group), and my (untested) hypothesis is that people are better at discerning differences in people closer to their own age. Any chance of incorporating viewer age data?

Drawing envelopes on your error bars might make the data more &quot;fair&quot;; as it is, your solid lines visually overstate their own accuracy, just because they&#039;re so prominent.</description>
		<content:encoded><![CDATA[<p>Interesting study!</p>
<p>It&#8217;d be interesting to understand how the age of the viewer impacts this. I&#8217;m guessing that your population is skewed in some way (perhaps overrepresented around that 20-25 age group), and my (untested) hypothesis is that people are better at discerning differences in people closer to their own age. Any chance of incorporating viewer age data?</p>
<p>Drawing envelopes on your error bars might make the data more &#8220;fair&#8221;; as it is, your solid lines visually overstate their own accuracy, just because they&#8217;re so prominent.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nicholas</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-997</link>
		<dc:creator>Nicholas</dc:creator>
		<pubDate>Wed, 25 Mar 2009 23:04:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-997</guid>
		<description>just to echo Hadley&#039;s, since age is a continuous variable and so is your response, you should be using simultaneous error bands. Also which smoother are you using? For correlation coefficients ellipses are better, Deepayan has an example at
http://lmdvr.r-forge.r-project.org/figures/figures.html
look at figure 13.5 
Lastly, the I don&#039;t love the density plots based on kernel smoothing, the singletons are not treated very well, and you end up with the blurred dots around the edges of the density. 
 Plotting the singletons separately sometimes helps.</description>
		<content:encoded><![CDATA[<p>just to echo Hadley&#8217;s, since age is a continuous variable and so is your response, you should be using simultaneous error bands. Also which smoother are you using? For correlation coefficients ellipses are better, Deepayan has an example at<br />
<a href="http://lmdvr.r-forge.r-project.org/figures/figures.html" rel="nofollow">http://lmdvr.r-forge.r-project.org/figures/figures.html</a><br />
look at figure 13.5<br />
Lastly, the I don&#8217;t love the density plots based on kernel smoothing, the singletons are not treated very well, and you end up with the blurred dots around the edges of the density.<br />
 Plotting the singletons separately sometimes helps.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hadley</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-941</link>
		<dc:creator>Hadley</dc:creator>
		<pubDate>Mon, 23 Feb 2009 14:50:42 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-941</guid>
		<description>Thanks for the explanation - and I&#039;m glad to hear that you&#039;re finding plyr helpful :)</description>
		<content:encoded><![CDATA[<p>Thanks for the explanation &#8211; and I&#8217;m glad to hear that you&#8217;re finding plyr helpful :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brendan O'Connor</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-937</link>
		<dc:creator>Brendan O'Connor</dc:creator>
		<pubDate>Fri, 20 Feb 2009 22:14:02 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-937</guid>
		<description>@Hadley: no, not a two-sample t-test.  The bars are 95% CI&#039;s from a one-sample t-test with a two-sided alternative.  What you get by calling R&#039;s t.test() on a single vector of numbers.  Lots of the extreme buckets have just a few instances, some have only singletons.  Giving the counts is probably more useful than all this t-test stuff.

When the CI&#039;s are so big it&#039;s silly to have single-year buckets.  This was just a first attempt.  We have a better graph since making this one, that uses bigger age buckets at the extremes.  With &lt;a href=&quot;http://had.co.nz/plyr&quot; rel=&quot;nofollow&quot;&gt;plyr&lt;/a&gt; :)

@Bruce: yeah we definitely need to get on that.  I figure that binning by race will get people the most riled up as possible...</description>
		<content:encoded><![CDATA[<p>@Hadley: no, not a two-sample t-test.  The bars are 95% CI&#8217;s from a one-sample t-test with a two-sided alternative.  What you get by calling R&#8217;s t.test() on a single vector of numbers.  Lots of the extreme buckets have just a few instances, some have only singletons.  Giving the counts is probably more useful than all this t-test stuff.</p>
<p>When the CI&#8217;s are so big it&#8217;s silly to have single-year buckets.  This was just a first attempt.  We have a better graph since making this one, that uses bigger age buckets at the extremes.  With <a href="http://had.co.nz/plyr" rel="nofollow">plyr</a> :)</p>
<p>@Bruce: yeah we definitely need to get on that.  I figure that binning by race will get people the most riled up as possible&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bruce</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-903</link>
		<dc:creator>Bruce</dc:creator>
		<pubDate>Fri, 13 Feb 2009 22:39:40 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-903</guid>
		<description>How about showing how perceived intelligence correlates with race?  or with gender?</description>
		<content:encoded><![CDATA[<p>How about showing how perceived intelligence correlates with race?  or with gender?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hadley</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-884</link>
		<dc:creator>Hadley</dc:creator>
		<pubDate>Tue, 10 Feb 2009 13:36:27 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-884</guid>
		<description>A two-sided t-test?  Do you mean that the error bars are for the difference of the means, not the means themselves?  That would be non-standard, but ok.  What alpha are you using - ideally you should be using 1.4 (sqrt(1^2 + 1^2)) because then non-overlapping confidence intervals would correspond to a p-value from the t-test of 0.05 or less.

And you must have _really_ small numbers (or distributions that are very heavy tailed) to such wide intervals.  Some plots of sample sizes would be informative.

You might also want to plot the differences directly - there is a well known visual problem where our visual system compare distances between curves based on the shortest distance between the them, not the vertical distance.</description>
		<content:encoded><![CDATA[<p>A two-sided t-test?  Do you mean that the error bars are for the difference of the means, not the means themselves?  That would be non-standard, but ok.  What alpha are you using &#8211; ideally you should be using 1.4 (sqrt(1^2 + 1^2)) because then non-overlapping confidence intervals would correspond to a p-value from the t-test of 0.05 or less.</p>
<p>And you must have _really_ small numbers (or distributions that are very heavy tailed) to such wide intervals.  Some plots of sample sizes would be informative.</p>
<p>You might also want to plot the differences directly &#8211; there is a well known visual problem where our visual system compare distances between curves based on the shortest distance between the them, not the vertical distance.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lukas</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-881</link>
		<dc:creator>lukas</dc:creator>
		<pubDate>Tue, 10 Feb 2009 04:09:53 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-881</guid>
		<description>The data is extremely biased towards people aged 20-30.  Also, not every image is labeled with every tag, so each graph plots a subset of the data where both variables were labeled by at least one person.

To answer your question explicitly, we use a two-sided t-test for the error bars.</description>
		<content:encoded><![CDATA[<p>The data is extremely biased towards people aged 20-30.  Also, not every image is labeled with every tag, so each graph plots a subset of the data where both variables were labeled by at least one person.</p>
<p>To answer your question explicitly, we use a two-sided t-test for the error bars.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hadley</title>
		<link>http://blog.crowdflower.com/2009/02/age-and-gender-stereotypes/#comment-880</link>
		<dc:creator>Hadley</dc:creator>
		<pubDate>Tue, 10 Feb 2009 04:00:38 +0000</pubDate>
		<guid isPermaLink="false">http://blog.doloreslabs.com/2009/02/age-and-gender-stereotypes/#comment-880</guid>
		<description>How are you calculating the confidence interval?  They seem awfully large if you have 100,000 faces in total.</description>
		<content:encoded><![CDATA[<p>How are you calculating the confidence interval?  They seem awfully large if you have 100,000 faces in total.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
