The Demographics of Web Search
adaviel sends a link to work out of Yahoo Research indicating that demographics can help Web searches; e.g. a women searching for "wagner" probably wants the 18th-century German composer, while for men in the US "wagner" is a paint sprayer. The Yahoo researchers claim that by taking user demographics into account, "they managed to get the chosen link to appear as the top-ranked result 7 per cent more often than in the standard Yahoo search." New Scientist mentions this research and two other innovative adjuncts to current search practice: following the mouse cursor as a proxy for eye tracking, and taking back bearings on online criminals by studying the searches they make. (The latter raises disturbing privacy questions: would you want Google trolling through your search data? How about governments?)
who is asking you? (Score:3, Insightful)
would you want Google trolling through your search data? How about governments?
- what do you mean 'would you want', who is asking you, plebes?
Re:who is asking you? (Score:1, Insightful)
Thank goodness /. editors ask these probing questions in the summary. I wouldn't know what to discuss otherwise.
Neat-o. (Score:5, Insightful)
(Yes, I'm being facetious, but still. That Wagner example is pretty awful.)
Sexist search engines (Score:5, Insightful)
Yes, that's really what we need...
What next, a search result that depends on your religion? If you type "Origin of the Universe", you get articles about the Bible if the engine thinks you're Christian, and scientific material otherwise?
They need to understand there is little value in subjective data. Their results are already biased enough, they should take steps to fix that, not make it worse.
Re:Then again... (Score:5, Insightful)
What would be useful is if I could choose to search from a different person's/demographic's point of view. Whether for ebay, amazon, google.
For example say I am looking for a gift for someone else. Or I am helping someone else search for stuff. Or I'm the sort of person who has rather different interests but with search keywords that overlap.
Same goes for reviews of restaurants/movies/etc. What I like, someone else may detest.
Lastly, it could also be interesting (and even beneficial) to be able to more easily see things from other people's point of view.
Re:Sexist search engines (Score:3, Insightful)
Re:Sexist search engines (Score:2, Insightful)
If you are surfing from France, you speak French.. (Score:1, Insightful)
... not!
When I was living in France for a while (job related), I was quite annoyed by all those websites that assumed that because my computer's IP was in France I wanted to see the site in French, even if the site was a .com and I explicitly tried to click the "English" link. (My French is good enough to buy some baguettes with rillettes, but not for reading technical articles.)
This goes in the same direction: It works in many cases but when it doesn't, it will piss off the user.
highly dubious (Score:5, Insightful)
funny .... (Score:2, Insightful)
The first thing I thought of when I read Wagner was the popular brand of jeans.
There were/are gender predictors out there that will look through your search history and try to predict what gender you are. They were mildly successful (though dead wrong in my case). I think I prefer Google's more invasive yet more accurate method of paying attention to which results I click on and giving me more of the same without regard to gender or age. I DO like getting local results though.
As far as women vs woman goes ... tsk! just think, "would I use man or men here?", and then add a wo onto the front of it, it's not that hard.
This is wrong. (Score:2, Insightful)
Often someone will tell me in a forum to "search for x in google", what happens when the results are not exactly the same worldwide because of this technique?
Also, there are loads of people that use proxies and so on to search the web (like people in China). Their demographics would appear all skewed because it would seem that someone in the proxy's country of origin is requesting to search for webpage x.
I don't agree with this technique at all. It just doesn't fit. Imagine if 'egrep' started filtering strings based on additional info that you could not easily control (like timezone), it would be annoying.
Re:Correction: (Score:3, Insightful)
Are you sure? I just searched and the first result is this Slashdot article which clearly says that he was an 18th century composer, right in the summary.
Good heavens, why was this modded Insightful? I think the poster was going for Funny. Anyhow, a quick Wikipedia search reveals that Richard Wagner lived from 1813-1883, making him a 19th century composer.
Re:This is wrong. (Score:3, Insightful)
what would help is a simple way to toggle custom/standard searches and to see which way the toggle is currently set
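Something like this is what I have in mind. It is only a rough sketch: the `rerank` function, the `CLICK_PRIORS` table, the group names and all the numbers are made up for illustration and are not how Yahoo or Google actually do it. The point is that personalization is off unless you ask for it, and the function always tells you which mode produced the ordering.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical click priors per demographic group (illustrative numbers only).
CLICK_PRIORS: Dict[str, Dict[str, float]] = {
    "group_a": {"composer": 0.7, "paint_sprayer": 0.3},
    "group_b": {"composer": 0.4, "paint_sprayer": 0.6},
}


def rerank(
    results: List[Tuple[str, float]],
    demographic: Optional[str] = None,
    personalize: bool = False,
    weight: float = 0.5,
) -> Tuple[List[str], str]:
    """Order (result_id, base_score) pairs and report which mode was used."""
    # Personalization applies only when explicitly enabled AND the group is known.
    use_priors = personalize and demographic in CLICK_PRIORS

    def score(item: Tuple[str, float]) -> float:
        result_id, base = item
        if not use_priors:
            return base
        prior = CLICK_PRIORS[demographic].get(result_id, 0.0)
        # Blend the demographic-independent score with the group's click prior.
        return (1 - weight) * base + weight * prior

    ordered = sorted(results, key=score, reverse=True)
    mode = "custom" if use_priors else "standard"
    return [result_id for result_id, _ in ordered], mode
```

Calling `rerank(results)` gives the same ordering for everyone, so "search for x" advice in a forum still works, and the returned mode label is what the interface would display next to the toggle.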
Re:Correction: (Score:4, Insightful)
Modded insightful twice too... I guess some people can't be bothered to think for themselves and just moderate to increase whatever the current moderation is.
There is already bias in search results (Score:5, Insightful)
The search results are not just a regex match. A modern search engine, like Google's, returns a ranked list of search results to you, and this ranking already has bias: the PageRank algorithm sorts the results based on how popular the page is, as measured by the number of incoming links to that page. Of course, that is the general gist of PageRank as of the Google founders' research paper back in the late 1990s, and undoubtedly Google and other search engines have fine-tuned their algorithms since then to return "better" results to the user. But the point is still that there is already bias in the results.
Make no mistake: Google has surely already thought of search result ranking algorithms similar to the one proposed in this Yahoo Research paper. The difference is that Google does not have a research arm like Yahoo, so they do not publish ideas like this. In hindsight, the Google founders were foolish to publish their PageRank algorithm in the first place, but they were still at Stanford then.
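For the curious, here is a minimal sketch of the textbook power-iteration form of that idea. It is a toy, not Google's production ranking; the function name, the damping value of 0.85 and the handling of pages with no outgoing links are the usual textbook choices and are assumptions here. Note that in this formulation an incoming link counts for more when the page it comes from is itself highly ranked, so it is not a raw link count.

```python
from typing import Dict, List


def pagerank(
    links: Dict[str, List[str]],
    damping: float = 0.85,
    iterations: int = 50,
) -> Dict[str, float]:
    """Toy PageRank: links maps each page to the pages it links to."""
    # Collect every page that appears as a source or a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start from a uniform distribution over pages.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page gets the "random jump" share first.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling page (no outgoing links): spread its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank
```

With `links = {"a": ["c"], "b": ["c"], "c": ["a"]}`, page "c" ends up ranked highest because two pages link to it, and "a" beats "b" because its single incoming link comes from the well-ranked "c". That is the built-in popularity bias in miniature.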
Re:Correction: (Score:3, Insightful)
Wagner was a 19th-century composer, not 18th.
But when I (male) search for Wagner I'm more interested in Jill [imdb.com] than Josef or Richard.
They could simply not save the info. (Score:3, Insightful)
Re:wow... Just, wow.. (Score:1, Insightful)
Mkay?
Re:Neat-o. (Score:3, Insightful)
Just because it's true for the group doesn't mean it's true for the individual.
Improving search results is about aggregates -- returning the best results for the most queries. Individuals don't matter. Google has used this fact to their advantage to show many links to many people while keeping their interface clean: each user only sees three links at the bottom of the main page, for example, but each of n>>3 links displayed in that spot is viewed many times.
If Yahoo can move relevant links higher in the result list for 15 percent of queries, the only concern is about the quantity of queries for which relevant links have moved lower. If stereotypes do in fact represent the majority of a demographic, then it doesn't really matter to a search provider whether you or I as individuals represent our respective stereotypes.
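To put the bookkeeping in concrete terms, here is a rough sketch of how you might tally it. The function name and the input format (one pair of old and new positions of the clicked link per query) are my own assumptions, not anything from the Yahoo paper.

```python
from typing import Dict, List, Tuple


def rank_change_summary(positions: List[Tuple[int, int]]) -> Dict[str, float]:
    """Summarize (old_position, new_position) pairs; position 1 is the top result."""
    total = len(positions)
    if total == 0:
        raise ValueError("need at least one query")
    # A smaller position number means the link moved up the list.
    improved = sum(1 for old, new in positions if new < old)
    worsened = sum(1 for old, new in positions if new > old)
    return {
        "improved": improved / total,
        "worsened": worsened / total,
        "net": (improved - worsened) / total,
    }
```

For example, `rank_change_summary([(3, 1), (1, 1), (2, 4), (5, 2)])` reports half the queries improved, a quarter worsened, and a net gain of a quarter. The "worsened" figure is the one the provider has to watch.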
Last, what if you want to know what other people not from your demographic group are seeing?
Why would you want to know? SEO? The goal of search engine optimization is completely at odds with the goal of improving search results: higher rankings of a site in spite of its relevance to the user, versus higher rankings for a site based on its relevance to the user.
Oh gheeze. A philosophical rant. That wasn't my intention. It really wasn't.
Re:If you are surfing from France, you speak Frenc (Score:3, Insightful)
THIS! I too have major hate of forced localization. Every time I set up a new browser and load up Google, it goes to google.de (I'm in Germany, I speak the language well enough, but I want the content that I want, you stupid f'ing websites!). Even worse is Comedy Central and their South Park clips: an English-language blog embeds a South Park clip from Comedy Central, I click play, and guess what happens? The clip is dubbed in German! Aaarrrrggghhh!!!
Also trying to read myspace profiles (why, why?) gets pretty fucking irritating when it localizes the standard terms as "Favorite music", "Comments", etc, but then after the ":" displays the stuff the user's filled in, in their original language (usually English), meaning you have to read localized and then English words within the same sentence.
God damned morons all of them...