Google's Fight Against 'Low-Quality' Sites Continues 220
nj_peeps writes
"A couple weeks ago, JC Penney made the news for plummeting in Google rankings for everything from 'area rugs' to 'grommet top curtains.' Turns out the retail site had a number of suspicious links pointing at it that could be traced back to a link network intended to manipulate Google's ranking algorithms. Now, Overstock.com has lost rankings for another type of link that Google finds to be manipulation of their algorithms. This situation has led Google to implement a significant change to their search algorithms, affecting almost 12% of queries in an effort to cull content farms and other webspam. And in the midst of all of this, a company with substantial publicity lately for running a paid link network announces they are getting out of the link business entirely."
Does that mean (Score:3, Insightful)
we can expect google to get better, e.g. closer to what it used to be in the early days?
Re:Does that mean (Score:5, Insightful)
Google didn't get any worse, the spammers are the ones who got better.
I understand them if they are rather slow in making significant changes to their algorithm. In this sue-happy society they have to keep any collateral damage as low as possible (i.e. valid sites that move only a few spots down the ranking - can you imagine the outcry?). It's the disadvantage of being number one.
Re:Does that mean (Score:5, Interesting)
This is what I don't get. How can you decry the business of another when it adversely affects you, especially when the two industries are completely unrelated (Retail vs Search/Tech)? Google's business is to provide the most relevant results to the search request made. PERIOD. One of the search terms my site consistently is in the top three sites for recently went down several spots as people who've lifted content off my site and posted it to their site, unabridged and unedited. Just flat out copy/pasted it. I know, because there are unique aspects about my content (relevantly unique), which is why my site was so well listed, and why the content was lifted and posted elsewhere.
I worked long and hard creating unique relevant pages to get to the top of the search, only to be replaced by exact copies on other websites. I'm not upset, I consider it flattery that my content is so good that people find it that useful that they want it as their own. However, I would be pissed if the information I had was commercial in nature (it isn't) and people were just taking it because of what I call the Kazaa mentality of just copying things because you want them and are too damn cheap to buy it. In a world where people (used to) buy ring tones for $2.99 but steal $.89 MP3s.
Anyway, back to my point, as a result of people plain stealing my website content, my rankings have dropped considerably by exact copies of my work. What used to be #1 on the first page is probably now somewhere on page #2. It would suck if wasn't giving the info away, the more places that have my info the better. Still, I would love for Google to realize where the original came from (history) and gave points for being "first" for relevant content.
Re: (Score:2)
This is an issue of dilution and why mass copyright infringement ultimate devalues goods and harms their authors.
Part of the reason why people want certain goods is because not everyone has that good. When suddenly everyone has that good,the willingness of the population to pay top price for the item is significantly reduced; capitalism at work. As a result, once the market becomes diluted, the market price is reduced. This means it has effectively become a commodity item. Worse, when the commodity is commo
Re: (Score:2)
...people were just taking it because of what I call the Kazaa mentality of just copying things because you want them and are too damn cheap to buy it
I'm really surprised you got modded up on a site like /. which is overwhelmingly pro-copying (quite happily admitting - in fact almost boasting - it's copyright infringement, because apparently that makes it a lot better).
Re:Does that mean (Score:4, Insightful)
Perhaps the communal view on the issue is much more complicated than you'd like?
Perhaps Slashdot isn't a mono-culture yet, and still has plenty of dissenting views?
Perhaps the author has a point, there is a line between open culture and exploitive culture. Remixing is fine, sharing can be fine, plagiarism is not fine.
Re: (Score:3)
Damn double spacing... Bad /. 3.0! Bad.
If posts and moderation are an indicator, then he's right. The place need not be a mono culture to censor (actively attempt to hide from general view and squash credible debate via moderation) the debate. That's generally what happens. Anything which isn't pro-pirating is generally negatively moderated. Occasionally anti-pirating comments are either left alone or moderated up, but that's fairly rare. It likely has more do with moderation point availability rather than a desire to openly debate the merits.
I've noticed the moderation trend too, but I don't think it means to much. A lot of "anti-piracy" posts are trollish for whatever reason. Not the point of view, but it seems trolls favor posting from that side of the issue more so than the other. A lot of them are "piracy is bad", with no logic or reason backing it up. This happens on the pro-side too, but not as often. I'm not saying that this is most of it, but it plays a roll.
Worse, many pro-pirates then troll moderate those who offered an alternate view in other, unrelated discussions.
This is just a popular new thing
Re: (Score:3)
Whether it's Google or spammers that are responsible, the reality is that Google just isn't anywhere near as useful as it used to be. And I think they waited way too long to own up to the poor quality of results, they've been at parity with Bing and pretty much everybody else for quite a while, with their only advantage being rate of updates on their index.
Re: (Score:2)
Even on your assessment they are still the best: equal quality of results and faster updates means they are still the best.
Personally, I still think Google results are the best - and I done my own testing, even using blind search [fejus.com] for a while.
Re: (Score:2)
I understand them if they are rather slow in making significant changes to their algorithm. In this sue-happy society
They could go the way of Microsoft, i.e. Patch Tuesday. Optimize all you want, but we'll be changing the algorithm every month. (Oh, and flagging the quickest responders as spammers, they obviously care too much.)
Re: (Score:2)
And here come the Google fanboy's explaining how it's not Google's fault that Google search results are getting worse over time...
If Google's search results have gotten worse, then yes - Google has gotten worse.
Re:Does that mean (Score:5, Insightful)
Does This Mean That InfoWorld... (Score:2)
...will stop buying those infuriating astroturf placements here on Slashdot designed to bump up their SEO...?
Re: (Score:2)
Re: (Score:2)
Can you provide an example of this? Because this hasn't been my experience.
Re:Does that mean (Score:5, Funny)
I don't give a damn about their soul, I just want it to point me to the information I am looking for.
Re: (Score:2)
"I don't give a damn about their soul, I just want it to point me to the information I am looking for."
This.
Unfortunately, what I am looking for usually has nothing to do with what anyone else is looking for.
(if the information is that popular, I probably don't need to look for it, so all this page-rank nonsense is just in my way)
A.
Re: (Score:2)
Wouldn't be so bad if people didn't run off to their respective isolationist forums in order to ask/answer questions "publicly."
Two things that would make the web a fundamentally different place: Everyone to a man comfortable editing a wiki, and everyone to a man comfortable using public key cryptography software for signatures. The former would give us better consolidation of information than the chaos of forums (you think games are bad, try cell phone forums, sheesh!). The latter would allow, finally, c
Re: (Score:2)
... everyone to a man comfortable using public key cryptography software for signatures. ... would allow, finally, consequential online reputations/identities to be built.
Are you kidding? That would be horrible! The last thing I want is my boss at work asking why I've been reading erotic transformers meet the Jetsons fan fiction! ...Not that I do, that's just an example of something some sick person might be worried about...
Re: (Score:2)
Wow, that just gives "Jane! Get me off this crazy thing! " an entirely new meaning.
That's just so wrong. Now I feel dirty.
Re: (Score:2)
That was bad and you should feel bad for spreading it.
Especially since it required using the jaws of life.
Re:Does that mean (Score:4, Interesting)
You don't have to only own one identity. However building up a good rep would be enough work that it would be a limiter to promiscuous sockpuppetry. The point is that for any identity, which could span multiple sites if you want it to, we'd A) know posts were generated by the keyholder B) be able to refer to the keyholder identity compatibly C) be able to endorse or call shenanigans on certain keyholder from our own identities. D) be able to filter content based on a trust web built up over time such that trolls, spammers, and astroturfers are effectively moot -- each identity has to earn its own reputation from real people over time, either from consitently behaving as a good netizen (e.g.providing accurate information), being a good source of opinions about other netizens, or importing trust from real-world relationships. There would probably be many "flavors" of trust e.g. "this guy is a bona-fide real person but he tends to fall hook line and sinker for chain emails" or "this guy is right 90% of the time but he does not retract it when he is wrong" or "this guy has original material but don't import his endorsements because he downrates people based on personal vendettas."
Applies to email as well as forums/wikis.
I think with facebook people are getting used to the very preliminary ideas behind building a trust network, even if it is one built on a foundation of sand. So there's that at least. I'm not hopeful about anyone developing a good trust network system, much less selling it to the public, however, since developers seem to be more inclined to re-implement the CMS wheel perpetually.
Re: (Score:2)
This just in: 98% of blogs are pointless!
Re: (Score:2)
Re:Does that mean (Score:5, Funny)
Re: (Score:2)
LMGTFY
http://www.motorola.com/Consumers/US-EN/Consumer-Product-and-Services/Mobile-Phones/Motorola-DROID-X-US-EN [motorola.com]
http://www.motorola.com/Consumers/US-EN/Consumer-Product-and-Services/Mobile-Phones/DROID2-Global-US-EN [motorola.com]
(FYI, I get the SW ANH reference, I was just being snarky)
Re: (Score:2)
Easy.
Google "Google's Soul"
Re: (Score:3, Insightful)
Keep repeating the moronic claim of Google's overarching villainy. When Google does turn evil, no one is going to care because they're already ignoring you.
Re: (Score:3)
where the front page was nothing but a banner and search entry field? Or where "do no evil" was more than an abandoned slogan of good faith? Google sold its soul a long time ago, there is no going backwards.
A companies soul is the cost of them going corporate.
Re: (Score:2)
Do you mean going public? Going corporate just means (in very very vague terms) you have some additional tax rights and if you go under under legal means you are not having to personally front the bill. Basicilly making it safer for someone to take the risk and start a their own business. The LLC is being more used then the Corp now as it is cheaper. But going corporate doesn't make you evil. But once you go public then you have shareholders that you need to keep happy and they are looking at the bott
Re:Does that mean (Score:5, Insightful)
Also, what the hell is with you people. The slogan is "don't be evil", not "do no evil". It's a minor grammar error, and you're probably confused with monkeys, but this pops up time and time again. Is this some talking point kind of thing that I'm not aware of? Did I not get the memo?
Re: (Score:2)
Re: (Score:2)
That was a problem, it did not adequately represent the proportion of relevant pornography on the internet for any given search term. Porn should have easily been in the top 3, if not #1.
Can the car parts spammers be next? (Score:5, Informative)
Please tell me they are going to start going after the myriad car parts spam sites that flood the google rankings when searching for anything but the most obvious automotive items. I am sick and tired of sifting through a dozen completely worthless sites when googling for a part number I am trying to track down. Ebay is more reliable than google for almost everything I am looking for lately.
Re:Can the car parts spammers be next? (Score:5, Interesting)
The best interest of somebody selling a muffler on ebay is to get your eyes on the muffler so you can consider buying it. you obviously aren't going to buy the muffler if you've been searching for elvis wigs, and I think bad tagging gets your seller rating shot to shit.
On the other hand, in a web search, the best interest for the site is to maximize their investment and get the most eyeballs. So, they play dirty and abuse the algorithm to get as many hits as possible so their ads get as many hits as possible as well.
We're just lucky in this case that Google is siding with the people on this one, but that's only because Google seems to dislike vertical search sites as much as anyone else who is trying to search for an answer and gets baited into a mailing list / aggregator of the search results you were just looking at in google.
Re: (Score:3, Funny)
Re: (Score:3)
a computer analogy to explain a car situation.....
Whoa, are you trying to get the world to spin backwards?
Re: (Score:2)
While this IS a problem, it's not an insurmountable one. It's rare I can't tell which links are bogus from the search results page. Searching by full part numbers has worked well for me for Subaru, Ford, Mercedes, and Nissan. I recently got a new shifter cable for my 1992 F250 by part number for about a quarter of MSRP... I think it was on the third page of results or so. Maybe took me ten minutes to find, the time certainly paid for itself.
Google keeps doing what its doing, SHOCKER! (Score:2, Funny)
Completely unpredictable that Google keeps maintaining their product, unfathomable
Next target, those stupid mailing list aggregators that keep popping up first in results, but are a redirect to a redirect to a redirect ... and digg/reddit types
I, for one, welcome our new advertising overlords (Score:5, Interesting)
I think Google has "rationalized" a lot of their ad process (both ranking and sales) and the only guys who are hurt, are the ones who were gaming the system to begin with. e.g. click fraud and spamming the ranking.
Re: (Score:2)
Mine has stayed relatively stable. Couple dollars more like usual. While I'm sure it depends on the type of content that drives revenue to your site, I have a feeling that very little has changed for those who make more than a couple dollars a day.
Re: (Score:2)
I gave up on AdSense years ago, just the fact that they didn't bother to care whether or not the javascript worked across platform was enough to lose any interest in using their product. I don't mind them filling in free ads if nobody has paid for a particular spot, but when their javascript prevents people from showing their ads on my page, that's a problem and given that they supply the tools necessary, it's completely unacceptable.
Re: (Score:2)
My revs are up marginally the last few months. My theory was that it's Google doing a much better job of using their DoubleClick display adds to follow users around the web - it might be specific to my main site, it's very niche, and there really weren't that many distinct advertisers. Plus, I've been noticing Google ads following me as I browse the web. It's a little eerie, for example, I was researching antivirus packages for work. A day or so into the process, I'd start seeing ESET NOD32 ads everywher
Some improvement already (Score:3)
Re: (Score:2)
Searching for free software has been quite a challenge. It inevitably turns up more "free downloads" and "free trial" than actual free software by default, and you can't just eliminate those two without losing sites which offer both on the same page.
Anything that suppresses content farms is good! (Score:5, Interesting)
One of the things I use Google for extensively is the ability to search for wierd error messages, return codes, etc. that appear in commercial software I use for work. It's very frustruating when your very specific search query returns 45 different sites, all of which are rehosting the same forum post or newsgroup article. These get ranked higher up than other unique posts, causing a lot of scrolling through results and wasting time. Also, these aren't queries like "bmw 335i" or "" that are guaranteed to return millions of unique hits. I'm looking for the one other guy in the world who's found this issue and has a workable answer. Google used to be pretty good for that, especially if your query was well formed and incredibly specific.
Real world example - I got an error message trying to install Windows 7 SP1 last week, with a long hex number and a very specifically-worded message. I typed the query into google, and the first hit was some idiot who had no idea what he was talking about on a support forum. The next 5-6 hits were that exact same idiot's post rebroadcast to sites like eggheadcafe.com, techarea.in, etc. I eventually found the answer, but it was on page 3 of the search results.
On another topic, how and why do these content farm sites exist? How does eggheadcafe.com, which just copies newsgroup and forum data, able to pay to keep the site going? Are they all just looking to cash in on ad revenue? Do they really get that much in revenue to justify the site-crawling they must have to do?
Re:Search for error messages (Score:3)
The next generation is to get out of generic search. Build a roster of say 5 sites that do a great job on your error code problems and then use advanced search to stay in that domain.
Set up your browser to be specific search domains. (Non error related example) - I typically run IMDB and Wikipedia in a pair, so I do the search on those, one per tab.
Re: (Score:2)
I was part of a startup 10 years ago that was doing something like this. We used the DMOZ data to build a matrix of word frequency relationships to categories. Then when a user entered a query, we would determine the category, and send their query to a more specific search engine. For example, if they typed in 'beatles', we would identify this as a music query & send them results from allmusic.com. Unfortunately around the time our product was getting usable, the dot-com crash happened, all of our
Re: (Score:2)
I got an error message trying to install Windows 7 SP1 last week
sp1 is out?? why doesn't it appear in my updates? and its already giving mysterious errors!
Re: (Score:2)
MS stages the release of their service packs. IIRC some special partners get it first, then MSDN and technet subscribers get it. Then it goes on the download center and finally it goes on windows update. I think within windows update the release is gradual too though i'm not sure on that.
Theoretically this would allow them to pull a service pack before most users get their hands on it but i've never heard of that actually happening (though IIRC windows update DOES check some preconditions before handing out
Re: (Score:2)
Also, these aren't queries like "bmw 335i" or ""
If you're trying to put some text between < and >, write < and > , and don't forget to preview ;)
Do they really get that much in revenue to justify the site-crawling they must have to do?
Well, my guess is that the answer is "yes". These forums don't really require that much storage nor bandwidth, the usual "related topics" on that kind of sites helps getting better Pagerank (than the original forum), and the page is otherwise usually mostly filled with ads. You can also imagine other goals, for example :
Re: (Score:3)
The problem is that content sites and review sites and blogs and so on should be automatically excluded from all searches unless you explicitly but in words like "sale" or "review" in the search. (or have a non-commercial web sites only check box).
5 pages of reviews of a computer part that are actually all "reviews" left by consumers on shopping sites are useless when you really just want the obvious review of the part that was done by a computer review site. Their search engine also is worthless because b
alta vista (Score:3, Interesting)
Market forces will insure that firms will continue to hack the google algorithm. If Google fights back too much firms will begin to use and promote other advertisers, like Bing. This is a typical case where the end user is not the customer. The customer is the firms that pay Google to advertiser. Then search engine only serves to collect views that raise the value of those ads. Therefore the only issue is if the 'low quality' search results causes substantially fewer people to view ads.
In fact I don't see Google doing anything to make the search results better. All the link farms with Google ads appear to perpetually stay high in the ranks. The only time that anything seems to be done is when a firm fails to pay Google for ads and instead pays other firms to manipulate the rankings. I can imagine that Google, who will doing anything, ethical or not, to be the only ad agency on the web, would find that to be a very bad thing.
Re: (Score:2)
you meam all those aggregator sites will stop listing on Google and will fill Bing with all their crapola instead?
that's pretty much the best result I coudl have expected (as I don't bother using Bing in the first place).
Re: (Score:2)
If Google fights back too much firms will begin to use and promote other advertisers, like Bing. This is a typical case where the end user is not the customer. The customer is the firms that pay Google to advertiser.
Google isn't the advertiser. That's what the pages it lists are. Its more like a shopping mall. Google is trying to protect the reputation of its location by attracting Neiman Marcus and Saks 5th Avenue. JC Penney's can move to the cheap mall across town with Wal Mart and its customers [peopleofwalmart.com].
Re: (Score:2)
I live near a mall that has both a Neiman Marcus and a Target.
I keep waiting for it to explode from the inevitable matter/antimatter (Neiman/anti-Neiman) reaction.
Re:alta vista (Score:4, Insightful)
Google won, over AltaVista and others of the time, in part because the results were better - because AV's algorithm couldn't screen out the less useful results as well. They also won by just being a search engine rather than spending countless $ on becoming a "portal" when people didn't actually want a portal they wanted a search engine - perhaps AV would have done better if the $ that went into the portal thing went into improving their search functionality instead? Of course Google's keep-us-interested schemes involve much more than just the search engine these days so they could potentially fall into the same trap eventually, but unlike AV their other tools are just that: other, by which I mean that they compliment the search engine product (and the more general "information location and management" focus) or are not even related to it rather than trying to replace it.
SEO gaming - no penalty! (Score:3)
I've just looked back at JCPenney's stock price, and there's no fluctuation or even a news mention about them getting Google-slapped for SEO gaming. They made it through the Christmas season selling tons of stuff, Google has slapped them down, yet there isn't even a bump. An analyst noted they had slightly weaker January sales and blamed it on "Lower inventory clearance coupled with bad weather".
Apparently it means that SEO gaming does not rise to the level of "Corporate Evil" that would divert shoppers or stock traders. I guess the public must just see it as "corporations advertising like normal."
Re: (Score:2)
Or most of their online customers go directly to JCPenney's rather than searching for a source of doodads or widgets?
In the end, google might have done JCPenney's a favor by showing them how little business their SEO games actually brought, and that this is an expense they can well do without...
Anti Link Sites are born... (Score:3)
If I have a site that google has identified as a "bad link source", I can sell that as a service so companies can lower the rank of their competition.
Of course, Dr Suess saw this long ago http://www.squidoo.com/thesneetches [squidoo.com].
Re: (Score:2)
Re: (Score:2)
If I have a site that google has identified as a "bad link source", I can sell that as a service so companies can lower the rank of their competition.
I doubt it. Most reputation algorithms ignore (untrusted) negative feedback precisely for this reason. I think if your site is detected as a bad link source, its page rank is hard-set to exactly 0 so what you do or do not link has no effect on anything.
Of course, Dr Suess saw this long ago http://www.squidoo.com/thesneetches [squidoo.com].
nice...
Where's the story? (Score:2)
Now, Overstock.com has lost rankings for another type of link that Google finds to be manipulation of their algorithms. ... And in the midst of all of this, a company with substantial publicity lately for running a paid link network announces they are getting out of the link business entirely.
So where are the stories to support these two statements? TFS wasn't a summary of a story, it was a few low quality links and some bold, without citation, claims.
Still not detecting scraper sites (Score:2)
The Slashdot posting was mostly plagarized from this story at SearchEngineLand. [searchengineland.com] That story also has the phrase "And in the midst of all of this, a company with substantial publicity lately for running a paid link network announces they are getting out of the link business entirely.", without saying who it was. Searching for that phrase in Google brings up 73 results from sites which scraped that article, but no insight. Variations on that phrase bring up mostly hits to scraper sites.
Clearly, the new Goog
Re: (Score:2)
And obliterating the plagiarism sites from Google's index would help the original authors, too, and be a blow for justice. I'm thinking that Google could compile a list of scraper domains similarly to how it caught Bing copying its search results (MS sockpuppets: don't bother. You got caught.). Create pages with long, unusual phrases; every time you c
Re: (Score:2)
Re: (Score:2)
Just scroll down... and down, and down... you will get the answers that google indexed.
but remember, you must click on the link shown on google, dont try to search directly on the site
Re: (Score:2)
They do suck but if you scroll down to the bottom of the page, you get the answers anyway.
[John]
Datasheets (Score:2)
It sure would be nice if google laid the smack down on all of those bogus electronics datasheet archive websites. Those are totally useless and make it very difficult to find specs on old parts.
Re: (Score:3)
Funny, two days ago I sent Google a suggestion regarding datasheets. If someone types in 74HC[anyhing] etc... We want the .pdf, not a link to bogusdatasheets.com. Hell, just make it easier to select filetype:pdf in a checkbox, on the search or results page.
Malice (Score:2)
How about malicious link farms, that is someone sets up a link farm specifically to screw over one of their competitors?
Also, how about aggregation sites... Sites that have "content copied from other sites" but provide data from multiple locations in a single place making the data that much easier to use? I wouldn't consider such sites to be of low quality.
One thing that does irritate me, if i have a technical question google can usually find 50 instances of other people asking the same question, but not al
Change, yes; improvement, maybe. (Score:2)
Google's latest change is being discussed in the "search engine optimization" community. The consensus seems to be that a few big-name junk sites are being hit, and some minor link farms stopped having an effect, but the change isn't doing much else. "eHow" entries still show up. "alibaba.com" (a wholesale supplier directory, mostly for China, India, etc.) was hit, "globalsources.com" wasn't.
This may be a "manual adjustment", in emulation of Blekko's blacklist of content farms. Google's announcement, of
Look for SEO link buying from Fortune 500 firms (Score:2)
The other day I was approached by a marketing firm that wanted to buy a text link on the front page of my main website. That wasn't new, any webmaster of a half-busy site will get generic link buying requests frequently. This was different.
It was clearly a specifically written email to the webmaster, me. It wasn't the usual automated scatter-shot form letter email. I was curious, so I asked for a bit more information and it turned out to be a Fortune 500 firm that wanted to rank highly on printer supplies
I sense a great disturbance... (Score:2)
It was like thousands of Search Engine Optimizers cried out, and then suddenly were silent...
How does google know it was you? (Score:2)
How do they know "who" is responsible for the linkages?
link spam (Score:2)
I get the idea... (Score:2)
This probably means that big companies get ranked higher, and the smaller guy will end up on the bottom of the list.
Re: (Score:2, Insightful)
on Google to send me exactly where they must know I belong because I can't make that decision for myself.
If you knew the location of the web site where you "belonged", you wouldn't have to search for it to begin with.
Re: (Score:2)
If you knew the location of the web site where you "belonged", you wouldn't have to search for it to begin with.
Typing it into the search bar is quicker than prefixing "www." or thinking about the spelling.
Re:Good, now I can really depend (Score:5, Funny)
I guess the humor in my original post on this thread was lost somewhere.
It was low-quality humor, obviously culled from a humor farm - and thus downgraded.
Re: (Score:3, Insightful)
Yes, that's what this is about. Freedom to have spam served to me on a silver platter. Please Google, stop filtering all that spam in my gmail inbox too! I hate that you feel the need to protect me; I am a big boy and enjoy sifting through 1000 messages a day looking for the 2 relevant ones! Let freedom ring! /sarcasm.
Re: (Score:2)
Please publicly disclose your email address so that you can decide for yourself which spam email you'd like to keep.
Free your inbox from the tyranny of not getting enough herbal Viagra email!
Re:Good, now I can really depend (Score:5, Informative)
I run a spider. It seems over 95% of pages on the internet are content farm and similar randomly generated crap. They take a hundred sentence fragments, string them together, then see if they can fool Google and other engines into crawling them.
You will not be very happy if they stop filtering the garbage for you.
Bayesian tagging (Score:2, Interesting)
Let people tag sites they've found as a result of a search. Build a tagging system which will allow people to exclude linkspam for example.
I've set up Bayesian tagging for my email client and it works quite well, all my mails come in pre tagged, pretty much 99% accurately, only an occasional one comes through with an incorrect tag these days.
I'm aware of the processing overhead involved... which is what the Google Toolbar is for. Or should I have patented this idea first? Maybe they could just buy Stumbleu
Re:Bayesian tagging (Score:5, Insightful)
Let people tag sites they've found as a result of a search. Build a tagging system which will allow people to exclude linkspam for example.
Because no spammer could write a program to repeatedly search for and tag their site.
Re: (Score:2)
Re: (Score:2)
... and that's why Facebook, not Bing, is Google's biggest competitor.
They've known this for years.
Re: (Score:2)
You have been invited to play Mafia Wars!!!1one
Yep, social networks are the best way to channel the collective intellect of collected morons.
Social network not necessary (Score:2)
If you have an identified account and an identified choice, you can use various collaborative filtering techniques to suggest the autotags for a site. You have sets of shared bayesian statistics with those who have tagged sites similarly having greater weight than those who didn't.
Google's real problem is anonymity. The reputation of the link spammer is the same as a legitimate linker.
Re: (Score:3)
Depends, if the user is identifiable, now, doesn't it.
Come on, this is the 21st century, not the 20th.
http://en.wikipedia.org/wiki/Public_key_certificate#Client_certificates [wikipedia.org]
Re:Bayesian tagging (Score:5, Interesting)
Let people tag sites they've found as a result of a search. Build a tagging system which will allow people to exclude linkspam for example.
That would replace "PageRank" with "whoever can afford to pay Mechanical Turk to tag their site". At that point, Google might as well drop the middleman and use their AdSense auctions to sell page ranking directly.
Re: (Score:2)
Re: (Score:3)
send me exactly where they must know I belong because I can't make that decision for myself.
Congratulations on grasping the purpose of a search engine.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
It's mainly because Google doesn't like or believe in the mission of your non-profit.*
*I completely made it up...
Re: (Score:2)
Maybe your web site isn't as relevant as it used to be for that query? Maybe users are finding more valuable content elsewhere? Why do you believe the status quo should be inviolate?
Re: (Score:2)
Re: (Score:2)
How did you do that? Those are both the same link.
Re: (Score:2)
Bravo, this one's for you mods.