This weekend I was curious to see which bloggers were linking to my latest PBS article. So I did what I’ve been doing for years now and plugged the link into Google Blogsearch. The result? Google claims that my search did not match any documents.
I then turned around and plugged the same link into Technorati and found at least seven blogs that linked to the article. Okay, so maybe these just happened to be blogs not indexed by Google Blogsearch — that happens sometimes. But when I placed the regular URL for PBS’s MediaShift into Google, several of the blogs that had been indexed by Technorati showed up in the search. In fact they were showing up in the search precisely because they were linking to my PBS article.
So why didn’t they show up in my initial search? My theory is that Google Blogsearch for some reason no longer bothers to look for extremely long URLs. Therefore a URL like…
http://www.pbs.org/mediashift/2008/12/pulitzers-open-to-online-only-entrants—-but-who-qualifies347.html
… doesn’t get tracked, while a URL like…
http://www.pbs.org/mediashift/
…does.
I’ve had similar problems trying to use Google Blogsearch to track video/Youtube embeds lately . I used to be able to pluck a URL from the embed code and track every single blog that embeds a particular Youtube video, but recently it’s been impossible, forcing me to go to Technorati.
Oddly enough, these complaints are the least of Google Blogsearch’s sins; the search engine has increasingly been returning incredibly shoddy results for weeks. Scott Rosenberg describes the problems succinctly :
The Google Blog Search results have generally been the fastest and most useful tool of this kind (Google displaced Technorati, which had long served in this role, some time ago). But a couple of months ago Google Blog Search started becoming pretty much useless. Instead of only reporting links from the “main” blog content, it reported all links on a blog page, including the so-called “sidebar” or blogroll, where many bloggers place a lengthy static list of blogs they read. So this means that, for instance, every time JD Lasica adds a new post to his blog at Social Media, which includes Wordyard in its blogroll, I get a new listing in the Google Blog Search for Wordyard, even though the post has nothing to do with Wordyard. This completely messes up the utility of Google’s search for me — and, from what I see posted by other serious bloggers, many other users.
So, a question: Why has Google been so silent on this issue? And why would it allow for these problems to go on so long without reverting back to the old algorithm and search methods? I understand they’re trying to change what is searched, but at the very the least they should keep the old system until the new one is perfected. If my search habits are an indicator, then they could be quickly losing market share to Technorati.