Jay writes “ResearchBuzz reported that one more search tool (Beta) is now available to search for blogs. It was launched by Google and is called the Google Blog Search.
Read the full article at:
Jay writes “ResearchBuzz reported that one more search tool (Beta) is now available to search for blogs. It was launched by Google and is called the Google Blog Search.
Read the full article at:
buggy
Google blogs is currently ignoring norobot declarations, too, so people who don’t want their blogs indexed are *still* getting them linked by Google blogs. While I understand that norobots is no guarantee of privacy, it’s a disgraceful error for Google to have made. (they claim to be fixing it)
Re:buggy
They’re also indexing LJ private and protected entries, at least ones that were originally public but then later changed. Disgraceful!
Re:buggy
Is Google just linking, or actually crawling, “norobot” pages? And are these sites using a robots.txt or the NOINDEX meta tag?
Re:buggy
Actually crawling. Dunno on LiveJournal’s tagging, does anyone else?
my initial thoughts
I’m excited about this Google entry into a big field.
The other blog search engines I use continue to have very slow response times (I’m assuming it’s their servers, but maybe their web programming as well), so I’m hoping Google will improve things.
My first query was “LCSH,” which I recommend you all try any time you want to see a lot of librarian-type information.
I was shocked by the number of foreign-language blogs. I went into the advanced settings and limited myself to English-only. The first 4 hits I clicked on gave me a WordPress error, so if I were a WordPress person I would make sure it indexes things correctly.
The next blog referred me to an new OCLC prototype called OCLC DeweyBrowser. It looks really, really neat and I may be playing with this thing for awhile. Try DeweyBrowser, then go back to google later on!
Re:buggy
Yep, crawling. And on LJ it’s not robots.txt files, it’s the meta tag.
<meta name="robots" content="noindex, nofollow, noarchive"
<meta name="googlebot" content="nosnippet"
They have, at least, it made it is a bug which they are fixing. But I’m just shocked that Google’s default libraries, whatever they’re using for their main code base, doesn’t have “obey meta directives” at such fundamental level that this could never have happened.