Google Unveils Massive Keyword Publishing Project

Search Engines WEB writes Google has “upped” its “dictionary” … resulting in a training corpus of one trillion words from public Web pages.
We processed 1,011,582,453,213 words of running text and are publishing the counts for all 1,146,580,664 five-word sequences that appear at least 40 times. There are 13,653,070 unique words, after discarding words that appear less than 200 times.”

–In plain terms, Google has made its “suggested words” a little smarter. It added words to the database that helps in suggesting items for translation, speech recognition, spelling correction, and a few other things. (Your ever-lovin grey-eyed ‘Mudge)