I'm new here and this seemed like the time to join in. (Sorry to hear about your lost data; I just had my laptop and Palm stolen so I'm starting over, too.)

I'm interested in the "Results from Project Gutenberg" page. It fascinates me, and I'm trying to research similar data. For instance, you list the top 20 bigrams, trigrams and quadrigrams; does anyone here know how to get the top 50 of each? Or the top 5-grams? It would help immensely.

Thanks in advance.