Oh, joyous day: Amazon has provided me with another means of needlessly quantifying subjective information. For a large number of their books they now publish readability statistics, in the form of the Flesch Readability Index, a number based on the mean lengths of sentences and words. An easier-to-read text gets a higher score; a score of 60 to 70 means a text is readable by an average 13-15 year old. This method of testing readability has actually become quite ubiquitous across America - the U.S. Department of Defence uses the Flesch index as its official standard and requires that its official documents meet certain score standards. More relevantly to me, Microsoft Word has also bundled Flesch for some time, and this led to my compulsively checking the scores of my undergraduate essays every five minutes, certain that an incredibly hard-to-read essay would translate directly into a high mark. (Detractors of literary criticism may have something to say about that, but who cares about them? They probably haven't even read this far.)
What Amazon does that is particularly noteworthy is turns the Flesch score into a relative percentile value - you can see what percentage of books are easier or harder than the one you're reading. Naturally, that leads to the tantalising question, "What is the hardest-to-read book in the world?" I scoured Amazon, determined to find and then immediately read the offending book so I could fill myself with the smug, self-satisfied assurance that everything I ever read would be a comparative walk in the park from now on. Unfortunately, it wasn't as easy to find ultra-difficult books as I had thought it would be.
Let's start with that old confounder of literary intentions, James Joyce. One might expect Ulysses to garner a particularly low score, but no such luck: it gets a score of 68.1. Finnegans Wake, then? It is harder, but not by much: it scores 60.0, which Amazon informs us is easier than 67% of books out there. Other books with reputations for difficulty fare just as badly. Moby Dick scores 57.9; War and Peace, 60.2; Gravity's Rainbow, 60.4. (Would you set Gravity's Rainbow as reading for someone in Year 10? I don't think my 14-year old self would ever get past the part where Katja and Brigadier Pudding... well, you know). All these, apparently, are comfortably in the easier half of all books ever written. Is every book I can think of, comparatively speaking, really a breeze? Even my previous blog entry, according to Word, manages to score in the thirties. Am I really twice as hard to read as Joyce? That's not much of an endorsement.
I became fearful that the detractors would score another point on me by virtue of scientific or legal textbooks being far harder than anything an English student might lay eyes upon. However, this is not the case. To give a few examples: a contract law textbook selected at random scores 53.5 (57% of books are harder), a book on Markov chains scores 62.2, and the formidably named "An Introduction to Magnetohydrodynamics" barely breaks into the harder half, with a score of 46.5 (43% of books have lower scores). Of course, a fool like me can't tell the difference between an easy maths textbook and a hard one, and there are, I am sure, many obscure and inaccessible science books which would be deep in the lower percentiles of reading ease, but this type of book is generally not included in Amazon's "Look Inside" scheme, so its stats aren't calculated and presumably won't affect the average.
The question, then, is: Where are all the really hard books? If I just picked books at random, about half of them should be harder than the selection above. And one in every hundred should score in the hardest percentile - something I've got absolutely near. Two more pieces of non-fiction did give slightly better results: Marx's Capital scores 43.3 (36% are harder), and the selected writings of Baudrillard scores 32.5 (21% are harder). It's an improvement, but still, one of every five books I look at should be harder to read than our friend Jean - so why can't I find any? Perhaps this is an error with the calculation itself. The formula for Flesch reading ease, according to Wikipedia, is:

It strikes me that a computer can't calculate syllables as easily as it can, say, letters in a word. Neither Amazon nor Microsoft Word actually produces a total syllable count for a given text (Amazon gives you an average syllable count, rounded spuriously to one decimal point, while Word doesn't even do that) - if they are just approximating, or guessing, based on word lists or the number of letters in a word, then this might not be that accurate a gauge in the first place. But if that is the answer, it is not a very satisfactory one. The very essence of this blog is playing the numbers game on words, applying unreliable numerical explanations to things that are wholly unquantifiable or subjective; given that to begin with we are frivolously adopting such inappropriate modes of thought, to conclude that they are indeed inappropriate seems to beg the question somewhat pathetically. And even if Amazon's Flesch index is entirely off-kilter, surely there would still be some books at the very hardest end of the scale, even if they are there undeservedly?
Thankfully, one writer finally came to the rescue: Kant. His Critique of Pure Reason scores in the hardest 10% of books - a feat that was breathtakingly unequalled, until two minutes later I checked the Critique of Judgment and found it to have a score of 18.4, putting it in the top three percent. Success! It appears that extreme difficulty is a facet of the writer and not of the subject matter, for other Kantian philosophers get much easier scores - it's not philosophy, but Immanuel himself, that is a tough nut to crack. Statistically I should still have been able to find many books of similar difficulty, but perhaps I'm just not looking hard enough; maybe you will have better success. In the meantime, I will be using the frontispieces of Kant - not Joyce, Pynchon or any other writer of fiction - to hide my comic books in.
(A quick postscript: I had intended to create some sort of graph of how hard to read a book is against how good it is, using Amazon's star rating as a measure of the latter. It didn't work out, however, because almost every book on Amazon has a very similar average rating (around four out of five). This is because by and large people only review a work if they love it or totally despise it; therefore most books have a ton of 5- and 4-star ratings, quite a few 1-stars, and almost no ratings of 2 or 3. So, on these continuous linear scales, books vary widely in terms of difficulty, but every book is roughly as good as every other.)