Flesch-Kincaid: Threat or Menace?
Mar. 23rd, 2005 11:11 pmFor the past week I've been leaving the novel behind. Instead, I've been working on an outline for a young adult novel I've wanted to write. It's a time travel story about a teenage boy who has to stop his future self from destroying history. I'm using it for my application to the Boston Public Library's Children's Writer-in-Residence program. I imagine I have a whelk's chance in a supernova of getting this fellowship, but it can't hurt to apply.
As I was working on the proposal, I started thinking about readability. Although I aim for a transparent style in my fiction, this was the first time I was developing a story for a younger audience. If this book is going to sell to teenagers or preteens, I'm going to need to make sure I write it at their level.
One of the many books on writing I own is FICTION WRITER'S BRAINSTORMER by James V. Smith, Jr. Smith has a chapter called "A Brainstormer's Guide to Revision and Editing" in which he describes the way he analyzed bestselling fiction. He typed into his computer selections from ten bestselling novels, one each by Fannie Flagg, Kaye Gibbons, John Grisham, Jan Karon, Stephen King, Elmore Leonard, Terry McMillan, Anna Quindlen, Danielle Steel, and Wallace Stegner. And then he ran them through his grammar checker to find out what the Flesch-Kincaid scale said about them.
I'm not going to go into all the details about the Flesch-Kincaid scale; you can find information on it by searching the web or picking up a good reference book. All we need to know for our purposes is that there are two ways to rank prose. Either we can give it a "readability" score that is cast as a percentage, or we can give it a grade level, which is based on the twelve standard grades in the American educational system.
So, for example, a story with a 75% readability will be understood by 75% of readers. A story with a grade level of 8 will be understood by anyone with an 8th grade education or higher.
Smith did his analysis with an open mind. I imagine that most of us would expect that the more commercial writers would score higher on readability and lower on grade level. And perhaps we'd see the reverse for the more literary writers.
Smith's results surprised him. Despite deliberately choosing a mix of commercial and literary writers, he found that many of the results fell into the same range. The average in four separate categories was as follows:
The amount of passive voice the writers used ranged from 2.3% to 13.43%.
The number of characters per word ranged from 3.72 to 4.58.
The readability ranged from 72.34% to 91.84%, with an average of 83.1%.
Finally, on the Flesch-Kincaid Grade Level scale, the range was 2.68 to 6.3, with an average grade level of 4.4.
In other words, he found that the bestselling writers were aiming their prose, prose that is read by a majority of adult readers in the country, at a fourth grade level.
He also analyzed one his own work, and to his chagrin discovered that his readability was only in the 60% range, and that his prose aimed at an 8th grade level. He decided to use this information as a tool to revise his future work.
He created his Ideal Writing Standard, which I'm calling the Smith Writing Ideal Standard or SWID, because I can pronounce it "swid." He said that from now on, he revises all his work to the following four standards PER ANY SCENE:
No more than 4.25 characters per word.
No more than 5% passive voice
No less than an 80% readability on the Flesch Reading Ease scale.
A Flesch-Kincaid Grade Level of 5 (although on the chart, he lists that as 4-6).
Now, as it so happens, today HarperCollins released the first section of the upcoming Neil Gaiman novel ANANSI BOYS. On a whim, I decided to copy the text into my computer and run the grammar checker to see what it would say about its readability.
The results were astonishing:
4.2 characters per word
Passive voice: 3%
Readability: 79.9%
Flesh-Kincaid Grade Level: 5.4
In other words, according to the numbers, Gaiman easily fulfills the Smith Ideal Writing Standard in three of four categories. On the fourth, readability, he misses by a tenth of a percent. I'm inclined just to give it to him.
Now, what are we to make of these results? Does this mean that all bestselling writers are sitting at their word processors, analyzing their scenes by formulae, and thus ensuring their literary stardom? Is literary success nothing but a cold, emotionless application of an equation? (Having two degrees in Physics myself, I must admit that the thought appeals to me.)
But, as useful as this tool might be, I don't think the rest of us need to run every single thing we write through the grammar checker. Furthermore, I doubt this is how the bestselling writers do it either.
On the contrary, I suspect that what happens is that as writers write, they learn how to write better. I doubt that Gaiman (or King, or any of the bestselling writers Smith analyzed) are using their grammar checker in such a mechanical fashion. Instead, I think that they've developed an instinct, a knack if you will, for language. Somewhere inside their minds they've learned what works and what doesn't, and it just so happens that as they write, their instincts kick in, and they make their prose as accessible as possible.
I also think they revise a lot.
But for the rest of us...well, I'm not going to be as evangelical as Smith and endorse his method 100%. But I will suggest that it can't hurt us to try a few scenes from our work through the grammar checker and see what pops out at the end. Because the higher the grade level at which we pitch our prose, the fewer our readers.
By the way, for those of you wondering, just before posting I ran this blog entry through the grammar checker. Here are the results:
4.4 characters per word
Passive voice: 3%
Readability: 63.7%
Flesh-Kincaid Grade Level: 8.7
Hm. I've got a lot of learning in front of me.
As I was working on the proposal, I started thinking about readability. Although I aim for a transparent style in my fiction, this was the first time I was developing a story for a younger audience. If this book is going to sell to teenagers or preteens, I'm going to need to make sure I write it at their level.
One of the many books on writing I own is FICTION WRITER'S BRAINSTORMER by James V. Smith, Jr. Smith has a chapter called "A Brainstormer's Guide to Revision and Editing" in which he describes the way he analyzed bestselling fiction. He typed into his computer selections from ten bestselling novels, one each by Fannie Flagg, Kaye Gibbons, John Grisham, Jan Karon, Stephen King, Elmore Leonard, Terry McMillan, Anna Quindlen, Danielle Steel, and Wallace Stegner. And then he ran them through his grammar checker to find out what the Flesch-Kincaid scale said about them.
I'm not going to go into all the details about the Flesch-Kincaid scale; you can find information on it by searching the web or picking up a good reference book. All we need to know for our purposes is that there are two ways to rank prose. Either we can give it a "readability" score that is cast as a percentage, or we can give it a grade level, which is based on the twelve standard grades in the American educational system.
So, for example, a story with a 75% readability will be understood by 75% of readers. A story with a grade level of 8 will be understood by anyone with an 8th grade education or higher.
Smith did his analysis with an open mind. I imagine that most of us would expect that the more commercial writers would score higher on readability and lower on grade level. And perhaps we'd see the reverse for the more literary writers.
Smith's results surprised him. Despite deliberately choosing a mix of commercial and literary writers, he found that many of the results fell into the same range. The average in four separate categories was as follows:
The amount of passive voice the writers used ranged from 2.3% to 13.43%.
The number of characters per word ranged from 3.72 to 4.58.
The readability ranged from 72.34% to 91.84%, with an average of 83.1%.
Finally, on the Flesch-Kincaid Grade Level scale, the range was 2.68 to 6.3, with an average grade level of 4.4.
In other words, he found that the bestselling writers were aiming their prose, prose that is read by a majority of adult readers in the country, at a fourth grade level.
He also analyzed one his own work, and to his chagrin discovered that his readability was only in the 60% range, and that his prose aimed at an 8th grade level. He decided to use this information as a tool to revise his future work.
He created his Ideal Writing Standard, which I'm calling the Smith Writing Ideal Standard or SWID, because I can pronounce it "swid." He said that from now on, he revises all his work to the following four standards PER ANY SCENE:
No more than 4.25 characters per word.
No more than 5% passive voice
No less than an 80% readability on the Flesch Reading Ease scale.
A Flesch-Kincaid Grade Level of 5 (although on the chart, he lists that as 4-6).
Now, as it so happens, today HarperCollins released the first section of the upcoming Neil Gaiman novel ANANSI BOYS. On a whim, I decided to copy the text into my computer and run the grammar checker to see what it would say about its readability.
The results were astonishing:
4.2 characters per word
Passive voice: 3%
Readability: 79.9%
Flesh-Kincaid Grade Level: 5.4
In other words, according to the numbers, Gaiman easily fulfills the Smith Ideal Writing Standard in three of four categories. On the fourth, readability, he misses by a tenth of a percent. I'm inclined just to give it to him.
Now, what are we to make of these results? Does this mean that all bestselling writers are sitting at their word processors, analyzing their scenes by formulae, and thus ensuring their literary stardom? Is literary success nothing but a cold, emotionless application of an equation? (Having two degrees in Physics myself, I must admit that the thought appeals to me.)
But, as useful as this tool might be, I don't think the rest of us need to run every single thing we write through the grammar checker. Furthermore, I doubt this is how the bestselling writers do it either.
On the contrary, I suspect that what happens is that as writers write, they learn how to write better. I doubt that Gaiman (or King, or any of the bestselling writers Smith analyzed) are using their grammar checker in such a mechanical fashion. Instead, I think that they've developed an instinct, a knack if you will, for language. Somewhere inside their minds they've learned what works and what doesn't, and it just so happens that as they write, their instincts kick in, and they make their prose as accessible as possible.
I also think they revise a lot.
But for the rest of us...well, I'm not going to be as evangelical as Smith and endorse his method 100%. But I will suggest that it can't hurt us to try a few scenes from our work through the grammar checker and see what pops out at the end. Because the higher the grade level at which we pitch our prose, the fewer our readers.
By the way, for those of you wondering, just before posting I ran this blog entry through the grammar checker. Here are the results:
4.4 characters per word
Passive voice: 3%
Readability: 63.7%
Flesh-Kincaid Grade Level: 8.7
Hm. I've got a lot of learning in front of me.
no subject
Date: 2005-03-24 04:56 am (UTC)My latest entry, btw: 4.1 char/word, 6% passive, 78.6 reading ease, grade 5.8. I'm doing better than you! How come I don't get any Hugos?
no subject
Date: 2005-03-24 04:13 pm (UTC)Seriously, though, I just did an analysis of one of my Hugo-nominated stories. It came out as follows:
4.3 char/word
0% passive voice
81.7% reading ease
grade level 4.0
So perhaps when I write my blog entries, I tend to write at a different level.
(no subject)
From:no subject
Date: 2005-03-24 09:33 am (UTC)Characters/Word: 4.5, Passive: 0%, Reading Ease: 49.1, Grade Level: 12.0 *sigh*
oops
Date: 2005-03-24 09:34 am (UTC)Re: oops
From:Re: oops
From:And this is why....
Date: 2005-03-24 12:05 pm (UTC)Because everything always seems so EASY to read.
Now I know why.
no subject
Date: 2005-03-24 01:07 pm (UTC)no subject
Date: 2005-03-24 02:11 pm (UTC)If you give one of these "fourth-grade level" best-selling novels to a group of average fourth-graders, how well will they understand the text?
no subject
Date: 2005-03-24 04:19 pm (UTC)I think that the two novels you cite would probably score at a high grade level and at a lower than average readability.
As for your second question, I don't know enough about real fourth-grade ability or the Flesch-Kincaid scale to make an educated guess. My suggestion would be to do the actual experiment. Pick one of the novels that really comes out to a fourth grade level and see how well a real fourth-grader grasps it.
no subject
Date: 2005-03-24 03:27 pm (UTC)no subject
Date: 2005-03-24 04:23 pm (UTC)Remember that nowhere in his discussion does Smith claim that this analysis leads to literary merit. The one thing he was looking for was if bestselling novels had any sort of commonality on the Flesch-Kincaid. We could debate all day as to the difference between a bestseller and a quality novel.
(no subject)
From:(no subject)
From:(no subject)
From:(no subject)
From:(no subject)
From:no subject
Date: 2005-03-24 06:11 pm (UTC)no subject
Date: 2005-03-24 09:40 pm (UTC)Run a passage from your fiction (either your novel or one of your shorts) and see what the result is. Better yet: Run something from your first short story and your latest and see what the difference is.
Note: The longer the scene the better the stats will be for averages. Dialogue heavy scenes will probably score better, on average, than description heavy scenes, as people speak with the intent of greater "readability".
Zhaneel
no subject
Date: 2005-03-25 01:37 am (UTC)However, I did run one Hugo nominee through, and posted the results above. I'll restate them here:
4.3 char/word
0% passive voice
81.7% reading ease
grade level 4.0
no subject
Date: 2005-03-25 07:44 am (UTC)just a thought.
no subject
Date: 2005-03-25 04:54 pm (UTC)no subject
Date: 2005-03-25 08:32 am (UTC)I call this tragic.
So I must be stifled that a moron can access it? No. I refuse. In this fair land of public education, where everyone who has passed through school in the past fifty years or so has been expected to pass all of the 12 mandatory grades, how is it that the reading level for the average purchaser of a best-seller is most comfortable with the same book a fourth grader is supposed to be able to enjoy? My view is that whatever story I display for public consumption, it should assume a readership of at least those whom have successfully passed out of the school system. If it challenges the reader, I have done a good job, if it fails that, then the average American is an insufferable moron and I don't care that they cannot understand what they should be able to.
Great subject, I'm happy to see it brought up.
no subject
Date: 2005-03-25 11:14 am (UTC)At the same time, if you write more advanced stuff, thats not wrong per se. If you set out to say "haha I'm better than you because my stuff is harder" it might be.
(no subject)
From:(no subject)
From:(no subject)
From:(no subject)
From:(no subject)
From:(no subject)
From:You've got to Obfuscate the Negative
From:i passed out of the school system
From: (Anonymous) - Date: 2005-04-02 05:36 am (UTC) - Expandhere via Neil Gaiman's blog...
Date: 2005-03-25 08:41 am (UTC)Here's the Flesch-Kincaid for the last history paper I handed in:
Words per sentence: 23.9 (my personal best is 24.8, by the way)
Characters per word: 4.9
Passive sentences: 3%
Reading ease: 38.6%
Grade level: 12.0
Here's the Flesch-Kincaid for the excessively wordy first draft of a 6,000-word short story I wrote last week:
Words per sentence: 13.4 (there is very little dialogue)
Characters per word: 4.2
Passive sentences: 1%
Reading ease: 79.9%
Grade level: 5.3
Clearly, I'm on the right track. Part of the reason I quit academia is because I no longer wanted to write solely for my fellow geeks. If I'm going to go to all the trouble to research and write a book, I might as well write something my parents would enjoy reading, too.
Now I want to run this test on Valley of the Dolls. I got in big trouble for writing a book report on it in fifth grade, even though I stated quite bluntly that it was the dumbest book I'd ever read. If that one's at a fifth-grade reading level, I'll eat my desk chair.
no subject
Date: 2005-03-25 08:43 am (UTC)Okay, this post didn't even qualify for a high school reading level, but my point is that whatever my piece's grade level comes out as, I have not targeted an ignorant audience. Do people even look up words they don't understand anymore? Harrumph.
no subject
Date: 2005-03-25 05:29 pm (UTC)no subject
Date: 2005-03-25 11:17 am (UTC)Ooh, this is nifty. :O I suspect I'll be obsessively checking my stats from now on~
Interesting. :D I seem to be going in the right direction.
Neil sent me (thanks, Neil!)...
Date: 2005-03-25 12:07 pm (UTC)As your statistics show, this isn't a contest between art and commerce; it's perfectly possible for stylists to rate well because some of what is scored for is just good style (minimising the passive voice, using the simplest language possible...).
We are using "readability" ambiguously here, since it actually means "scoring well on a particular test" rather than "something I will find easy to read". What's more, readability, whatever it may be, isn't everything (as
Re: Neil sent me (thanks, Neil!)...
Date: 2005-03-25 08:39 pm (UTC)And, of course, we are defining "readability" here in only one way -- the way Flesch and Kincaid did. There are surely many othr ways to define the concept.
no subject
Date: 2005-03-25 01:49 pm (UTC)Plus - Neil Gaiman posted your blog. How cool is that?
no subject
Date: 2005-03-25 08:42 pm (UTC)Um, thanks!
so I'd like to friend you, if that's okay
My policy is anyone can friend me, and I'll friend them back. Can't promise to read everyone else's posts, though. And I hope you actually find the rest of this blog interesting.
Plus - Neil Gaiman posted your blog. How cool is that?
Oh, it's way cool. Neil's a nice guy, and hard to dislike, even if one comes in second to him on the Hugo balloting. :-)
no subject
Date: 2005-03-25 02:05 pm (UTC)no subject
Date: 2005-03-25 08:43 pm (UTC)(no subject)
From:no subject
Date: 2005-03-25 03:14 pm (UTC)One of the things that really makes Gaiman a dead brilliant author is that he uses language in a very simple way to tell a very complex story. I've seen American Gods and Neverwhere used in adult literacy programs, because they're easy to read without being boring. The 'level' that something's written to refers, iirc, only to the language and grammar used, not to the thoughts that are being expressed. Which writer is better, the one who tells an epic myth using fourth-grade vocab or the one who writes "Dick and Jane" using language only a graduate student understands?
no subject
Date: 2005-03-25 05:45 pm (UTC)no subject
Date: 2005-03-25 05:58 pm (UTC)However, I think that his style has changed since his Book of the New Sun days. I found The Wizard Knight to be a very easy read compared to Wolfe's earlier works.
I think perhaps Gaiman's rubbed off on him...
(no subject)
From:no subject
Date: 2005-03-25 06:08 pm (UTC)Good luck getting the residency!
no subject
Date: 2005-03-25 08:44 pm (UTC)kinsleycastle.blogspot.com
Date: 2005-03-26 01:22 am (UTC)Do you know what sort of writer I want to be? Imagine some guy finds a book on the street. The cover and the first few pages with all the copyright information have been torn off. But in spite of that, and in spite of the fact he's never read this book before, the guy can still tell it's me after a couple of pages.
no subject
Date: 2005-03-26 07:44 pm (UTC)no subject
Date: 2005-03-27 08:09 pm (UTC)When I was a teacher
Date: 2005-03-28 12:36 am (UTC)I do think it is interesting, but, the NY Times, which is supposed to be the hardest paper in the world to read tests out only around 8th grade. Most newspapers are around 4th grade. That best-selling fiction would follow suit is not so surprising. You want people to be able to read it with ease, not have to struggle through it!
no subject
Date: 2005-04-13 10:06 pm (UTC)I pulled up an article I had written a couple years ago and it was grade 11.8, 5% passive.
I wonder to what extent my dialog lowers my writing grade level. I have a good many one-word "sentences" in dialog.
no subject
Date: 2005-04-14 12:54 pm (UTC)no subject
Date: 2005-05-27 04:22 am (UTC)you've written this entry ages ago, but i found it recently through a literary journal..
where did you get this "grammar checker"? Is it an actual program? I would like to obtain it and use it to check just how juvenile my writing is...
no subject
Date: 2005-05-27 04:35 am (UTC)Flesch-Kincaid and Bible version reading levels
Date: 2005-07-05 12:54 am (UTC)Interesting, I have run some of my own posts through the Microsoft Word grammar checker, and, to my dismay, discovered that the reading level of what I had written was much higher than I had expected. That ran counter to what I try to do when writing and how I believe "standard" prose for most readers should be written. I think the length of many of my sentences, some quite run-on, raised my F-K score.