More on “All That Apply” Questions
by Steven B. Just

Two weeks ago I wrote an article on "All That Apply" questions discussing the fact that many test creators don’t use them because learners find them “difficult” (which they are, compared to traditional multiple choice questions). This raises the question:” As compared to other question types, how difficult are they?” Let’s see if we can answer that question using some simple math.

First of all we have to control for content difficulty and assume that the questions we are comparing all have the same degree of content difficulty even though they are of different type. For example, though All That Apply questions are generally more difficult than multiple choice questions, it is possible to write a very difficult multiple choice question and a very easy All That Apply question about the same topic (US presidents):

Multiple Choice Question:

Who was the 21st president of the United States?

Rutherford B. Hayes
James Garfield
Grover Cleveland
Chester A. Arthur
Ulysses S. Grant

Unless you know a lot more about US History than I do, that’s a hard question.

All That Apply Question:

Select the presidents who served BEFORE Abraham Lincoln

George Washington
Bill Clinton
Thomas Jefferson
John Kennedy
George Bush

And that’s an easy question.

By extension, this demonstrates that it is possible to write multiple choice questions and All That Apply questions of comparable content difficulty. In other words, we are trying to make an “all things being equal” comparison.

Assuming that there is no difference in content difficulty let’s look at three question types in order of their “intrinsic” difficulty:

1. True/False
2. Multiple Choice
3. All That Apply

(You’ll see why I introduced True/False in a moment.)

Let’s start with multiple choice questions. Most of the tests I see tend to have an average score of around 90%. So, if this is the case, and if all the questions are multiple choice then the average difficulty of any one multiple choice questions is .9, meaning, on average, 90% of the test takers will get that question correct.

Now let’s look at True/False questions. Clearly (all things being equal) they are easier than multiple choice questions. For a content-equivalent test of T/F questions rather than five-choice multiple choice questions the average score would now be 94% (read my article to see why). Thus the difficulty level of each T/F question is .94. (Difficulty values go in the opposite direction of intuitive. A difficulty of 1 is easiest and a difficulty of 0 is hardest.)

Now let’s look at the All That Apply question difficulty. Again, let’s assume the same content difficulty. Consider the question above:

Select the presidents who served BEFORE Abraham Lincoln

George Washington
Bill Clinton
Thomas Jefferson
John Kennedy
George Bush

If you think about the typical All That Apply question it is really a series of True/False Questions, where each choice may be true or false. So in effect this question could be rewritten as five true/false questions:

True or False: George Washington served BEFORE Abraham Lincoln
True or False: Bill Clinton served BEFORE Abraham Lincoln
etc.

Using our difficulty assumptions from above, if this single question were presented as five separate True/False questions, each would have a difficulty of .94 and the average test score would be 94%. So, then why is an All That Apply question so much more difficult? Because, unless you give partial credit the learner must get ALL of the parts (individual T/F questions) correct to earn a point. Even one incorrect answer results in all of the answers being marked incorrect.

Now let’s see if we can quantify the “intrinsic difficulty” of an All That Apply question (again, without partial credit allowed). If the probability of getting one T/F question correct is .94 then the probability of getting all five correct is .94x.94x.94x.94x.94 = .73.

So a test consisting of only five-choice All that Apply questions with no partial credit and the same content difficulty as the multiple choice and true/false examples we used above will have an average score of 73% as compared to 90% on the multiple choice test and 94% on the True/False test. Or, put another way, a test consisting of only five-choice All That Apply questions, without partial credit, will have an average score 17% less than a comparable test consisting of only multiple choice questions!

(Oh, Chester A. Arthur was the 21st president of the US.)

Print Article