Cognitive Reflection Test (CRT-7) Assessment
Take a seven-item CRT assessment in your browser, separate intuitive lures from other misses, and review section scores with rethink notes.Reflection brief
Score status
- {{ question.id }}. {{ question.text }}
Result details
Share result
Share this result page with someone you trust to review your answers and result.
Outcome split
{{ briefLead }}
- {{ point }}
Current score lane
| Score | Lane | Next move | Copy |
|---|---|---|---|
| {{ band.range }} | {{ band.label }} | {{ band.shortAction }} |
Reset plan
- {{ step }}
Use-result notes
- {{ note }}
Section outcome map
Debrief notes
| Item | Outcome | Debrief | Response | Reset move | Copy |
|---|---|---|---|---|---|
|
{{ card.title }}
{{ card.context }}
|
{{ card.badge }} |
{{ card.lead }}
{{ card.detail }}
|
Your response: {{ card.response }}
Keyed answer: {{ card.correct }}
|
{{ card.checkMove }} |
Answer review
| # | Item | Section | Your response | Intuitive lure | Keyed answer | Status | Reset move | Copy |
|---|---|---|---|---|---|---|---|---|
| {{ row.id }} | {{ row.short }} | {{ row.sectionLabel }} |
{{ row.responseLabel }}
Changed {{ row.changedCount }} time{{ row.changedCount === 1 ? '' : 's' }}
|
{{ row.lureLabel }} | {{ row.correctLabel }} | {{ row.statusLabel }} | {{ row.checkMove }} |
Introduction:
A cognitive reflection item feels easier than it is because the first answer usually comes from a shortcut that works in a nearby problem, but not in the problem on the page. The task is not only to calculate. It is to notice that the obvious move may be using the wrong relationship, the wrong base, or the wrong unit of comparison.
The Cognitive Reflection Test, usually shortened to CRT, became well known because its original three items put that conflict in a compact form. A person can see a price gap, a production rate, or a doubling sequence and produce a tempting answer before checking the structure. A correct answer usually requires a small pause, a restatement of the relationship, and a second look at what the numbers actually describe.
- Intuitive lure
- The standard wrong answer that follows from the quick shortcut.
- Reflective override
- The moment when the first answer is checked against the real relationship and replaced when needed.
- Other miss
- A wrong answer that is not the classic lure, often caused by later arithmetic or setup drift.
- Freshness
- How much the answer reflects live reasoning rather than prior exposure, memory, or coaching.
The seven-item form broadens the original set with combined rates, ranking, cash-flow accounting, and percentage-base change. That matters because the classic bat-and-ball, machines, and lily-pad items are widely circulated. Someone may remember one answer, recognize a familiar pattern, or improve with practice without showing the same fresh reflection on newer items.
CRT scores are useful as a narrow snapshot of reflective problem solving, not as a full measure of intelligence, education, or judgment. The most informative reading compares correct answers with lure picks, looks for section differences, and treats prior familiarity as a real limit on what the score can prove.
How to Use This Tool:
Work through the seven questions in one sitting. The report appears after every item has a selected answer, so a partial run is useful for thinking but not for scoring.
- Select
Start CRT-7 questionsand answer from your best current reasoning, not from a memorized answer key. - Read the full prompt before choosing. Several lures come from reacting to the largest or most familiar number too early.
- Use the question navigator and progress bar to find unanswered items. The finished report loads only when progress reaches
100%. - Review the headline score,
Lure avoidance,Classic trio, andExtension quartetbefore focusing on individual items. - Open the debrief and answer review rows to compare your response with the keyed answer, the intuitive lure, and the suggested reset move.
- Use the copy, CSV, DOCX, chart download, or result-link options only when you are comfortable sharing answer choices and score context.
A shared result link can reopen final answers and charts, but it does not preserve the original sequence of answer changes from the live run.
Interpreting Results:
Start with the total score, then explain it with the error pattern. A lower score with many intuitive lures points to first-answer capture. A lower score with few lures and several other misses means the obvious trap was often avoided, but later setup, arithmetic, or comparison work still broke down.
| Score | Lane | Useful read |
|---|---|---|
0-1 |
Heavy first-impression pull | Slow the read and write the governing relationship before choosing. |
2-3 |
Mixed override | Identify which trap families still survive a second look. |
4-5 |
Mostly reflective | Use the item notes to find the remaining unstable problem types. |
6-7 |
Strong override | Check item familiarity before treating the pass as a fresh reflection signal. |
The section split can be more useful than the total when the classic items are familiar. A strong classic trio with a weaker extension quartet suggests that newer rate, rank, cash-flow, or percentage-base traps still need deliberate checking. The reverse pattern suggests the famous original lures remain sticky even when the added items go better.
Rescued after rethink is especially relevant because it means an initially wrong live answer changed to the keyed answer before finishing. Rechecked means an answer was revisited, but the final answer was not rescued. A replayed result keeps final scoring and answer review, but revision history is unavailable.
Use the charts as a compact summary, not as a diagnosis. The outcome chart separates correct answers, intuitive lures, and other misses. The section chart shows whether the original trio or the four added items carried more of the friction.
Technical Details:
CRT scoring treats each item as a conflict between a keyed answer and a common lure. The total score is the count of keyed answers, while lure avoidance asks how often the final response avoided the classic tempting answer. Those two quantities answer different questions. A person can avoid lures but still miss items, so lure avoidance should always be read with the correct and other-miss counts.
The seven-item form preserves the original price, rate, and doubling problems while adding four item families that reduce dependence on the most famous examples. That expanded structure makes section comparison useful, but it does not turn the result into a standardized diagnostic category.
Formula Core
For each item, c is 1 when the final answer matches the keyed answer and 0 otherwise. On a completed run, Answered is 7. For example, a run with 5 keyed answers and 1 lure pick scores 5/7 with 86% lure avoidance because 6 of 7 final answers avoided the lure.
| Item | Keyed answer | Intuitive lure | Trap family |
|---|---|---|---|
| Bat and ball | 5 cents |
10 cents |
Difference equation |
| Machines | 5 minutes |
100 minutes |
Rate scaling |
| Lily pads | 47 days |
24 days |
Doubling sequence |
| John and Mary | 4 days |
9 days |
Combined rates |
| Jerry | 29 students |
30 students |
Ordinal indexing |
| Pig trade | $20 |
$10 |
Cash-flow accounting |
| Simon | Same amount of money |
5% more money |
Percentage base shift |
| Section | Items | Interpretation boundary |
|---|---|---|
| Classic trio | 3 |
High performance may reflect reflection, familiarity, or remembered solutions because these items are widely known. |
| Extension quartet | 4 |
Added item families test whether the pause-and-check habit transfers beyond the original examples. |
The score lanes are local report labels. They group the 0 to 7 total into review language so a result can point to next steps, but they are not published diagnostic cutoffs.
| Score range | Displayed lane | Report action |
|---|---|---|
0-1 |
Heavy first-impression pull | Slow the read and write the governing relationship before choosing. |
2-3 |
Mixed override | Review which trap families still survive the second look. |
4-5 |
Mostly reflective | Tighten the last few high-friction item types. |
6-7 |
Strong override | Treat repeat attempts carefully if any items already felt familiar. |
This version uses four answer choices for each item. Multiple-choice CRT formats can be useful and have been studied, but response format affects administration details, completion speed, and comparability with open-ended CRT studies. Compare results only with tests that use similar wording and response format.
Responsible Use and Privacy Notes:
CRT-7 is a brief reflection screen, not an IQ test, hiring filter, clinical diagnosis, or broad reasoning profile. Routine scoring runs in the browser. Exports, copied rows, chart files, DOCX reports, and shared result links can still reveal answers, score context, and review notes, so treat them as assessment records.
Worked Examples:
A strong total with a weak extension split
A result of 5/7 can look solid at first. If the split is Classic trio 3/3 and Extension quartet 2/4, the better follow-up is to study the added trap families rather than assume the whole reflection habit is stable.
The same score can hide different misses
Two people can both score 4/7. Three lure picks suggest the first answer kept winning. One lure pick plus two other misses suggests the person often resisted the lure, then made a later setup or arithmetic mistake.
A rescued answer is part of the evidence
If a live run marks an item as Rescued after rethink, the final answer alone is not the whole story. The change shows the pause-and-check behavior that CRT items are intended to reveal.
A replay result needs a narrower read
A shared link can reopen a 6/7 result with charts and answer review. If it is a replay, use the final score and item statuses, but do not infer whether answers were changed during the original session.
FAQ:
Can a high CRT-7 score prove general intelligence?
No. A high score means these seven reasoning traps were handled well on this pass. It does not measure broad intelligence, clinical status, workplace suitability, or long-run decision quality.
Why does the report wait until all seven items are answered?
The score, lure avoidance percentage, and section charts all depend on a complete seven-item set. Use the navigator to find the unanswered row.
What is the difference between an intuitive lure and an other miss?
An intuitive lure is the standard tempting wrong answer for that item. An other miss is any other wrong final answer, often tied to setup, arithmetic, or comparison drift.
Why are rethink counts unavailable on some shared results?
Rethink counts require the live answer-change sequence. A shared replay can restore final answers and scores, but not the original change history.
Is this comparable with published CRT research?
Compare cautiously. The item families match the CRT literature, but the assessment uses guided four-choice responses, while some studies use open-ended answers or different wording.
Are my answers sent away for scoring?
Routine scoring happens in the browser. Sharing or exporting the result can still disclose your final choices, score, and review context.
Glossary:
- Cognitive reflection
- The tendency to check an immediate answer before committing to it.
- Classic trio
- The original three CRT items: Bat and ball, Machines, and Lily pads.
- Extension quartet
- The four added items covering combined rates, ordinal indexing, cash flow, and percentage-base shifts.
- Intuitive lure
- The standard tempting wrong answer for an item.
- Other miss
- A wrong answer that is not the standard lure.
- Rescued after rethink
- A live-session item that started wrong and ended with the keyed answer.
References:
- Cognitive Reflection and Decision Making, Journal of Economic Perspectives, 2005.
- Assessing miserly information processing: An expansion of the Cognitive Reflection Test, Thinking & Reasoning, 2014.
- Effect of response format on cognitive reflection, Behavior Research Methods, 2018.
- A limitation of the Cognitive Reflection Test: familiarity, PeerJ, 2016.
- The non-effects of repeated exposure to the Cognitive Reflection Test, Judgment and Decision Making, 2018.