How a Multistage Fitness Test Works and What It Is Intended to Do
The multistage fitness test — also known as the beep test — is used in schools and athletic programs worldwide. Here's exactly how it works and what it's measuring.
The Short Answer
The multistage fitness test — commonly called the beep test, pacer test, or 20-meter shuttle run — is a maximal aerobic capacity test in which participants run back and forth between two lines 20 meters apart, synchronized to audio beeps that gradually increase in frequency. The test continues until the participant can no longer reach the turning line before the beep. The level reached is used to estimate VO2 max — the maximum rate at which the body can consume oxygen during exercise — which is the primary measure of cardiovascular fitness. The test requires minimal equipment, can be administered to large groups simultaneously, and has been validated against laboratory VO2 max measurement methods.
The Test Protocol
The 20-meter shuttle run is set up with two parallel lines marked 20 meters apart on a flat surface. Participants line up at one line and run to the other, arriving at or before an audio beep. They then turn and run back, again arriving before the next beep.
The test is organized into levels (usually 1-23) and stages (typically 7-16 repetitions per level depending on the version). In early levels, the beeps are spaced far enough apart that participants walk briskly to barely reach the line in time. As levels progress, the time between beeps decreases and the required pace increases. By the later levels, participants are running at near-sprint speeds.
The test ends when the participant fails to reach the turning line before the beep on two consecutive occasions (in most versions). The final level and stage completed is the participant’s score.
How VO2 Max Is Estimated
VO2 max — maximum oxygen uptake, measured in milliliters of oxygen per kilogram of body weight per minute (ml/kg/min) — is the gold standard measure of cardiovascular fitness and aerobic capacity. Laboratory measurement of VO2 max requires specialized equipment and individual testing, making it impractical for schools and large groups.
The multistage fitness test estimates VO2 max from the level achieved using validated regression equations. Higher levels correspond to higher estimated VO2 max values. These estimates have moderate to good correlation with laboratory-measured VO2 max values, making the test a practical field substitute that is “good enough” for population-level assessment, fitness classification, and progress tracking even if it is not as precise as laboratory testing.
What the Test Is Intended to Do
The primary purpose of the multistage fitness test in school and athletic settings is to assess and monitor cardiovascular fitness across populations:
Baseline assessment: Establishing a student’s or athlete’s current cardiovascular fitness level at the start of a program, term, or season.
Progress monitoring: Retesting at intervals to measure improvement or decline in cardiovascular fitness over time, providing feedback on training effectiveness.
Population comparison: Comparing an individual’s VO2 max estimate to age- and sex-specific norms to determine whether fitness is below, at, or above average for their demographic group.
Health screening: Cardiovascular fitness level is a significant predictor of long-term cardiovascular health outcomes. Schools and military organizations use beep test results as part of health screening.
Limitations of the Test
The beep test measures cardiovascular fitness accurately as a field test, but several factors can introduce variation beyond true fitness differences: running surface (hard floors vs. grass), footwear, familiarity with the protocol, motivation and pacing strategy, and the accuracy of the audio equipment used. Athletes who are familiar with the test and have practiced pacing strategies tend to score higher than equally fit athletes taking it for the first time, which means beep test scores can improve with test familiarity independent of cardiovascular fitness gains. This limitation is worth noting when the test is used to track progress: some early improvement may reflect learning rather than fitness improvement. Despite these limitations, the multistage fitness test remains one of the most widely used and practically useful tools in physical education and sports science precisely because it can assess a fundamental health metric — cardiovascular fitness — in large groups simultaneously with minimal resources, providing actionable information that would otherwise require individual laboratory testing unavailable to most schools and community programs.