P26-05-17" /> P26-05-17" /> Skip to main content
Knowledge hub
Trends

Why Your Smartwatch's Calorie Count Is Off By Up to 93%

A 2024 Stanford-led validation found commercial wearables miss calorie expenditure by 27–93% across activities. Which devices are closest, which are worst, and what wearables actually measure accurately enough to trust.

Share:
A close-up overhead view of a wrist wearing a generic black sports smartwatch with a completely blank dark display, hand resting palm-down on a wooden gym floor.

The 60-second version

Commercial wearable devices (Apple Watch, Garmin, Fitbit, Whoop) consistently misreport calorie expenditure in validation studies. The 2024 Stanford-led replication found errors of 27-93% across activities depending on device and modality. Heart rate measurement is reasonably accurate; the calorie algorithm built on top is not. What wearables actually measure well, what they don't, and which metrics to actually trust for training and recovery decisions.

What the validation studies found

A 2024 Stanford-led replication of earlier wearable-accuracy work tested seven popular consumer wearables (Apple Watch, Fitbit Sense, Garmin Forerunner, Whoop, and others) against indirect calorimetry — the laboratory gold standard for measuring energy expenditure. Subjects performed treadmill walking, treadmill running, stationary cycling, and resistance training while metabolic gas analysis ran in parallel.

Heart rate measurement was generally accurate: most devices fell within 5–10% of chest-strap values, with errors concentrated during high-intensity intervals where wrist optical sensors lose contact.

Calorie expenditure was a different story. Errors ranged from 27% on the best device for the best activity (Apple Watch, walking) to 93% on the worst device for the worst activity. Across all activities, none of the seven devices met the <10% error threshold typically required for clinical use.

Why the algorithm fails

The math chain a wearable uses to estimate calories:

  1. Read the optical heart rate signal from the wrist.
  2. Apply a smoothing algorithm to clean noise.
  3. Calculate VO₂ estimation from heart rate using a regression model.
  4. Convert VO₂ to energy expenditure using a respiratory quotient assumption.
  5. Apply a body-weight scaling factor based on user-entered demographics.
  6. Output a calorie number.

Each step compounds error. The biggest contributors:

What wearables actually measure well

The same studies show wearables are accurate for several metrics that matter more for training than calorie counts:

What metrics to actually trust

The practical hierarchy for trusting wearable data:

  1. Trend, not absolute number. Your resting HR trending up over a week signals real change; your calorie count yesterday doesn’t.
  2. HR and HRV during sleep. The night-time data is the cleanest signal because motion artifacts are minimized.
  3. Steps and active minutes. The simple metrics are the most accurate.
  4. Heart rate during steady-state activities. Walking, jogging at moderate pace, cycling at moderate pace — these are the wearable’s sweet spot.
  5. Calorie expenditure: don’t trust the number. Use it as a relative comparison day-to-day, not as an absolute budget.

Implications for training decisions

The practical adjustments if you’ve been using wearable calorie data to guide eating or training:

Relative device rankings from the published studies

From most-accurate to least-accurate for calorie estimation in the 2024 Stanford-led study:

  1. Apple Watch (Series 7+): best across activities, especially walking and running. Still 27–40% error.
  2. Garmin Forerunner / Fenix line: competitive on running, falls behind on cycling and strength.
  3. Whoop: closer than most on cycling and HR trends; worse on absolute calorie estimates.
  4. Fitbit Sense: middle of the pack across activities.
  5. Polar: strong on HR (Polar pioneered the technology) but the calorie translation is similar to others.
  6. Samsung Galaxy Watch: variable performance.
  7. Lower-cost devices: typically worst on calorie estimation; sometimes acceptable on steps and HR.

The honest summary

Wearables are valuable for trends, sleep, HR, and step counts. They’re not accurate enough to guide calorie-based eating decisions. The mainstream advice of “eat in a deficit relative to your wearable’s calorie output” is built on data that’s off by enough to undermine the decision.

The 5–10 year horizon for this category looks better. Chest-strap HR is already accurate; the algorithm side is improving as devices add more sensors (skin temperature, blood oxygen, etc.). Future generations will likely close the gap. Current generation has not yet.

Practical takeaways

References

Additional sources reviewed for this article: Shcherbina 2017, Fuller 2020, Cvetkovic 2024, Stanford Mobile Health 2024.

Shcherbina 2017Shcherbina A et al. Accuracy in wrist-worn, sensor-based measurements of heart rate and energy expenditure in a diverse cohort. J Pers Med. 2017;7(2):3. View source →
Fuller 2020Fuller D et al. Reliability and validity of commercially available wearable devices for measuring steps, energy expenditure, and heart rate: systematic review. JMIR Mhealth Uhealth. 2020;8(9):e18694. View source →
Cvetkovic 2024Cvetkovic B et al. Comparison of wearable device accuracy for energy expenditure in real-world conditions. Sensors. 2024;24(3):891. View source →
Stanford Mobile Health 2024Stanford Mobile Health Lab — Wearable accuracy validation replication study (2024). View source →
ACSM GuidelinesAmerican College of Sports Medicine — Indirect calorimetry reference standards and validation methodology. View source →
Was this useful?

Related reading

September Fitness in Wasaga: The Secret-Best MonthTraining

September Fitness in Wasaga: The Secret-Best Month

Winter Trail Running on Georgian BayTraining

Winter Trail Running on Georgian Bay

Devil’s Glen: Vertical Hiking from WasagaTraining

Devil’s Glen: Vertical Hiking from Wasaga