Autonomy
What the robot does without a human in the loop: mapping, recovery, edge-case behavior, unattended run time. Anchored to the Autonomy Ladder level.
A single headline number only earns its place if the weights behind it are visible. The Robovations Score is the weighted average of five dimensions, each scored 0 to 100, published with its full breakdown on every robot page.
A headline score is only useful if the path to it is visible. Every Robovations Score is the weighted average of the same five dimensions, using the same weights, scored by the same rubric. The weights are argued on this page; the per-robot scores are on each robot page; the weighted arithmetic is the same for every product in the database.
A score is not a recommendation. A robot with a 72 is not “better than” a 68 any more than a 200-page book is better than a 180-page book. The score summarizes a relationship between a robot and a set of concerns that matter in a household. Two robots with the same score can serve different households better. The dimensions and weights exist so a reader can tell whether the score was built on the things that matter to them.
Each dimension is scored 0 to 100 from its own sources and rubric. The weights are set by how consequential each is to ownership, and how hard it is for a buyer to verify before purchase.
What the robot does without a human in the loop: mapping, recovery, edge-case behavior, unattended run time. Anchored to the Autonomy Ladder level.
Whether the robot holds up across ownership. Mean time between interventions, failure modes, part availability, and firmware cadence after launch.
The effort the robot asks of the owner: cleaning, part swaps, consumable cadence, recalibration.
What the price actually returns. Total cost over three years against the capability delivered.
What the robot sends home, to whom, on what schedule, and with what owner controls.
A worked example shows the arithmetic. The rubric shows the inputs. For each of the five dimensions, these are the observable thresholds that move a robot between bands. Two analysts applying the same rubric to the same evidence basket should land in the same band.
Reliability draws from owner reports at six, twelve, and twenty-four months. New platforms have less of this data on file. We score them against a pre-launch evidence basket — manufacturer documentation, FCC filings, peer-reviewed teardowns, beta-program reports, and the owner data that does exist — then mark the headline score Provisional. The Provisional band is widened toward the category median to reflect the uncertainty. A score moves out of Provisional once the six-month owner window accrues, at which point the rubric above takes over.
The 0 to 100 score maps to four bands, each with a threshold. The bands are argued on this page, printed on every robot page, and never vary by category.
The Robovations Score above is built from owner-reported failure rates, firmware-iteration history, published privacy policies, and consumable-cadence data. None of those exist for a robot that has not shipped. A pre-release robot scored against the Robovations Score rubric is a score detached from the criteria it claims to measure.
So a pre-release robot does not get a Robovations Score. It gets a Pre-Release Assessment instead: a parallel evaluation built from five dimensions that can be honestly evaluated before shipping. The two frameworks live side by side and a robot is scored under exactly one. Lifecycle stage decides which.
| Lifecycle stage | Evaluation framework | Label on the page |
|---|---|---|
| Pre-release Announced, not yet shipping to consumers |
Pre-Release Assessment | “Pre-Release Assessment · 58 of 100” |
| Provisional Shipping under 6 months, thin owner data |
Robovations Score | “Robovations Score · 68 of 100 · Provisional” |
| Verified Shipping 6+ months, sufficient evidence |
Robovations Score | “Robovations Score · 72 of 100” |
| Discontinued Withdrawn from market |
Robovations Score (frozen) | “Robovations Score · 70 of 100 · Discontinued [date]” |
Each one is evaluable from public information before the product ships. Weights sum to 100% and are front-loaded on the dimensions most predictive at the pre-release stage.
Pre-Release Assessment bands are distinct from the Robovations Score bands (Poor / Average / Good / Excellent) so a reader instantly knows which framework they are reading.
Minimal evidence, fundamental unknowns. The product may not be real.
Meaningful gaps in plausibility or readiness. Direction is set, execution is not.
Defensible path forward with material open questions. The publication can take a position.
Strong signals across most dimensions. Specific risks are named and bounded.
High confidence the product will ship as demoed. Narrow remaining uncertainty.
When a pre-release product becomes available to consumers, three things happen in sequence:
The Pre-Release Assessment never converts into the Robovations Score. They measure different things. The transition is a switch, not an interpolation.
A descriptive score that earns the reader’s trust has to say where it sits relative to the prior art. The Robovations Score is not a derivative of any single existing framework, but it does take ideas from several and depart from each in places that matter for consumer robotics.
A single weighted headline summarizing several reads, published with the underlying breakdown.
We do not test in a lab. The dimensions are scored from owner data, manufacturer documentation, and source-traceable evidence rather than benchtop measurements.
The principle that autonomy is a spectrum with discrete behavioral breakpoints. Anchors our Autonomy dimension, which is keyed to the Autonomy Ladder.
J3016 governs a vehicle on a road. We classify a robot in a household, where the operating envelope is messier and the safety floor is different.
The distinction between capability and human-readiness as separate axes. Our Human Readiness Criteria sit on a parallel track to the Score for the same reason.
HRL is a procurement framework for systems acquisition. Our HRC describes whether a household can adopt a robot today. The names are similar; the intended user is different.
The premise that safety, privacy, and operational autonomy are distinct concerns that need to be scored independently.
ISO 13482 is a conformity standard. The Robovations Score is a descriptive signal — we do not certify, and a robot scoring well here is not certified to ISO 13482.