ProblemsByVin Data Center
Open data · updated weekly

Data Center

Free, downloadable datasets built from the NHTSA owner-complaint and recall record — the derived layer (failure-mileage quartiles, recall gaps, per-family density) you can't get from the raw API. CSV and JSON, refreshed every week. Use them anywhere with credit.

License: CC BY 4.0 — free to use, republish, and remix with attribution.
Cite as: ProblemsByVin (https://problemsbyvin.com)
Source: US NHTSA owner complaints + recalls (public record), MY 2005–2025 · Updated: 2026-05-31
NHTSA complaints are unverified consumer reports. Counts of crashes, fires, injuries, and deaths reflect what owners alleged when filing — not adjudicated incidents.

Failure-Mileage Distribution

9,187 failure patterns

For every vehicle-and-component failure pattern with enough mileage-bearing complaints, this dataset gives the mileage at which owners report the failure, as quartiles (25th percentile, median, 75th percentile). It is built from NHTSA’s ODI flat file, which carries odometer readings that the JSON API omits. To our knowledge, no other public source aggregates complaint mileage at this resolution.

Columns & methodology

How it's built: Each row is one (vehicle, component-category) cluster. Mileage is parsed from the NHTSA ODI bulk export and validated to the 1,000–400,000 mi band (data-entry artifacts outside that range are dropped). Quartiles are published only when a cluster has ≥8 mileage-bearing complaints. Complaints are unverified consumer reports.

Column: Meaning
year: Model year
make: Manufacturer
model: Model
component: Failure category (NHTSA component taxonomy)
complaints: Total NHTSA complaints in this cluster
mileage_samples: Complaints carrying a valid odometer reading
mileage_p25: 25th-percentile failure mileage
mileage_median: Median failure mileage
mileage_p75: 75th-percentile failure mileage
est_repair_usd: Est. independent-shop repair cost (USD)
severity: critical | severe | moderate
url: Source page on ProblemsByVin
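
As a rough illustration of the method described above, the sketch below keeps only odometer readings inside the 1,000–400,000 mi band and returns quartiles only when at least 8 valid readings remain. The function name and the choice of Python's "inclusive" quantile interpolation are assumptions; this page does not state which percentile method is actually used.

```python
from statistics import quantiles

MIN_MILES, MAX_MILES = 1_000, 400_000  # validation band stated in the methodology
MIN_SAMPLES = 8                        # publication threshold stated in the methodology


def mileage_quartiles(readings):
    """Return (p25, median, p75), or None when too few valid readings remain.

    `readings` is an iterable of odometer values; None marks a complaint
    that carried no mileage. Hypothetical helper, not the site's own code.
    """
    valid = [m for m in readings if m is not None and MIN_MILES <= m <= MAX_MILES]
    if len(valid) < MIN_SAMPLES:
        return None
    # Assumed interpolation method; other methods shift the cut points slightly.
    p25, median, p75 = quantiles(valid, n=4, method="inclusive")
    return p25, median, p75
```

In this sketch a cluster with seven usable readings yields None, which corresponds to a pattern that would not appear in the published file.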

Post-Warranty Failure Patterns

4,782 post-warranty failure patterns

Failure patterns whose median reported mileage is past the typical 60,000-mile powertrain warranty, i.e. defects for which at least half of the reported failure mileages fall after coverage typically lapses, leaving owners to pay. Sorted by complaint volume.

Columns & methodology

How it's built: Subset of the failure-mileage dataset where the median failure mileage exceeds 60,000 mi (the most common US powertrain term, 5yr/60k) and the cluster has ≥8 mileage-bearing complaints. "miles_past_warranty" is median minus 60,000. Complaints are unverified consumer reports.

Column: Meaning
year: Model year
make: Manufacturer
model: Model
component: Failure category
complaints: Total NHTSA complaints in this cluster
mileage_median: Median failure mileage
miles_past_warranty: Median minus 60,000-mi warranty
mileage_samples: Complaints carrying a valid odometer reading
est_repair_usd: Est. independent-shop repair cost (USD)
severity: critical | severe | moderate
url: Source page on ProblemsByVin
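
The subset rule above can be sketched as a filter over rows of the failure-mileage dataset. The column names follow the table; the function name and the in-memory list-of-dicts representation are assumptions made for illustration.

```python
WARRANTY_MILES = 60_000  # 5yr/60k powertrain term used as the cutoff
MIN_SAMPLES = 8          # same mileage-bearing threshold as the parent dataset


def post_warranty_rows(rows):
    """Keep clusters whose median failure mileage exceeds the warranty cutoff.

    `rows` is a list of dicts keyed by the failure-mileage column names.
    Hypothetical helper, not the site's own code.
    """
    kept = []
    for row in rows:
        median = row.get("mileage_median")
        if median is None or row.get("mileage_samples", 0) < MIN_SAMPLES:
            continue
        if median > WARRANTY_MILES:
            kept.append({**row, "miles_past_warranty": median - WARRANTY_MILES})
    # The published file is sorted by complaint volume, highest first.
    kept.sort(key=lambda r: r["complaints"], reverse=True)
    return kept
```

A cluster with a median of 75,000 mi would carry miles_past_warranty = 15,000, while a median of exactly 60,000 mi is excluded because the rule is "exceeds".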

Fire / Crash / Injury Complaint Index

13,539 harm-flagged failure patterns

Failure patterns where owners alleged a fire, crash, injury, or death in their NHTSA complaint, ranked by alleged deaths then fires. These are UNVERIFIED consumer reports filed with NHTSA, not confirmed incidents — they indicate where owners report the most severe outcomes, not adjudicated causation.

Columns & methodology

How it's built: Each row is a (vehicle, component-category) cluster carrying ≥1 complaint flagged by NHTSA for crash, fire, injury, or death. Counts are sums of those NHTSA flags across the cluster’s complaints. NHTSA complaints are submitted by consumers and are not verified by NHTSA or ProblemsByVin; treat as allegations.

Column: Meaning
year: Model year
make: Manufacturer
model: Model
component: Failure category
complaints: Total NHTSA complaints in this cluster
alleged_deaths: Complaints alleging a death (unverified)
alleged_fires: Complaints alleging a fire (unverified)
alleged_injuries: Complaints alleging an injury (unverified)
alleged_crashes: Complaints alleging a crash (unverified)
severity: critical | severe | moderate
url: Source page on ProblemsByVin
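
A minimal sketch of the aggregation described above: count flagged complaints per (vehicle, component) cluster, drop clusters with no flags, and rank by alleged deaths and then fires. The per-complaint field names ("death", "fire", "injury", "crash") and the assumption that each flag is a simple yes/no per complaint are hypothetical; the actual NHTSA record layout may differ.

```python
from collections import defaultdict

# Hypothetical per-complaint flag name -> published column name.
FLAG_COLUMNS = {
    "death": "alleged_deaths",
    "fire": "alleged_fires",
    "injury": "alleged_injuries",
    "crash": "alleged_crashes",
}


def harm_index(complaints):
    """Aggregate unverified harm flags per cluster and rank the clusters."""
    clusters = defaultdict(
        lambda: {"complaints": 0, **{col: 0 for col in FLAG_COLUMNS.values()}}
    )
    for c in complaints:
        key = (c["year"], c["make"], c["model"], c["component"])
        clusters[key]["complaints"] += 1
        for flag, col in FLAG_COLUMNS.items():
            if c.get(flag):
                clusters[key][col] += 1

    rows = []
    for (year, make, model, component), agg in clusters.items():
        # A cluster qualifies only if at least one complaint carries a flag.
        if any(agg[col] for col in FLAG_COLUMNS.values()):
            rows.append({"year": year, "make": make, "model": model,
                         "component": component, **agg})
    rows.sort(key=lambda r: (r["alleged_deaths"], r["alleged_fires"]), reverse=True)
    return rows
```

The counts remain allegations: the sketch sums what owners reported and makes no attempt to verify any of it.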

Engine-Family Complaint Density

21 engine families

Curated problematic engine families (documented defect patterns — class actions, NHTSA investigations, or 100+ fleet complaints), with total NHTSA complaints, recalls, and complaints normalized per model-year entry. Density lets you compare families of different production volumes.

Columns & methodology

How it's built: Families are the editorial set powering /engines/. For each family we sum complaints/recalls/harm flags across every matching (year, make, model) in the DB. "complaints_per_modelyear" = total complaints ÷ model-year entries present. Complaints are unverified consumer reports.

Column: Meaning
engine_family: Engine family name
manufacturer: Manufacturer
displacement: Displacement
configuration: Engine configuration
years: Production years
modelyear_entries: Matching model-year entries in DB
complaints: Total NHTSA complaints across the family
complaints_per_modelyear: Complaints ÷ model-year entries
recalls: Total recalls across the family
alleged_fires: Complaints alleging fire (unverified)
alleged_deaths: Complaints alleging death (unverified)
url: Source page on ProblemsByVin
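
The density figure is simple arithmetic, sketched below for one family. The function name and input shape (one dict per matching model-year entry) are assumptions for illustration.

```python
def family_density(entries):
    """Sum complaints over a family's model-year entries and normalize.

    `entries` holds one dict per matching (year, make, model) row, each with
    a "complaints" count. Hypothetical helper, not the site's own code.
    """
    total = sum(e["complaints"] for e in entries)
    n = len(entries)
    return {
        "modelyear_entries": n,
        "complaints": total,
        # Undefined when the family matches nothing in the database.
        "complaints_per_modelyear": total / n if n else None,
    }
```

For example, a family with 300 complaints spread over 3 model-year entries has a density of 100, the same as a family with 1,000 complaints over 10 entries, which is what makes families of different production spans comparable.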

Worst Transmissions 2005–2025

10 transmission families

Curated problematic transmission families (DCT, CVT, and conventional automatics with documented defect patterns), ranked by total NHTSA owner complaints across every vehicle that used the unit.

Columns & methodology

How it's built: Families are the editorial set powering /transmissions/. NHTSA does not expose a standalone "transmission" complaint category — transmission complaints bucket under "powertrain" — so this dataset aggregates by curated family (matching year/make/model lists), not a category filter. Complaints are unverified consumer reports.

Column: Meaning
transmission_family: Transmission family name
manufacturer: Manufacturer
type: Transmission type
years: Production years
modelyear_entries: Matching model-year entries in DB
complaints: Total NHTSA complaints across the family
complaints_per_modelyear: Complaints ÷ model-year entries
recalls: Total recalls across the family
alleged_crashes: Complaints alleging crash (unverified)
url: Source page on ProblemsByVin
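
Because there is no category filter to lean on, aggregation has to go through explicit membership lists. The sketch below shows one way that could look; the data shapes and any family names passed in are placeholders, not the site's actual curated set.

```python
def rank_families(families, vehicles):
    """Rank curated families by total complaints across matching vehicles.

    `families` maps a family name to a set of (year, make, model) tuples;
    `vehicles` is a list of dicts with year, make, model, and complaints.
    Hypothetical helper, not the site's own code.
    """
    rows = []
    for name, members in families.items():
        matched = [
            v for v in vehicles
            if (v["year"], v["make"], v["model"]) in members
        ]
        rows.append({
            "transmission_family": name,
            "modelyear_entries": len(matched),
            "complaints": sum(v["complaints"] for v in matched),
        })
    rows.sort(key=lambda r: r["complaints"], reverse=True)
    return rows
```

One consequence of matching on year/make/model is that every complaint for a listed vehicle counts toward the family, whatever component the owner reported.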

Recall-Gap Index

1,035 unrecalled failure patterns

High-complaint failure patterns that have NO corresponding NHTSA recall — defects owners report in volume that regulators never forced a fix for. Every row is cross-checked against the live NHTSA recalls API, so vehicles that WERE recalled for the component (even when our local DB missed it) are excluded.

Columns & methodology

How it's built: Candidates are (vehicle, component) clusters with no recall for that component in our DB and ≥ the complaint threshold. Each candidate vehicle is then queried against the live NHTSA recalls-by-vehicle API; any cluster whose component is in fact covered by a real NHTSA recall is REMOVED. Survivors are genuine recall gaps as of the verification date. Complaints are unverified consumer reports; absence of a recall is per NHTSA records on the verification date.

Column: Meaning
year: Model year
make: Manufacturer
model: Model
component: Failure category with no NHTSA recall
complaints: Total NHTSA complaints in this cluster
mileage_median: Median failure mileage (if available)
alleged_deaths: Complaints alleging death (unverified)
alleged_fires: Complaints alleging fire (unverified)
severity: critical | severe | moderate
url: Source page on ProblemsByVin
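
The two-stage filter above can be sketched as follows. The live NHTSA lookup is represented by a caller-supplied function rather than a real HTTP call, and the complaint threshold is a required parameter because this page does not state its value; all names here are hypothetical.

```python
def recall_gaps(clusters, threshold, locally_recalled, live_has_recall):
    """Return clusters that pass both the local and the live recall checks.

    clusters:          list of dicts with year, make, model, component, complaints
    threshold:         minimum complaint count (value not published on this page)
    locally_recalled:  set of (year, make, model, component) with a recall in the local DB
    live_has_recall:   callable(year, make, model, component) -> bool, standing in
                       for a query against the live NHTSA recalls-by-vehicle API
    """
    gaps = []
    for c in clusters:
        key = (c["year"], c["make"], c["model"], c["component"])
        # Stage 1: candidate selection from the local database.
        if c["complaints"] < threshold or key in locally_recalled:
            continue
        # Stage 2: remove anything the live record shows as recalled.
        if live_has_recall(*key):
            continue
        gaps.append(c)
    return gaps
```

Passing the lookup in as a function keeps the second stage testable offline and makes clear that the result is only as current as the date the live check was run.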

NHTSA Defect Investigations

584 investigations

Every NHTSA Office of Defects Investigation (ODI) action touching a tracked vehicle, open and historical. Investigations are a leading indicator of a recall: a Preliminary Evaluation (PE) can escalate to an Engineering Analysis (EA) and then to a recall. Rows include the recall campaign number for investigations that became recalls.

Columns & methodology

How it's built: Parsed from NHTSA’s ODI investigations flat file (FLAT_INV) and matched to our vehicles by year/make/model. One row per NHTSA action number. "status" is open when the federal record carries no close date; "recall_campaign" holds the NHTSA campaign number when the investigation led to a recall. An investigation is NHTSA examining a pattern — not a confirmed defect.

Column: Meaning
action_number: NHTSA investigation ID (prefix = type)
type: PE | EA | RQ | DP | AQ (investigation stage)
status: open | closed, per the NHTSA record
make: Manufacturer (blank if it spans multiple makes)
model: Model (blank if it spans multiple models)
year_start: Earliest affected model year in our fleet
year_end: Latest affected model year in our fleet
component: NHTSA component under investigation
opened: Investigation open date
closed: Close date (blank if open)
recall_campaign: NHTSA recall campaign # if it led to a recall
nhtsa_url: Official NHTSA record
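
The "type" and "status" columns follow mechanically from the action number and the close date, as sketched below. The assumption that the type is exactly the first two characters of the action number is inferred from the "prefix = type" note in the table; the function name is hypothetical.

```python
KNOWN_TYPES = {"PE", "EA", "RQ", "DP", "AQ"}


def classify_investigation(action_number, closed):
    """Derive the type and status columns for one investigation row.

    `closed` is the close date as found in the record, or an empty
    string / None when the record carries no close date.
    """
    prefix = action_number[:2].upper()  # assumed two-letter type prefix
    return {
        "type": prefix if prefix in KNOWN_TYPES else None,
        "status": "closed" if closed else "open",
    }
```

Under these assumptions, a row with a PE prefix and a blank close date comes out as type "PE" and status "open".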

Writing about this?

Reporters and researchers: we'll pull a custom cut — by make, model year, state, component, or failure-mileage band — and our reliability analyst can speak to methodology on the record. Charts on this site are free to embed with attribution.

Request a custom dataset or analyst quote ›

Prefer the findings written up? See the reliability statistics — headline numbers from these same datasets. All datasets derive from NHTSA public-record complaints and recalls; methodology for each is documented inline above. Figures are computed live from the database snapshot dated 2026-05-31.