Blog

Resolving the quality data fragmentation problem in high-complexity automotive manufacturing

Case study

·

Defect Ontology ∙ PFMEA Alignment ∙ 3MIS Traceability ∙ Rework Analytics ∙ PDI Optimisation ∙ Stack: Neo4j, Go, React

Executive summary

A premium automotive OEM faced a structural quality data problem common across the industry: the same defect was recorded differently in every system that touched it, making cross-domain analysis effectively impossible.

Factory rework data, warranty claims, PDI inspection plans, PFMEA risk assessments, and quality notifications each described the same physical failure in incompatible vocabularies, creating silos that left root cause analysis slow, incomplete, and manual.

The platform described in this case study was built to resolve that problem at its foundation. Not by integrating data feeds, but by building a canonical defect identity layer (a Defect Ontology) that recognises the same defect across all source systems and enables analytics that could not exist before.

Headline outcomes

  • 12 redundant PDI inspection items identified. Capacity recovered.

  • 4 high-RPN inspection gaps previously undetected. Risk surfaced.

  • 1 traversal to trace warranty claims to factory PFMEA. Was multi-day.

  • 22 rework items identified as ineffective. Process signal.

  • 6+ production gateways with rework planning mismatches. Planning gap.

  • 92% VOC Pulse. Voice of customer quality signal at launch.

Manufacturing a premium vehicle involves dozens of production gateways, thousands of part interactions, and a quality management stack accumulated system by system over years. Each system is individually fit for purpose. Collectively, they describe the same physical reality in mutually incomprehensible terms.

Quality teams observed a pattern with no clean analytical explanation: 3MIS warranty claims were tracing back to defect modes the factory had recorded, inspected, and in some cases acted on, yet the connection between these events had never been formally drawn.

Consequences and operational impact

  • Rework planned in as fixed cost. No mechanism to distinguish avoidable from necessary rework. Gateway budgets set without root-cause visibility.

  • PDI checklists misaligned with risk. Over-inspection on clean components. Under-inspection where factory escapes were occurring.

  • 3MIS claims not traced to production. Lag of months between warranty event and PFMEA update. Current run unaffected.

  • PFMEA drifting from reality. High-RPN items with no detection. Low-RPN items heavily controlled. A static artefact, not a living document.

Cross-domain trace: warranty to factory, before the platform

3MIS warranty claim → manual export and VLOOKUP → institutional knowledge → multi-day effort → PFMEA record (maybe).

The approach: a horizontal data fabric built on Defect Ontology

The platform was designed around a single architectural conviction: the right level of abstraction is the defect, not the system. If the identity of that failure can be resolved canonically, every downstream analytic becomes possible.

A Defect Ontology, a semantic identity layer, sits above all source systems and creates a unified vocabulary for defect classes. It resolves records from each system to a canonical identity: the Cluster.

How one physical defect (paint surface contamination) appears across systems.

  • PFMEA. ProcessStep + FailureMode + Severity / Occurrence / Detection = RPN.

  • Factory (defect logging / rework). Material + DefectGroup + Symptom + Gateway + ReworkTime.

  • PDI inspection plans. Part + FaultType + InspectionPlanID + RPN Threshold.

  • Warranty (MIS / 3MIS). CausalPartName + CausalIssue + WarrantyPeriod + Zone.

  • QCR (notifications). Symptom + Damage + Department + Status + Severity.

The cluster: canonical cross-system defect identity

Cluster (Front Bumper / Surface Finish).

  • PFMEA. N failure mode rows.

  • Defect Logging. N rework records.

  • Warranty MIS. N warranty claims

  • PDI Plans. N inspection checkpoints.

  • QCR. N notifications.

Three-stage cross-system identity resolution pipeline

  • Stage 1: Fuzzy string matching. Part and material names.

  • Stage 2: Semantic embedding. Symptom meaning equivalence.

  • Stage 3: Manual curation. Engineer confirms or rejects.

Platform capabilities

Six integrated modules built on the Cluster identity layer. Each surfaces analytics that no single source system could produce alone.

PFMEA Alignment Dashboard: RPN vs inspection coverage

At programme launch, 4 high-RPN failure modes had zero PDI detection coverage, while 12 low-RPN items were over-inspected.

Legend: red = high RPN with no inspection (gap). Green = high RPN with inspection present. Brass = low RPN with redundant inspection. Grey = low RPN with no inspection (acceptable)

PDI / 3MIS Cluster Escape Analysis

For illustrative defect clusters: proportion caught in factory vs escaping to customer. A high escape rate demands a different intervention than a high detection rate.

Legend: green = factory detection (defect logging count). Red = customer escape (MIS / 3MIS count).

Executive KPI Dashboard: programme launch summary

  • 22 rework ineffective items.

  • 12 inspection ineffective items.

  • 8 control plan ineffective items.

  • 12 inspection redundant items.

  • 4 inspection missing (high RPN).

  • 92% VOC Pulse at launch.

Finesse / Rework Analytics: planned vs actual across gateways

Mismatches between planned and actual rework time surface process capability issues and planning assumptions not revisited against recent production data.

Legend: green = planned rework time. Red = actual rework time. Gateways shown: GW01 Body, GW02 Trim, GW03 Paint, GW04 Glazing, GW05 Electrical, GW06 Final, GW07 PDI.

Technical architecture

A production-grade analytical system designed for scale, auditability, and enterprise identity integration. The architecture deliberately separates ontology computation from the query and visualisation layer.

Backend

Go REST API. Zap structured logging. Kubernetes horizontal scaling.

Relational store

MySQL. 50+ table schema covering entity ingestion, denormalised graph tables, match quality metadata.

Graph database

Neo4j. Millisecond Cluster identity traversal across all source systems.

Matching pipeline

AWS Batch. Fuzzy string matching and semantic embedding similarity at scale.

Frontend

React and TypeScript. Recharts. Nivo Sankey. ReactFlow for Quality Navigator.

Identity and observability

Microsoft Entra SSO. DataDog infrastructure and application monitoring.

Data flow: from source systems to insight

Source systems

  • PFMEA risk data

  • Defect logging rework data

  • Warranty MIS claims data

  • PDI plans inspection

  • QCR notifications

AWS Batch Matching Pipeline

Fuzzy matching, semantic embedding, and manual curation

Outcomes and impact

Outcomes drawn from the initial programme deployment. The intent is to characterise the order of magnitude and nature of impact accurately.

Root cause identification time

Warranty cluster investigations dropped from multi-day manual correlation to a single graph traversal in the Quality Navigator. The structural barrier (disconnected vocabularies) was removed by the ontology, not by process change.

PDI capacity recovered

12 redundant inspection items identified. Effort on low-RPN, consistently-clean components released for redeployment to higher-risk areas.

4 high-RPN inspection gaps surfaced

Failure modes with meaningful process risk and no corresponding PDI detection control. Invisible to any single-system view, immediately visible in the PFMEA Alignment Dashboard.

Rework planning accuracy challenged

Planned vs actual mismatches identified at more than 6 gateways at a granularity not previously available. Basis for process improvement or planning recalibration established.

3MIS traceability as standard capability

Any warranty claim can now be traced to its Cluster, factory defect logging history, and PFMEA failure mode in a single interaction.

The compounding effect of the ontology: As more production data flows through the platform, matching confidence improves, Cluster definitions mature, and cross-domain linkages become denser. The analytics available at month twelve are qualitatively richer than those at month one. Not because the platform changed, but because the ontology learned.

Is this problem present in your organisation?

The quality data fragmentation problem is not specific to any particular manufacturer, vehicle architecture, or production volume. It is a structural consequence of how quality management systems have evolved across the industry. If your organisation recognises warranty investigations that require manual data correlation across two or more systems before root cause can be assessed, the problem is likely present.

Copyright © 2026 Exlens AI. All rights reserved. Privacy Policy.

Copyright © 2026 Exlens AI. All rights reserved. Privacy Policy.

Copyright © 2026 Exlens AI. All rights reserved. Privacy Policy.