Organise

Schema Normalisation

One record shape, no matter how many source systems.

Maps incoming records from any source into your canonical schema — field renames, type coercions, unit conversions, code-value reconciliation. Every downstream consumer sees the same shape regardless of where the data came from. Without this, each integration re-implements the same translation and diverges over time.

Shape

Operational dimensions

No human in loop

Runs without a person in the path.

Event-triggered

Fires when an upstream condition occurs.

Medium data gravity

Holds working state that compounds over runs.

Read-only inbound

Consumes external data; does not write back.

Inputs

source records (heterogeneous schemas, types, units)
source-to-canonical field mapping specification
target canonical schema definition
value-code reconciliation tables (e.g. country codes, status enums)

Outputs

normalised records conforming to canonical schema
mapping / coercion exception log
field coverage report per source

Mechanism

Maps heterogeneous source records into a canonical target schema — aligning field names, types, units, and value vocabularies so downstream consumers see one shape.

Why this is a primitive

Cannot be decomposed: the map-fields → coerce-types → align-units → reconcile-value-codes operation is the single act of conforming a record. It is upstream of entity-resolution (you normalise before you can match) and upstream of graph-instantiation (the graph wants conformant inputs), but does neither itself. Strip it and consumers wrangle source-by-source.

Where it shows up

Multi-brand retailer — normalises product records from 6 supplier feeds (different field names, units, category codes) into a single canonical product schema before entity resolution

Healthcare network — maps HL7 v2, HL7 FHIR, and proprietary EHR exports into one patient-encounter schema so analytics pipelines see a single shape

Fintech — reconciles transaction records from payment processors with different currency, date-format, and status-code conventions before downstream aggregation

HR platform — aligns employee records from Workday, BambooHR, and legacy HRIS into one canonical people schema for workforce analytics

Related primitives

Composes with

Controlled Vocabulary Authoring →Entity Resolution & Record Linkage →Knowledge Graph Instantiation →Reference & Citation Resolution →Classification Application →Data Asset Inventory & Lineage →

Alternative to

Entity Resolution & Record Linkage →