Organise

Schema Normalisation

One record shape, no matter how many source systems.

Maps incoming records from any source into your canonical schema — field renames, type coercions, unit conversions, code-value reconciliation. Every downstream consumer sees the same shape regardless of where the data came from. Without this, each integration re-implements the same translation and diverges over time.

Shape

feed Acust_nmamt_usddtfeed Bcustomeramountdatefeed Cnamevalue_eurtsnormaliserename · coerce · convertcanonicalcustomer_idamount_usdoccurred_atone shape, every source

Operational dimensions

No human in loop

Runs without a person in the path.

Event-triggered

Fires when an upstream condition occurs.

Medium data gravity

Holds working state that compounds over runs.

Read-only inbound

Consumes external data; does not write back.

Inputs

  • source records (heterogeneous schemas, types, units)
  • source-to-canonical field mapping specification
  • target canonical schema definition
  • value-code reconciliation tables (e.g. country codes, status enums)

Outputs

  • normalised records conforming to canonical schema
  • mapping / coercion exception log
  • field coverage report per source

Mechanism

Maps heterogeneous source records into a canonical target schema — aligning field names, types, units, and value vocabularies so downstream consumers see one shape.

Why this is a primitive

Cannot be decomposed: the map-fields → coerce-types → align-units → reconcile-value-codes operation is the single act of conforming a record. It is upstream of entity-resolution (you normalise before you can match) and upstream of graph-instantiation (the graph wants conformant inputs), but does neither itself. Strip it and consumers wrangle source-by-source.

Where it shows up

Multi-brand retailer — normalises product records from 6 supplier feeds (different field names, units, category codes) into a single canonical product schema before entity resolution
Healthcare network — maps HL7 v2, HL7 FHIR, and proprietary EHR exports into one patient-encounter schema so analytics pipelines see a single shape
Fintech — reconciles transaction records from payment processors with different currency, date-format, and status-code conventions before downstream aggregation
HR platform — aligns employee records from Workday, BambooHR, and legacy HRIS into one canonical people schema for workforce analytics

Related primitives

Tags

structured-databatchautonomousdata-quality

See where it fits.

Primitives are configured into named solution shapes for each client’s domain. The fastest next step is a conversation about which shape fits your problem.

Start a conversation