Data Cloud Identity Resolution Architecture
Most Data Cloud implementations fail at identity resolution. Not because the feature doesn’t work. Because architects treat it like a configuration exercise instead of an architectural decision that determines whether the entire platform delivers value.
The Data Cloud identity resolution architecture you build in the first 30 days defines whether you get a unified customer view or a fragmented mess that takes six months to fix. The difference isn’t the matching rules. It’s understanding what identity resolution actually does at the data model level and designing for the conflicts that emerge at scale.
The Identity Resolution Problem
Identity resolution solves one problem: determining when records from different systems represent the same person. A customer who buys in-store, browses online, and calls support generates three records across three systems. Identity resolution creates a Unified Individual that ties them together.
In practice, this means:
- Matching rules that compare fields across Data Model Objects (DMOs)
- A reconciliation strategy that decides which field values win when sources conflict
- A Unified Individual record that serves as the single source of truth
- Downstream segments and calculated insights that reference the unified profile
The architecture that works at 10,000 records breaks at 1 million. Not because of performance. Because edge cases that represent 0.1% of your data become 1,000 records that break downstream processes.
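To make those moving parts concrete, here is a minimal sketch of the shapes involved, using hypothetical Python dataclasses rather than Data Cloud’s actual objects:

```python
from dataclasses import dataclass, field

# Hypothetical shapes only -- Data Cloud manages these objects internally.
@dataclass
class SourceRecord:
    source_system: str            # e.g. "pos", "ecommerce", "support"
    record_id: str
    email: str | None = None
    phone: str | None = None
    loyalty_id: str | None = None

@dataclass
class UnifiedIndividual:
    unified_id: str
    source_records: list[SourceRecord] = field(default_factory=list)
    # Reconciled field values chosen from the linked source records
    email: str | None = None
    phone: str | None = None

# One shopper, three systems, three records -- identity resolution's job
# is to link them under a single Unified Individual.
records = [
    SourceRecord("pos", "P-1", loyalty_id="L-42", phone="+14155550101"),
    SourceRecord("ecommerce", "E-9", email="jane@example.com", loyalty_id="L-42"),
    SourceRecord("support", "S-3", email="jane@example.com", phone="+14155550101"),
]
```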
Matching Rules: The Core Architecture Decision
Identity resolution uses matching rules to compare records. A rule defines which DMOs to compare, which fields to match on, and how exact the match needs to be.
The naive approach: match on email across all DMOs. That works until you encounter:
- Shared family emails (multiple people, one address)
- Corporate emails (multiple employees at same company)
- Typos in one system but not others
- Email changes over time (maiden name to married name)
The architecture that survives builds matching rules in layers:
Exact match layer — High-confidence matches on unique identifiers. Loyalty program ID, customer number, mobile phone. These create the core of your Unified Individual graph.
Fuzzy match layer — Probabilistic matches on name + address, name + phone, email domain + last name. These catch records where the exact identifier is missing but the pattern is clear.
Manual review layer — Potential matches flagged for human verification. This is not a failure mode. At enterprise scale, 2-5% of matches require judgment calls. The architecture that pretends otherwise creates silent data corruption.
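Expressed as data, a layered ruleset might look like the sketch below. The rule names, fields, and thresholds are illustrative, not Data Cloud’s native match-rule configuration format:

```python
# Illustrative ruleset definition -- not Data Cloud's actual match-rule syntax.
MATCH_RULESET = [
    # Exact layer: high-confidence, unique identifiers. Evaluated first.
    {"name": "loyalty_id_exact", "layer": "exact", "fields": ["loyalty_id"], "method": "exact"},
    {"name": "customer_number_exact", "layer": "exact", "fields": ["customer_number"], "method": "exact"},
    {"name": "mobile_phone_exact", "layer": "exact", "fields": ["phone_e164"], "method": "exact"},

    # Fuzzy layer: probabilistic patterns, only consulted if no exact rule fires.
    {"name": "name_plus_address", "layer": "fuzzy", "fields": ["last_name", "postal_code", "street"],
     "method": "fuzzy", "threshold": 0.92},
    {"name": "name_plus_phone", "layer": "fuzzy", "fields": ["last_name", "phone_e164"],
     "method": "fuzzy", "threshold": 0.90},

    # Manual review layer: anything scoring in the grey zone gets queued, not silently merged.
    {"name": "grey_zone_review", "layer": "review", "score_range": (0.75, 0.90)},
]
```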
In orgs with 500,000+ customer records, the typical pattern is:
- 70-80% matched via exact rules
- 15-20% matched via fuzzy rules
- 5-10% requiring manual review or remaining unmatched
The critical architectural decision: do you optimize for recall (matching as many records as possible) or precision (only matching when you’re certain)? Most teams optimize for recall and regret it. A false positive match — merging two different people — corrupts every downstream segment, calculated insight, and activation. A false negative — failing to match the same person — is visible and fixable.
Optimize for precision. Build monitoring to surface unmatched records. Fix them deliberately.
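In code terms, optimizing for precision means auto-merging only above a high confidence bar and routing everything ambiguous to review. The thresholds below are illustrative, not Data Cloud defaults:

```python
# Illustrative thresholds -- tune against your own match-confidence distribution.
AUTO_MERGE_THRESHOLD = 0.95   # merge without human involvement
REVIEW_THRESHOLD = 0.75       # below this, treat as a non-match

def match_decision(confidence: float) -> str:
    """Precision-first policy: never auto-merge a borderline match."""
    if confidence >= AUTO_MERGE_THRESHOLD:
        return "merge"
    if confidence >= REVIEW_THRESHOLD:
        return "manual_review"   # visible, fixable
    return "no_match"            # also visible and fixable; a bad merge is not
```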
Reconciliation Rules: What Happens When Sources Conflict
Matching rules determine which records belong together. Reconciliation rules determine which field values win when sources disagree.
The e-commerce system says the customer’s email is john.smith@gmail.com. The CRM says it’s john.smith@company.com. Both records describe the same person. Which email goes into the Unified Individual?
Data Cloud supports four reconciliation strategies:
- Most recent — Use the value from the most recently updated source
- Source priority — Define a hierarchy of systems (CRM beats e-commerce beats support)
- Most frequent — Use the value that appears most often across sources
- Custom — Write Apex to implement your own logic
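To see how the first three strategies behave on the email conflict above, here is a plain-Python sketch. The tagged values, timestamps, and source names are illustrative; Data Cloud resolves this internally from DMO metadata:

```python
from collections import Counter
from datetime import datetime

# Candidate values for one field, tagged with source system and last-update time.
candidates = [
    {"value": "john.smith@gmail.com",   "source": "ecommerce", "updated": datetime(2024, 6, 1)},
    {"value": "john.smith@company.com", "source": "crm",       "updated": datetime(2024, 3, 15)},
]

SOURCE_PRIORITY = ["crm", "ecommerce", "support"]  # lower index wins

def most_recent(values):
    return max(values, key=lambda v: v["updated"])["value"]

def source_priority(values):
    return min(values, key=lambda v: SOURCE_PRIORITY.index(v["source"]))["value"]

def most_frequent(values):
    return Counter(v["value"] for v in values).most_common(1)[0][0]

print(most_recent(candidates))      # john.smith@gmail.com   (e-commerce updated last)
print(source_priority(candidates))  # john.smith@company.com (CRM outranks e-commerce)
```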
Most teams default to “most recent” because it sounds reasonable. That works until:
- A batch import from a legacy system overwrites all current values with stale data
- A mobile app with spotty connectivity creates phantom updates
- A data quality issue in one system propagates to the unified profile
The architecture that works: source priority for critical fields (email, phone, address), most recent for behavioral fields (last purchase date, last interaction), custom logic for fields where business rules matter (customer tier, lifecycle stage).
Document your reconciliation strategy in a decision matrix. Every field in your Unified Individual should have a defined rule. If you can’t articulate why a field uses a specific strategy, you haven’t thought through the architecture.
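That decision matrix can live in your architecture doc or as checked-in config. A hypothetical sketch, with illustrative field names and rationale:

```python
# Illustrative reconciliation decision matrix -- one entry per Unified Individual field.
# A field without an entry is an architecture gap, not a default.
RECONCILIATION_MATRIX = {
    "email":                 {"strategy": "source_priority", "why": "CRM is the system of record"},
    "phone":                 {"strategy": "source_priority", "why": "CRM is the system of record"},
    "mailing_address":       {"strategy": "source_priority", "why": "billing accuracy over freshness"},
    "last_purchase_date":    {"strategy": "most_recent",     "why": "behavioral, freshness wins"},
    "last_interaction_date": {"strategy": "most_recent",     "why": "behavioral, freshness wins"},
    "customer_tier":         {"strategy": "custom",          "why": "derived from business rules, not one source"},
    "lifecycle_stage":       {"strategy": "custom",          "why": "derived from business rules, not one source"},
}

UNIFIED_INDIVIDUAL_FIELDS = [
    "email", "phone", "mailing_address", "last_purchase_date",
    "last_interaction_date", "customer_tier", "lifecycle_stage",
]

# Fail loudly if any field lacks a documented strategy.
missing = [f for f in UNIFIED_INDIVIDUAL_FIELDS if f not in RECONCILIATION_MATRIX]
assert not missing, f"No reconciliation strategy defined for: {missing}"
```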
Data Model Objects and the Unified Individual
Identity resolution operates on Data Model Objects, not raw Data Streams. This is the architectural detail most teams miss.
Data Streams ingest raw data from source systems. DMOs normalize that data into a standard schema. Identity resolution compares DMOs to create Unified Individuals.
The critical implication: your DMO design determines what identity resolution can match on. If you map email to different field names across DMOs, identity resolution can’t compare them. If you normalize phone numbers inconsistently (some with country codes, some without), fuzzy matching fails.
The pattern that works:
- Map all source systems to a consistent set of DMOs (Individual, Contact Point Email, Contact Point Phone, Contact Point Address)
- Normalize field formats during the DMO mapping (phone numbers to E.164, emails to lowercase, addresses to USPS standard), as sketched after this list
- Build matching rules that reference the normalized DMO fields
- Create the Unified Individual as a materialized view over matched DMOs
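A simplified sketch of that normalization step; the helpers are stand-ins, and a production build would lean on a dedicated phone and address library:

```python
import re

def normalize_email(raw: str) -> str:
    """Lowercase and trim so exact email matches don't miss on case or whitespace."""
    return raw.strip().lower()

def normalize_phone(raw: str, default_country_code: str = "1") -> str:
    """Very simplified E.164 normalization -- assumes NANP numbers when no country
    code is present. Use a dedicated library (e.g. phonenumbers) in practice."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) == 10:                      # no country code supplied
        digits = default_country_code + digits
    return "+" + digits

print(normalize_email("  John.Smith@Company.COM "))  # john.smith@company.com
print(normalize_phone("(415) 555-0101"))             # +14155550101
```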
At 100,000+ record scale, the typical enterprise has:
- 3-5 core DMOs for identity resolution (Individual, email, phone, address, loyalty ID)
- 10-15 behavioral DMOs that reference the Unified Individual (purchases, interactions, support cases)
- 20-30 calculated insights derived from the unified profile
The Unified Individual becomes the join key for everything downstream. Segments reference it. Calculated insights aggregate over it. Activations target it. If your identity resolution architecture is wrong, every downstream process inherits the error.
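As a simplified illustration of the unified profile acting as the join key, here is what a calculated insight such as lifetime purchase value reduces to, in plain Python rather than Data Cloud’s actual calculated-insight definitions:

```python
from collections import defaultdict

# Behavioral records keyed by the unified profile, not by source-system IDs.
# Hypothetical shape -- Data Cloud expresses this as a calculated insight.
purchases = [
    {"unified_individual_id": "UI-001", "amount": 120.00},
    {"unified_individual_id": "UI-001", "amount": 35.50},
    {"unified_individual_id": "UI-002", "amount": 89.99},
]

lifetime_value: dict[str, float] = defaultdict(float)
for p in purchases:
    lifetime_value[p["unified_individual_id"]] += p["amount"]

# If UI-001 is really two different people merged by a false positive,
# this number, and every segment built on it, is silently wrong.
print(lifetime_value["UI-001"])  # 155.5
```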
The Patterns That Prevent Data Chaos
Three architectural patterns separate implementations that scale from implementations that collapse:
Pattern 1: Separate identity resolution from data quality. Do not use identity resolution to fix bad data. If your source systems have inconsistent formats, fix them at the Data Stream or DMO mapping layer. Identity resolution should operate on clean, normalized data. Using fuzzy matching to compensate for data quality issues creates false positives that corrupt your unified profile.
Pattern 2: Version your matching rules. When you change a matching rule, you change which records are considered the same person. That retroactively affects every historical segment, calculated insight, and activation. The architecture that works: version your rulesets, run the new version in parallel with the old version, compare results, cutover deliberately. Data Cloud doesn’t enforce this. You have to build it into your change management process.
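Data Cloud won’t diff rulesets for you, but the comparison itself is simple: export which source records each version groups together and quantify the churn before cutover. A sketch, with hypothetical assignment data:

```python
# Each ruleset version's output reduced to: source record ID -> unified individual ID.
# Hypothetical data -- in practice you'd export this from parallel resolution runs.
v1_assignments = {"E-9": "UI-001", "S-3": "UI-001", "P-1": "UI-002"}
v2_assignments = {"E-9": "UI-001", "S-3": "UI-001", "P-1": "UI-001"}

def grouping(assignments: dict[str, str]) -> set[frozenset]:
    """Collapse assignments into sets of source records that share a profile."""
    groups: dict[str, set] = {}
    for record_id, unified_id in assignments.items():
        groups.setdefault(unified_id, set()).add(record_id)
    return {frozenset(g) for g in groups.values()}

changed = grouping(v1_assignments) ^ grouping(v2_assignments)  # symmetric difference
print(f"{len(changed)} groupings differ between rule versions")
```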
Pattern 3: Monitor match rates and conflicts. Build dashboards that show: percentage of records matched, distribution of match confidence scores, reconciliation conflicts by field, unmatched records by source system. These metrics tell you when your architecture is degrading. A sudden drop in match rate means a source system changed its data format. A spike in reconciliation conflicts means two systems are fighting over the same field.
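A sketch of the match-rate and unmatched-by-source metrics feeding such a dashboard; the record shapes are hypothetical, and in practice you would query identity resolution’s output:

```python
# Hypothetical resolution output: each source record with its match outcome.
resolved = [
    {"source": "crm",       "matched": True,  "confidence": 0.99},
    {"source": "ecommerce", "matched": True,  "confidence": 0.88},
    {"source": "ecommerce", "matched": False, "confidence": None},
    {"source": "support",   "matched": True,  "confidence": 0.97},
]

total = len(resolved)
match_rate = sum(r["matched"] for r in resolved) / total

unmatched_by_source: dict[str, int] = {}
for r in resolved:
    if not r["matched"]:
        unmatched_by_source[r["source"]] = unmatched_by_source.get(r["source"], 0) + 1

print(f"match rate: {match_rate:.1%}")        # a sudden drop => a source changed its format
print(f"unmatched by source: {unmatched_by_source}")
```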
In practice, at enterprise scale, you should expect:
- 5-10% of records requiring manual review in the first 90 days
- 1-2% ongoing as new edge cases emerge
- Quarterly tuning of matching rules as data patterns evolve
Identity resolution is not a one-time configuration. It’s an architectural layer that requires ongoing monitoring and refinement.
Key Takeaways
- Identity resolution architecture determines whether Data Cloud delivers value or creates data chaos — treat it as a core architectural decision, not a configuration task
- Build matching rules in layers: exact matches for high-confidence identifiers, fuzzy matches for probabilistic patterns, manual review for edge cases
- Optimize for precision over recall — false positive matches corrupt every downstream process, false negatives are visible and fixable
- Reconciliation strategy matters: source priority for critical fields, most recent for behavioral fields, custom logic where business rules apply
- DMO design determines what identity resolution can match on — normalize field formats at the mapping layer, not in matching rules
- Version your matching rules, monitor match rates and conflicts, expect 1-2% ongoing manual review at enterprise scale