Session 3 - Data Modeling
Session 3 - Data Modeling
Academy
Get Certified
Session 3 of 8
Data Modeling (aka Harmonization)
First, some logistics
Questions, answers and videos
sfdc.co/DCAcademyGuide
Sneak Peek into Slack
Collaboration channel for Data Cloud Academy Tip: To ask questions you need to initiate a
Channel Name: help-dc-academy-april2024 workflow at the top “DC Academy Queries”
Sneak Peek Into Query Quip
Collaboration Channel for Data Cloud Academy
Ask Your Queries
sfdc.co/DCAcademyAskUrQuery
Forward Looking Statements
This presentation contains forward-looking statements about, among other things, trend analyses and statements regarding future events, future financial performance, anticipated growth, industry prospects,
environmental, social and governance goals, our strategies, expectation or plans regarding our investments, including strategic investments or acquisitions, our beliefs or expectations regarding our competition, our
intentions regarding use of future earnings or dividends, and the expected timing of product releases and enhancements. The achievement or success of the matters covered by such forward-looking statements
involves risks, uncertainties and assumptions. If any such risks or uncertainties materialize or if any of the assumptions prove incorrect, Salesforce’s results could differ materially from the results expressed or implied by
these forward-looking statements. The risks and uncertainties referred to above include those factors discussed in Salesforce’s reports filed from time to time with the Securities and Exchange Commission, including,
but not limited to: our ability to maintain security levels and service performance that meet the expectations of our customers, and the resources and costs required to avoid unanticipated downtime and prevent,
detect and remediate performance degradation and security breaches; the expenses associated with our data centers and third-party infrastructure providers; our ability to secure additional data center capacity; our
reliance on third-party hardware, software and platform providers; uncertainties regarding AI technologies and its integration into our product offerings; the effect of evolving domestic and foreign government
regulations, including those related to the provision of services on the Internet, those related to accessing the Internet, and those addressing data privacy, cross-border data transfers and import and export controls;
current and potential litigation involving us or our industry, including litigation involving acquired entities, and the resolution or settlement thereof; regulatory developments and regulatory investigations involving us or
affecting our industry; our ability to successfully introduce new services and product features, including any efforts to expand our services; the success of our strategy of acquiring or making investments in
complementary businesses, joint ventures, services, technologies and intellectual property rights; our ability to complete, on a timely basis or at all, announced transactions; our ability to realize the benefits from
acquisitions, strategic partnerships, joint ventures and investments, and successfully integrate acquired businesses and technologies; our ability to compete in the markets in which we participate; the success of our
business strategy and our plan to build our business, including our strategy to be a leading provider of enterprise cloud computing applications and platforms; our ability to execute our business plans; our ability to
continue to grow unearned revenue and remaining performance obligation; the pace of change and innovation in enterprise cloud computing services; the seasonal nature of our sales cycles; our ability to limit
customer attrition and costs related to those efforts; the success of our international expansion strategy; the demands on our personnel and infrastructure resulting from significant growth in our customer base and
operations, including as a result of acquisitions; our ability to preserve our workplace culture, including as a result of our decisions regarding our current and future office environments or remote work policies; our
dependency on the development and maintenance of the infrastructure of the Internet; our real estate and office facilities strategy and related costs and uncertainties; fluctuations in, and our ability to predict, our
operating results and cash flows; the variability in our results arising from the accounting for term license revenue products; the performance and fair value of our investments in complementary businesses through our
strategic investment portfolio; the impact of future gains or losses from our strategic investment portfolio, including gains or losses from overall market conditions that may affect the publicly traded companies within
our strategic investment portfolio; our ability to protect our intellectual property rights; our ability to maintain and enhance our brands; the impact of foreign currency exchange rate and interest rate fluctuations on our
results; the valuation of our deferred tax assets and the release of related valuation allowances; the potential availability of additional tax assets in the future; the impact of new accounting pronouncements and tax
laws; uncertainties affecting our ability to estimate our tax rate; uncertainties regarding our tax obligations in connection with potential jurisdictional transfers of intellectual property, including the tax rate, the timing of
transfers and the value of such transferred intellectual property; uncertainties regarding the effect of general economic, business and market conditions, including inflationary pressures, general economic downturn or
recession, market volatility, increasing interest rates, changes in monetary policy and the prospect of a shutdown of the U.S. federal government; the potential impact of financial institution instability; the impact of
geopolitical events, including the ongoing armed conflict in Europe; uncertainties regarding the impact of expensing stock options and other equity awards; the sufficiency of our capital resources; our ability to execute
our share repurchase program; our ability to comply with our debt covenants and lease obligations; the impact of climate change, natural disasters and actual or threatened public health emergencies; expected
benefits of and timing of completion of the restructuring plan and the expected costs and charges of the restructuring plan, including, among other things, the risk that the restructuring costs and charges may be
greater than we anticipate, our restructuring efforts may adversely affect our internal programs and ability to recruit and retain skilled and motivated personnel, our restructuring efforts may be distracting to employees
and management, our restructuring efforts may negatively impact our business operations and reputation with or ability to serve customers, and our restructuring efforts may not generate their intended benefits to the
extent or as quickly as anticipated; and our ability to achieve our aspirations, goals and projections related to our environmental, social and governance initiatives, including our ability to comply with emerging
corporate responsibility regulations.
September 8, 2023
Agenda
Process Overview
Q&A
Your Salesforce Team
✅
Verify access to below links
1
● Data Cloud Consultant Certification: http://sfdc.co/DCCert
● Data Cloud Consultant Exam Guide: http://sfdc.co/DCCertGuide
Start with Prepare for your Salesforce Data Cloud Consultant Credential
✅ 2
● Salesforce Data Cloud Consultant Credential Trailmix
Complete
✅ 3 ● Data Cloud: Ingestion
● Data Cloud : CRM Data Ingestion Bookmark -> Program Guide
sfdc.co/DCAcademyGuide
Recap Homework - Partner Learning Camp
➔ Complete
● Activity: Set Up Your Instance
✅
● Activity: Prepare Your Data
● Activity: Configure Data Ingestion
● Activity: Configure Batch Transforms
➔ Extra Credit
Watch the last Marketing Cloud Moments featuring
Data Cloud & Bundles
https://mcmoments.hubs.vidyard.com/
Ingestion
Exam Tips & Recap
Salesforce Certified Data Cloud Consultant
Allotted Passing
Total Question
Time Score
Data Cloud
Consultant
60 105 min 62 %
Exam Outline
Test takers are strongly advised to complete the Data Cloud Partner Learning Camp Curriculum before
attempting the exam
Salesforce Certified Data Cloud Consultant
Allotted Passing
Total Question
Time Score
Data Cloud
Consultant
60 105 min 62 %
- Gaj Sisodia
The Big Picture: Implementation Themes
Related to the components of Data Cloud
Data Consumption
integrations to source/target systems, etc. business intelligence tools.
Data Preparation
Data Ingestion Segmentation
Set up data streams bringing data into Data Turn mapped data into useful audiences or
Cloud from various supported sources and segments, to understand, target or analyze
applying necessary transformations customers at the unified level.
Any Device
Customer Graph
AI Predictions Activations
Quick Demo!
Data Cloud
Data Mapping
Map disparate data source structures to a common model
Disparate Schemas Data Map Canonical
(Normalized)
Data Model
Contact
Data Cloud
FirstName
LastName
MailingStreet
Phone Individual
firstName
Subscriber lastName
middleName
Email preferredName
Suburb militaryServiceId
birthDateDay
birthDateMonth
birthDateYear
Guest
birthDate
firstName contactPointEmailId
lastName
email mailingAddressId
mobilePhone mobilePhoneId
leadSource
…
Customer Invoices
First_Name
Surname
Address_1
Adddress_2
Basic Profile
Name, DOB, Title Jen Smith Jennifer Smith
Attributes
Communication Communication
Privacy Consent and
Subs. Consent, Subs. Consent,
Authorizations
Channel Consent Channel Consent
Two Physical Objects, One Semantic Object
Your Source: Web Orders
Your Object: Web_Sales_Order_Header
Web_Sales_Order_Header.OrderID SalesOrder.Id
Offline_Sales_Order_Header.Order_Number
Web_Sales_Order_Header.CustomerID SalesOrder.SoldtoCustomer
Offline_Sales_Order.Customer
Web_Sales_Order_Header.Purchase_Date SalesOrder.PurchaseOrderDate
Offline_Sales_OrderHeader.Date_of_Purchase
Web_Sales_Order.Store_ID SalesOrder.SalesStore
Web_Sales_Order_Header.Total_Amount SalesOrder.GrandTotalAmount
Offline_Sales_OrderHeader.Total
Harmonize Data Source Objects into Data Model Object
The web sales order details data set aligns well with the sales order product data model object.
Web_Sales_Order_Details.OrderID_SKU SalesOrderProduct.SalesOrderProduct
Web_Sales_Order_Details.OrderID SalesOrderProduct.SalesOrder
Web_Sales_Order_Details.SKU SalesOrderProduct.Product
Web_Sales_Order.Subtotal SalesOrderProduct.TotalLineAmount
Web_Sales_Order_Header.Number_of_Items SalesOrderProduct.OrderedQuantity
Explore Raw Data to Determine Object Relationships
● OrderID is the common thread, or foreign key, between the two data sets
○ Looking at Web_Sales_Order_Header, we see the list of OrderIDs is unique (no repeats)
● Looking at Web_Sales_Order_Details, we see the list of OrderIDs is not unique.
○ Multiple instances of 36061 and 36062
○ More than one SKU in the order
● We can conclude that Web_Sales_Order_Details is Many:1 with Web_Sales_Order_Header
Web_Sales_Order_Header
Web_Sales_Order_Details
Assign Object Relationships in the Data Model
Understand the relationships between objects in the raw data
● Set these relationships accordingly in the data model layer
● Relationships are defined in the UI based on Data Model fields
● Allows multiple data sources power the same data model object
Web_Sales_Order_ Web_Sales_Order
Details
OrderID OrderID
_Header
Web_Sales_Order_ Web_Sales_Order
Details
OrderID OrderID
_Header
REMEMBER:
You can extend the standard data model with custom
objects, fields and relationships for a hybrid approach.
Data Cloud is not BYO Data Model
Transform Business Domain Models into the Customer 360 Data Model
Before Mapping Your Data
Align Your Data with the Customer 360 Data Model
rt
Take Your Time with this pain
ttons
Do not rush into pushing bu
Data Cloud
● Data Cloud Data Model is normalized
○ Data needs to be normalized before it can be mapped
Party
Contact Point Social Contact Point App Contact Point Email Contact Point Phone
Contact Point Social Id (PK) Contact Point App Id (PK) Contact Point Email Id (PK) Contact Point Phone Id (PK)
Party Party
Common Entities To Know
Not a comprehensive list, rather some of the commonly used entities
Engagement Subject Area Sales Order Subject Area Product Subject Area
Device Application Engagement Sales Order Brand
Email Engagement Sales Order Product Goods Product
Engagement Topic Sales Store Product Catalog
Product Browse Engagement Order Delivery Method Product Category
Product Order Engagement
Shopping Cart Engagement
SMS Engagement
Website Engagement
GLOBAL CONSENT ENGAGEMENT CHANNEL CONSENT CONTACT POINT CONSENT DATA USE PURPOSE
(Object: Individual, Party Consent) (Object: Contact Point Type Consent) (Object: Contact Point Consent) (Object: Data Use Purpose)
You can submit Data Deletion requests for Individual profiles in Data Cloud. All requests
must be submitted using the Consent API.
A Data Deletion request deletes the specified Individual entity and any entities where a
relationship has been defined between that entity's identifying attribute and the Individual ID
attribute.
WARNING If an individual has data stored in a Data Stream that isn’t mapped to the Individual entity,
the Consent API doesn’t remove that data.
Data Deletion requests are reprocessed at 30, 60, and 90 days to ensure a full deletion. You
must submit any data deletion requests in all connected systems and Salesforce clouds.
● Multi Format (Json, csv, ● Schema enforced ● Semantic Mapping establishes Data Spaces - Once your data has been
parquet, orc) ● Parquet formatted Iceberg Tables DLO to DMO ingested, it is assigned to a Data Space that
● ●
Multi Sourced - Cloud Storage, Hydrated by transformations ● Can be optionally materialized acts as a partition, allowing you greater
Mulesoft, Kafka ● Typed (Profile Vs Engagement) ● Insights, Unified Profiles are control over how your data is organized
● Schema Preserving ● Materialized Tables DMOs
● Salesforce Data come direct into ● Simplified Curated Data to Data Model Objects - These are either
● Virtual BYOL Tables
Lake Objects Powers Business Applications materialized or views on top of the Data
Lake Objects. These can be Customer 360
DMO or materialized ones such as Unified
Individual, Computed Insights,
transformations etc.
DMO Does Not Have Category Preassigned
OTHER
Data Model "Interlocking"
Implementation tip
One way to "trick" the system is to mock a data
stream that will serve as a temporary "last
mapped stream"
One DLO to One DMO Mapping
How can you map same source data multiple times?
Secondary Email
One DLO to One DMO Mapping
Bundle Enforcement - Inability to Change Default Mapping
External User ID
Behind the Scenes
Understanding primary keys
Follow Along
Simplified use case to visualize the concept
CSV File
Name: Customer Profile
Source: S3 Commerce
ID First Name
ID 🔑 First Name
1 Homer
2 Marge
1 Homer
?
Data Extension Name: MC Customer
Source: SFMC1401234
ID First Name ID 🔑 First Name
1 Bart
2 Marge
?
Primary Key
PK DMO != PK DLO/DSO
CSV File
Name: Customer Profile
Source: S3 Commerce
ID First Name Individual DMO
ID 🔑 First Name
1 Homer
1 Homer Individual Id First Name Data Source
2 Marge
2 Marge
1 Homer
CSV File
Name: Customer Profile
Source: S3 Commerce
ID First Name Individual DMO
ID 🔑 First Name
1 Homer
1 Homer Individual Id First Name Data Source
2 Marge
2 Marge 1 Homer S3 Commerce
1 Homer
2 Marge S3 Commerce
Data Extension Name: MC Customer 1 Bart SFMC1401234
Source: SFMC1401234
2 Marge SFMC1401234
ID First Name ID 🔑 First Name
1 Bart 1 Bart
2 Marge 2 Marge
Getting Exam Ready
Real time data from CRM link
2 Complete
● Module : Ingestion and Data Modelling in Data Cloud
● Data Cloud : Data Modelling
● Module : Customer 360 Data Model for Data Cloud
● Streaming Data Transforms in Data Cloud: Quick Look
● Batch Data Transforms in Data Cloud: Quick Look
➔ Complete
● Activity: Set Up Your Instance
● Activity: Prepare Your Data
● Activity: Configure Data Ingestion
● Activity: Configure Batch Transforms
● Activity: Configure Data Model
sfdc.co/DCAcademyGuide
Thank you
Please provide your valuable feedback
post closing this zoom session, your
feedback will be very valuable to us