Professional Data Engineer Certification Exam Guide
Professional Data Engineer Certification Exam Guide
Data validation
Designing for data and application portability (e.g., multi-cloud and data
residency requirements)
Designing the project, dataset, and table architecture to ensure proper data
governance
Networking fundamentals
Data encryption
2.2 Building the pipelines. Considerations include:
Data cleansing
Identifying the services (e.g., Dataflow, Apache Beam, Dataproc, Cloud Data
Kafka)
Transformation
Batc
Languag
Choosing managed services (e.g., Bigtable, Cloud Spanner, Cloud SQL, Cloud
Managing the lake (configuring data discovery, access, and cost controls)
Processing data
Connecting to tools
Precalculating fields
Identity and Access Management (IAM) and Cloud Data Loss Prevention
(Cloud DLP)
Publishing datasets
Publishing reports and visualizations
Analytics Hub
Preparing data for feature engineering (training and serving machine learning
models)
Flex, on-demand, and flat rate slot pricing (index on flexibility or fixed
capacity)