Data Platform Best Practices
BULLETPROOF DATA PLATFORM IN AZURE
VALDAS MAKSIMAVIČIUS
Here is a document to help you speed up the implementation of a
Modern Data Platform in Azure. Whether you are a single team or an
enterprise, you want to ensure your environment is secure, cost-effective
and maintainable. My goal is to help you not only deliver a successful
minimum viable product, but also ensure the solution is extensible enough
to onboard new teams or use cases without major refactoring. Let’s
investigate how to organize resources in Azure, how to link Azure Key
Vault, Azure Databricks and Azure Data Factory, and much more.
"It’s a terrible irony that these very early decisions are also
the least informed. Your team is most
ignorant of the eventual structure of the software in the
beginning, yet that is when some of the most irrevocable
decisions must be made."
There is no way this document can withstand the test of time as the services described here
are changing fast. Hence, I will do my best to keep it up to date and notify all my
subscribers.
Hi there!
https://www.valdas.blog
https://www.dataplatformschool.com
2. Architecture Overview
The official Azure documentation often takes a siloed approach and misses out on more
advanced Big Data / Machine Learning end-to-end scenarios. I collected a list of awesome
blog posts on Azure Databricks, Azure Data Factory, Azure Data Lake and other related
topics. Use them to understand the bigger picture, build configurable end-to-end pipelines,
automate deployment, and set up access and permissions.
Building Modern Data Platform in Azure - Useful Resources (last updated in July 2020)
https://www.valdas.blog/2019/07/13/azure-useful-links/