Azure Storage
Azure Storage
Data lake is a data repository that can store large amounts of structured, semi-structured, and unstructured data.
Data lakes are highly scalable and can handle vast amounts of raw data in its native format.
Why you choose Azure Data Lake Storage Gen2 over Azure Blob Storage in your project?
Hierarchical Namespace: ADLS Gen2 offers a directory and file system-like structure, which simplifies organizing and
managing large volumes of data compared to Blob Storage’s flat namespace.
Optimized for Big Data: ADLS Gen2 is designed specifically for big data analytics, providing better performance for large-
scale data processing and querying.
Advanced Security Features: ADLS Gen2 includes fine-grained access control and integration with Azure Active
Directory, enhancing security for sensitive data.
Integration with Big Data Tools: ADLS Gen2 integrates seamlessly with tools like Azure Databricks, Azure Synapse
Analytics, and HDInsight, making it ideal for analytics and data processing workflows.
Scalability and Performance: ADLS Gen2 is optimized for high-throughput data access and handling large datasets,
which is critical for analytics workloads.
What is the difference between Blob Storage and ADLS Gen 2?
redundant storage refers to the replication of data across multiple locations to ensure high availability and durability.
Azure provides several redundancy options to protect data from hardware failures. Locally Redundant Storage, Zone-
Redundant Storage, Geo-Redundant Storage
Hot Tier: Real-time analytics, active user data, and frequently accessed logs.
Cold Tier: Monthly backups, compliance and archival data, and historical logs.
what is max file size in adsl gen2?
the maximum file size in Azure Data Lake Storage Gen2 is 5 terabytes (TB).
A block is a single unit in a Blob. A Blob can contain many blocks but not more than 50,000 blocks per Blob. This means
you can split a Blob into 50,000 blocks to upload to Azure Blobs storage. The minimum size of a block is 64KB and
maximum is 100 MB