We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5
Google Cloud Computing Foundation Course - Week 5 Lecture Notes Summary
1. Storage Options in the Cloud
• Overview: GCP offers multiple storage solutions from object storage to managed databases. • Main Storage Categories: 1. Relational Databases: Cloud SQL (MySQL, PostgreSQL) and Cloud Spanner. 2. NoSQL Databases: Cloud Datastore and Cloud Bigtable. 3. Enterprise Data Warehouse: BigQuery (handles large datasets for analytics). • Three Key Use Cases for Cloud Storage: 1. Content Storage and Delivery: Suitable for serving images, videos, and files to users across regions. 2. Storage for Data Analytics and Compute: Leverage cloud storage for analytics, IoT, and genomics data. 3. Backup and Archival: Cost-effective options for infrequently accessed data and disaster recovery.
2. Structured vs. Unstructured Storage in the Cloud
• Structured Data: • Organized in a defined format, like rows and columns (e.g., databases, spreadsheets). • Easy to capture, analyze, and manipulate. • Examples: names, addresses, and billing information. • Unstructured Data: • Accounts for 80% of all data, difficult to analyze using traditional methods. • Examples: text, emails, multimedia (videos, photos), web pages. • Increasing focus on mining unstructured data for insights. • Storage Decision Tree: • Unstructured Data: Cloud Storage for backups, logs, media. • Structured Data: Cloud Bigtable and BigQuery for analytics. • Relational Database: Cloud SQL or Cloud Spanner. • NoSQL Database: Cloud Datastore for application data.
3. Cloud Storage for Unstructured Data
• Cloud Storage: A solution for storing and managing large amounts of object data (e.g., media, backups). • Storage Classes: 1. Multi-Regional Storage: • High redundancy across at least two locations. • Ideal for frequently accessed data (e.g., website content, videos). 2. Regional Storage: • Stored in one geographic region, cheaper than multi-regional. • Suitable for data analytics and machine learning. 3. Nearline Storage: • Cost-effective for infrequently accessed data (accessed less than once a month). • Examples: file storage for monthly analysis, backups. 4. Coldline Storage: • Lowest cost, ideal for archival and disaster recovery. • Suitable for data accessed once a year or less. • Unique Features of Cloud Storage: 1. Single API with low-latency access. 2. 99.999999999% durability across all storage classes. 3. Lifecycle Management: Automatically move data between classes based on access frequency.
4. Cloud Databases and NoSQL Options
• Cloud Datastore: A NoSQL document store, ideal for handling web and mobile app data. • Cloud BigTable: Best suited for NoSQL database applications with high-throughput and low-latency needs. • Cloud Spanner: A horizontally scalable, fully managed relational database designed for high availability and performance.
5. Data Management Features
• Cloud IAM (Identity and Access Management): Manage user access to objects and buckets. • Access Control Lists (ACLs): More fine-grained access control options (read, write permissions). • Object Versioning: Keeps a history of file modifications, supports rollback and deletion of older versions. • Bucket Organization: Data is organized into "buckets," each with a unique name and assigned region.
6. Use Cases and Labs
• Hands-on labs include working with Cloud SQL, importing data, and performing data analysis. • App development involves storing data in Cloud Datastore and understanding its scalability. QUESTIONS & ANSWERS What is the focus of Module 5 in the Google Cloud Computing Foundation course? • The focus is on leveraging managed storage and databases for cloud applications. • What are the learning objectives for this module? • To implement structured and unstructured storage models, compare different cloud storage options, and understand relational vs. NoSQL databases. • What are the main storage options in GCP? • GCP offers object stores, relational databases like Cloud SQL, and NoSQL databases like Cloud Datastore and BigTable. • What are the three common use cases for cloud storage? • Content storage and delivery, storage for analytics and compute, and backup/archival storage. • What does Cloud SQL offer? • It provides managed services for relational databases, specifically MySQL and PostgreSQL. • What is Cloud Spanner? • A horizontally scalable, highly available database service designed for mission-critical applications. • What is the difference between structured and unstructured data? • Structured data is organized in a defined manner, like spreadsheets, while unstructured data includes multimedia, emails, documents, and more. • What are examples of structured data? • Name, address, contact number, and billing information. • Why is unstructured data difficult to process? • It lacks a clear internal structure, making it hard to analyze using traditional methods. • What is Google Cloud Storage used for? • Storing and managing object data such as images, media files, backups, etc. • What is the size limit for objects in Google Cloud Storage? • Up to 5 terabytes per object. • What is multi-regional storage? • It stores data across multiple geographic regions to ensure high availability, ideal for content frequently accessed worldwide. • What is regional storage? • It stores data in a specific region, offering less redundancy than multi-regional storage but is ideal for analytics and machine learning. • What is Nearline Storage? • A low-cost storage option for data accessed less than once a month, useful for online backups and analysis. • What is Coldline Storage? • A very low-cost service for data that is rarely accessed, ideal for archiving and disaster recovery. • What is object lifecycle management in Cloud Storage? • It automates moving data to lower-cost storage as it's accessed less frequently. • What are buckets in Cloud Storage? • Buckets are containers used to organize files in Google Cloud Storage, each with a globally unique name. • What security features does Cloud Storage offer? • Access is controlled via Cloud IAM or access control lists (ACLs), defining who can perform actions on the bucket or object. • What is object versioning in Cloud Storage? • It keeps a history of modifications to objects, allowing users to restore previous versions if needed. • What is Cloud Datastore used for? • It is a NoSQL document store ideal for web and mobile applications. • What makes Cloud BigTable unique? • It is a NoSQL database that is highly scalable and designed for big data use cases like analytics. • How does Cloud SQL differ from Cloud Spanner? • Cloud SQL offers managed MySQL and PostgreSQL services, while Cloud Spanner is designed for high availability and scalability with horizontal scaling. • What is the use case for Cloud BigQuery? • It is used for real-time analytics on large datasets. • Why is backup and archival storage important in the cloud? • It provides cost savings and ensures data is recoverable in case of loss or failure on- premises. • What are the key differences between relational and NoSQL databases? • Relational databases use structured data with predefined schemas, while NoSQL databases are more flexible and handle unstructured or semi-structured data. • What kind of storage is best for serving global website content? • Multi-regional storage is ideal for this use case, offering redundancy and low-latency access. • What role does Cloud IAM play in Cloud Storage? • It provides role-based access control across the project, bucket, and object levels. • How can you reduce storage costs in Google Cloud? • By using object lifecycle management to move data to cheaper storage classes based on access frequency. • What does the Cloud Storage API offer? • A single API to access all storage classes with low-latency data access and high durability. • What does the minimum storage duration mean in Coldline storage? • Data must be stored for at least 90 days in Coldline storage due to its low-cost and archival focus.