IoT Assignment 2
Section: 1
Task 1:
Q1
Virtualization refers to the act of creating a virtual (rather than actual)
version of something, including virtual computer hardware platforms,
operating systems, storage devices, and computer network resources.
Applications: These represent the software programs or services running within each
virtual machine (VM). They are the user-facing components that perform specific tasks
or functions.
Guest OS (Operating System): Each virtual machine runs its own guest operating system,
which is independent of the host operating system. This allows different operating
systems (e.g., Windows, Linux) to run simultaneously on the same physical hardware.
Hypervisor: The hypervisor is the virtualization layer that enables the creation and
management of virtual machines. It abstracts and partitions the underlying physical
resources, allowing multiple VMs to share the same physical hardware. Examples of
hypervisors include VMware ESXi, Microsoft Hyper-V, and KVM (Kernel-based Virtual
Machine).
Operating System (Host OS): In a hosted (Type-2) setup, this is the operating system
installed on the physical server; it manages the hardware resources and provides the
services on which the hypervisor runs. It is distinct from the guest operating systems
running inside the VMs.
Physical Server: This represents the underlying physical hardware on which the
hypervisor runs and hosts multiple virtual machines.
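To make the layering concrete, here is a minimal sketch (not part of the original answer) that uses the libvirt Python bindings to connect to a local KVM/QEMU hypervisor and list its guest VMs; the qemu:///system URI and the presence of libvirt-python are assumptions.

```python
# Minimal sketch: ask a hypervisor (KVM/QEMU via libvirt) which guest VMs it hosts.
# Assumes the libvirt Python bindings are installed and qemu:///system is reachable.
import libvirt

def list_guests(uri: str = "qemu:///system") -> None:
    conn = libvirt.open(uri)                    # connect to the hypervisor layer
    try:
        print(f"Host (physical server): {conn.getHostname()}")
        for dom in conn.listAllDomains():       # each domain is a guest VM
            state = "running" if dom.isActive() else "shut off"
            print(f"Guest VM: {dom.name()} ({state})")
    finally:
        conn.close()

if __name__ == "__main__":
    list_guests()
```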
Q2
Q4
Cloud Computing Deployment Models:
Public Cloud: In a public cloud model, cloud services and infrastructure are owned
and operated by a third-party cloud service provider.
Hybrid Cloud: A hybrid cloud combines elements of both public and private
clouds, allowing data and applications to be shared between them.
Q6
Open source cloud computing tools have gained significant popularity in recent years,
offering a cost-effective and flexible alternative to proprietary cloud solutions. Below is a
critical evaluation and comparison of some popular open source cloud computing tools:
1. OpenStack: OpenStack is one of the most widely used open source cloud platforms. It
provides a comprehensive set of tools for building and managing private and public clouds.
Pros:
Highly scalable and customizable
Supports a wide range of hypervisors, including KVM, VMware, and Hyper-V
Large community and ecosystem
Supports multiple storage and networking options
Cons:
Steep learning curve due to its complexity
Requires significant resources and expertise for deployment and management
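As a rough illustration of how OpenStack services are driven through one API, here is a minimal sketch using the openstacksdk library; the clouds.yaml entry named "mycloud" is a hypothetical placeholder for real credentials.

```python
# Minimal sketch: list OpenStack compute instances and images via openstacksdk.
# Assumes a clouds.yaml entry named "mycloud" supplies the credentials (hypothetical).
import openstack

conn = openstack.connect(cloud="mycloud")

print("Compute instances:")
for server in conn.compute.servers():
    print(f"  {server.name}: {server.status}")

print("Images:")
for image in conn.image.images():
    print(f"  {image.name}")
```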
2. Apache CloudStack: Apache CloudStack is another popular open source cloud platform that
provides a highly scalable and customizable infrastructure.
Pros:
Easier to deploy and manage than OpenStack
Supports a wide range of hypervisors and storage options
Strong focus on security and compliance
Cons:
Smaller community and ecosystem compared to OpenStack
Limited support for certain features, such as load balancing and autoscaling
3. Eucalyptus: Eucalyptus is an open source cloud platform that provides a highly scalable and
customizable infrastructure, with a focus on AWS compatibility.
Pros:
Compatible with AWS APIs, making it easy to migrate workloads
Supports a wide range of hypervisors and storage options
Easy to deploy and manage
Cons:
Smaller community and ecosystem compared to OpenStack
Limited support for certain features, such as load balancing and autoscaling
4. Cloud Foundry: Cloud Foundry is an open source platform-as-a-service (PaaS) that provides a
highly scalable and customizable application deployment environment.
Pros:
Easy to deploy and manage applications
Supports multiple programming languages and frameworks
Strong focus on DevOps and continuous integration/continuous deployment
(CI/CD)
Cons:
Limited support for infrastructure management and orchestration
Limited support for certain features, such as load balancing and autoscaling
5. Kubernetes: Kubernetes is an open source container orchestration platform for
automating the deployment, scaling, and management of containerized applications.
Pros:
Highly scalable and customizable
Supports multiple container runtimes, including Docker and rkt
Strong focus on automation and orchestration
Cons:
Steep learning curve and significant operational complexity
Q7
3. Big Data Architect: A big data architect designs the overall architecture
and framework for handling large volumes of data.
8. Data Privacy Officer: Data privacy officers are responsible for ensuring
that organizations comply with relevant data protection laws and
regulations.
10. Data Security Analyst: Data security analysts protect data assets
from unauthorized access, breaches, and cyber threats.
Q8
Data Analytics Life Cycle
1. Data Collection: In this initial stage, data is gathered from various
sources, which can be structured or unstructured.
2. Data Processing: The collected data is cleaned, transformed, and organized
so that it is ready for analysis.
3. Data Analysis: Statistical and machine learning techniques are applied to
the prepared data to extract patterns and insights.
4. Data Visualization: The insights obtained from the data analysis are then
visualized using charts, graphs, and other visual representations (a small
sketch follows below).
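As a small sketch of the visualization stage, the snippet below plots made-up monthly figures with matplotlib; the data values are purely illustrative.

```python
# Small sketch of the visualization stage using matplotlib and made-up data.
import matplotlib.pyplot as plt

months = ["Jan", "Feb", "Mar", "Apr"]
sales = [120, 150, 90, 180]          # hypothetical values

plt.bar(months, sales, color="steelblue")
plt.title("Monthly Sales (example data)")
plt.xlabel("Month")
plt.ylabel("Units sold")
plt.tight_layout()
plt.show()
```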
Big Data Challenges and Solutions:
Data Velocity:
Problem: Handling the high-speed and real-time influx of data streams.
Solution: Stream processing technologies such as Apache Kafka and
Apache Flink enable real-time processing and analysis of streaming data.
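A minimal sketch of consuming a real-time stream with the kafka-python client is shown below; the broker address and the "sensor-readings" topic name are assumptions.

```python
# Minimal sketch: consume a real-time event stream with kafka-python.
# Assumes a broker at localhost:9092 and a hypothetical topic "sensor-readings".
import json
from kafka import KafkaConsumer

consumer = KafkaConsumer(
    "sensor-readings",
    bootstrap_servers="localhost:9092",
    value_deserializer=lambda raw: json.loads(raw.decode("utf-8")),
    auto_offset_reset="latest",
)

for message in consumer:                 # blocks, processing events as they arrive
    reading = message.value
    print(f"device={reading.get('device_id')} value={reading.get('value')}")
```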
Data Variety:
Problem: Dealing with diverse data types and formats (structured, semi-
structured, unstructured).
Solution: Techniques like schema-on-read (used in NoSQL databases) and
data virtualization enable flexibility in handling different data formats.
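To illustrate schema-on-read flexibility, here is a minimal sketch using the pymongo driver against a local MongoDB instance; the database and collection names are hypothetical.

```python
# Minimal sketch: documents with different shapes stored in one MongoDB collection,
# illustrating schema-on-read. Assumes a local MongoDB and the pymongo driver.
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
events = client["analytics"]["events"]            # hypothetical names

events.insert_many([
    {"type": "click", "page": "/home", "user": 42},             # structured fields
    {"type": "sensor", "payload": {"temp": 21.5, "unit": "C"}},  # nested document
    {"type": "log", "raw": "ERROR disk full on node-3"},         # unstructured text
])

for doc in events.find({"type": "click"}):        # schema interpreted at read time
    print(doc["page"], doc["user"])
```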
Data Veracity:
Problem: Ensuring data quality, accuracy, and reliability.
Solution: Data cleansing, anomaly detection, and data validation
techniques identify and correct errors, outliers, and inconsistencies in
data.
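A small sketch of basic cleansing and outlier detection with pandas is shown below, using synthetic readings generated purely for illustration.

```python
# Small sketch: deduplicate, impute missing values, and flag outliers with pandas.
# The readings are synthetic, generated only to demonstrate the technique.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
values = np.append(rng.normal(10, 1, size=50), 250.0)   # 50 normal readings + 1 outlier
df = pd.DataFrame({"value": values})
df.loc[5, "value"] = None                                # introduce a missing value

df = df.drop_duplicates()                                # remove exact duplicates
df["value"] = df["value"].fillna(df["value"].median())   # impute missing values

# Flag values more than 3 standard deviations from the mean as suspect.
z = (df["value"] - df["value"].mean()) / df["value"].std()
df["is_outlier"] = z.abs() > 3

print(df[df["is_outlier"]])
```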
Scalability and Performance:
Problem: Ensuring scalability and performance of analytics processes as
data volumes grow.
Solution: In-memory computing platforms (e.g., Apache Spark's RDDs and
DataFrames) and distributed databases (e.g., Apache Cassandra,
MongoDB) optimize data processing and storage performance.
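The sketch below shows the same idea with PySpark: a DataFrame aggregation written once runs locally or across a cluster, because Spark partitions and distributes the work. A local PySpark installation is assumed, and the event records are made up.

```python
# Minimal sketch: a distributed aggregation with PySpark DataFrames.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("scalability-sketch").getOrCreate()

# Hypothetical event records; in practice this would be read from HDFS, S3, etc.
events = spark.createDataFrame(
    [("click", 1), ("view", 1), ("click", 1), ("purchase", 1)],
    ["event_type", "count"],
)

# The aggregation runs in parallel across partitions (and across nodes on a cluster).
events.groupBy("event_type").sum("count").show()

spark.stop()
```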
Complexity of Analysis:
Problem: Conducting complex analytics tasks on large datasets
efficiently.
Solution: Machine learning algorithms (e.g., deep learning, ensemble
methods) and graph analytics (e.g., Apache Giraph, Neo4j) enable
advanced pattern recognition, predictive modeling, and graph-based
analysis.
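As a small example of an ensemble method, the sketch below trains a random forest with scikit-learn on its bundled iris dataset; it illustrates the technique rather than any specific analysis from this answer.

```python
# Small sketch: an ensemble method (random forest) on the bundled iris dataset.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

model = RandomForestClassifier(n_estimators=100, random_state=0)  # ensemble of trees
model.fit(X_train, y_train)

print("accuracy:", accuracy_score(y_test, model.predict(X_test)))
```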
Privacy and Security:
Problem: Protecting sensitive data from unauthorized access and
ensuring compliance with data privacy regulations.
Solution: Advanced encryption techniques (e.g., homomorphic
encryption), access controls, and anonymization methods (e.g.,
differential privacy) enhance data security and privacy in big data
environments.
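A toy sketch of the Laplace mechanism behind differential privacy is shown below: calibrated noise is added to a count before release. The epsilon value and the records are hypothetical.

```python
# Toy sketch of the Laplace mechanism used in differential privacy:
# noise scaled to sensitivity/epsilon is added to a count before it is released.
import numpy as np

rng = np.random.default_rng(42)

ages = np.array([23, 35, 41, 29, 52, 60, 31])   # made-up sensitive records
true_count = int((ages > 40).sum())             # query: how many people are over 40?

epsilon = 0.5                                   # privacy budget (smaller = more private)
sensitivity = 1                                 # one person changes the count by at most 1
noisy_count = true_count + rng.laplace(loc=0.0, scale=sensitivity / epsilon)

print(f"true count = {true_count}, released (noisy) count = {noisy_count:.2f}")
```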
Real-time Analytics:
Techniques: Stream processing, event-driven architectures, complex event
processing (CEP).
Applications: Real-time monitoring, fraud detection, IoT analytics.
Graph Analytics:
Techniques: Graph algorithms (e.g., PageRank, community detection), graph
databases.
Applications: Social network analysis, recommendation systems, network
optimization.
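As a minimal sketch of graph analytics, the snippet below runs PageRank with networkx on a tiny made-up follower graph.

```python
# Minimal sketch: PageRank on a tiny made-up follower graph with networkx.
import networkx as nx

g = nx.DiGraph()
g.add_edges_from([
    ("alice", "bob"), ("bob", "carol"),
    ("carol", "alice"), ("dave", "carol"),
])

ranks = nx.pagerank(g, alpha=0.85)     # importance score per node
for node, score in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(f"{node}: {score:.3f}")
```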
Q9
Q10
Clustering is the process of dividing a dataset into groups (clusters) of
similar data points.
Types of Clustering
• Exclusive (Hard) Clustering:
Each data point/item belongs exclusively to one cluster.
For example: k-Means clustering (a minimal sketch follows after this list).
• Overlapping (Soft) Clustering:
A data point/item may belong to multiple clusters.
For example: Fuzzy C-Means clustering.
• Hierarchical Clustering:
The hierarchy of clusters is developed in the form of a tree, and this
tree-shaped structure is known as a dendrogram.
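A minimal sketch of exclusive (hard) clustering with k-Means in scikit-learn follows; the 2-D points are made up so that two clear clusters emerge.

```python
# Minimal sketch: k-Means (exclusive/hard clustering) on made-up 2-D points.
import numpy as np
from sklearn.cluster import KMeans

points = np.array([
    [1.0, 1.1], [1.2, 0.9], [0.8, 1.0],    # cluster around (1, 1)
    [5.0, 5.2], [5.1, 4.8], [4.9, 5.0],    # cluster around (5, 5)
])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(points)

print("labels:", kmeans.labels_)            # each point belongs to exactly one cluster
print("centroids:", kmeans.cluster_centers_)
```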
Q11
Implement a one-node Hadoop cluster and explain the Hadoop components and the
meaning of MapReduce and its steps, with an example.
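Below is a self-contained sketch that simulates the three MapReduce steps (map, shuffle/sort, reduce) for the classic word-count example; on a real one-node Hadoop cluster (HDFS for storage, YARN for resource management, MapReduce for processing) the same mapper/reducer logic would typically be submitted via Hadoop Streaming or the Java API.

```python
# Self-contained sketch simulating the MapReduce steps for word count.
# On a real Hadoop cluster this logic would run as mapper/reducer tasks.
from collections import defaultdict

documents = ["big data needs big tools", "hadoop splits big jobs"]  # toy input

# 1) Map: emit a (word, 1) pair for every word in every input record.
mapped = [(word, 1) for line in documents for word in line.split()]

# 2) Shuffle/sort: group all emitted values by key so each key's values arrive together.
grouped = defaultdict(list)
for word, count in mapped:
    grouped[word].append(count)

# 3) Reduce: aggregate the grouped values per key (here: sum the counts).
reduced = {word: sum(counts) for word, counts in sorted(grouped.items())}

print(reduced)   # e.g. {'big': 3, 'data': 1, 'hadoop': 1, ...}
```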