Unit – 3 - Data Center Architecture
Unit – 3 - Data Center Architecture
-Suryadeepsinh P. Jadeja
Course Outcome
• Understand Data Center Architecture.
Data Center Fundamentals
• Data Center Architecture : Data center architecture in the cloud refers to
the “the physical and logical layout of the resources and equipment within
a data center facility”
• What is a data center?
– A data center is a physical location that stores computing machines and their
related hardware equipment.
– It contains the computing infrastructure that IT systems require, such as servers,
data storage drives, and network equipment.
– It is the physical facility that stores any company’s digital data.
• Data centers are backbone of the cloud computing ecosystem that provide
scalable and flexible infrastructure to support various applications and
workloads.
• Data centers in cloud computing require high levels of reliability, availability,
and scalability. To achieve continuous operation and reduce the possibility
of service disruptions, they are built with redundancy and failover
mechanisms. These mechanisms include security measures such as access
controls, encryption, and monitoring which protect from unauthorized
access and cyber threats only.
History & Evolution of Data Centers
• The history of data centers began in the 1940s with the development of the Electronic Numerical Integrator and Computer (ENIAC). The
Electronic Numerical Integrator and Computer (ENIAC) was the first electronic digital programmable general-purpose computer. The
U.S. military designed the ENIAC to calculate artillery firing tables. However, it was not completed until late 1945.
• The ENIAC was large, weighing more than 27 tons and occupying 300 square feet of space. Its thousands of vacuum tubes, resistors,
capacitors, relays, and crystal diodes could only perform numeric calculations. Punch cards served as data storage. The first data center
(called a “mainframe”) was built in 1945 to house the ENIAC at the University of Pennsylvania.
• Early computer systems, complex to operate and maintain, required a special environment in which to operate. Many cables were
necessary to connect all the components, and methods to accommodate and organize these were devised such as standard racks to
mount equipment, raised floors, and cable trays (installed overhead or under the elevated floor). A single mainframe required a great
deal of power and had to be cooled to avoid overheating. Security became important – computers were expensive, and were often
used for military purposes. Basic design guidelines for controlling access to the computer room were therefore devised.
• During the boom of the microcomputer industry, and especially during the 1980s, users started to deploy computers everywhere, in
many cases with little or no care about operating requirements. The availability of inexpensive networking equipment, coupled with
new standards for the network structured cabling, made it possible to use a hierarchical design that put the servers in a specific room
inside the company. The use of the term data center, as applied to specially designed computer rooms, started to gain popular
recognition about this time.
• The boom of data centers came during the dot-com bubble of 1997–2000. Companies needed fast Internet connectivity and non-stop
operation to deploy systems and to establish a presence on the Internet. Installing such equipment was not viable for many smaller
companies. Many companies started building very large facilities, called internet data centers which provide enhanced capabilities,
such as crossover backup.
• The term cloud data centers (CDCs) have been used. The global data center market saw steady growth in the 2010s, with a notable
acceleration in the latter half of the decade. According to Gartner, worldwide data center infrastructure spending reached $200 billion
in 2021, representing a 6% increase from 2020 despite the economic challenges posed by the COVID-19 pandemic.
• The United States is currently the foremost leader in data center infrastructure, hosting 5,381 data centers as of March 2024, the
highest number of any country worldwide.
Continue…
• With the changing times, the data center is
constantly changing and developing, which
also promotes the evolution of the network
architecture to meet the growing business
needs.
• Currently, most modern data center network
architectures have evolved from on-premises
physical servers to a virtualized infrastructure
supporting networks, applications, and
workloads in multiple private and public
clouds
• Specifically, they are designed for the needs
of cloud computing in which several services
are provided over the internet. The ability to
scale up or down, and the high level of
protection and dependability are therefore
ensured, along with a big range of options in
terms of applications being provided through
the internet.
Continue…
• Why are Data Centers Important?
– Every business needs computing equipment to run its web applications, offer services to
customers, sell products, or run internal applications for accounts, human resources, and
operations management.
– As the business grows and IT operations increase, the scale and amount of required
equipment also increases exponentially.
– Equipment that is distributed across several branches and locations is hard to maintain.
Instead, companies use data centers to bring their devices to a central location and
manage it cost effectively.
– A data center facility, which provide various services to an organization includes the
following:
• Systems for storing, sharing, accessing and processing data across the organization;
• Physical infrastructure for supporting data processing and data communications;
and
• Utilities such as cooling, electricity, network security access and uninterruptible
power supplies (UPSes).
– Data centers bring several benefits, such as:
• Backup power supplies to manage power outages
• Data replication across several machines for disaster recovery
• Temperature-controlled facilities to extend the life of the equipment
• Easier implementation of security measures for compliance with data laws.
Key components of a data center
• A data center is a facility composed of various components that are designed to
store, process, manage, and disseminate large amounts of data. The key
components of a data center include:
– Servers:
• Physical or virtual machines that process and store data.
• Can include different types such as application servers, database servers, and web servers.
– Storage Systems:
• Hardware and software systems for storing and managing data.
• Types of storage include Direct-Attached Storage (DAS), Network-Attached Storage (NAS), and Storage
Area Network (SAN).
– Networking Equipment:
• Routers, switches, and firewalls to manage data traffic within the data center and between the data
center and external networks.
• Ensures reliable and efficient communication between servers and other devices.
– Data Center Networking Architecture:
• Design and topology that define how networking components are interconnected.
• Includes considerations for redundancy, bandwidth, and scalability.
– Power Infrastructure:
• Uninterruptible Power Supply (UPS) systems to provide continuous power in case of outages.
• Power distribution units (PDUs) for distributing power to servers and equipment.
Continue…
– Cooling Systems:
• Air conditioning units and HVAC systems to maintain an optimal temperature within the data center.
• Prevents overheating of servers and other hardware components.
– Physical Security:
• Measures to secure the physical premises, including access controls, surveillance cameras, and
security personnel.
• Protects against unauthorized access, theft, and other physical threats.
– Fire Suppression Systems:
• Specialized systems to detect and suppress fires in the data center.
• Use of fire-resistant materials and suppression agents like inert gases.
– Environmental Monitoring:
• Sensors to monitor temperature, humidity, and other environmental factors.
• Provides early warnings to prevent hardware damage due to adverse conditions.
– Backup Systems:
• Regularly scheduled backups of data to ensure data recovery in the event of data loss or system
failures.
• May include tape backups, off-site storage, or cloud-based backup solutions.
– Virtualization:
• Hypervisors or virtualization platforms that enable the creation and management of virtual machines.
• Improves resource utilization and facilitates efficient server deployment.
Continue…
– Management and Monitoring Tools:
• Software tools for monitoring the performance, health, and resource utilization of
servers and other components.
• Helps in identifying and addressing issues proactively.
– Redundancy and High Availability:
• Duplicate systems and components to ensure uninterrupted operation in case of failures.
• Strategies like clustering and load balancing for high availability.
– Cabling Infrastructure:
• Structured cabling systems for connecting servers, networking equipment, and other
devices.
• Organized cabling improves manageability and facilitates troubleshooting.
– Automation and Orchestration Tools:
• Tools for automating routine tasks, deploying updates, and orchestrating workflows.
• Enhances operational efficiency and reduces manual intervention.
– Documentation and Labelling:
• Comprehensive documentation of the data center layout, configurations, and
procedures.
• Proper labeling of equipment and cables for easy identification and maintenance.
– These components work together to create a reliable, secure, and efficient
data center infrastructure capable of meeting the demands of modern
computing and data storage requirements.
Types Of Data Centers
• Data centers come in various types, each designed to meet specific
needs and requirements.
• The classification of data centers is often based on factors such as
size, purpose, ownership, and the level of redundancy. Here are
several types of data centers:
– Enterprise Data Centers
• Operated by individual organizations for their own IT needs.
• Characteristics:
– Serve the computing needs of a specific enterprise.
– May include on-premises facilities or private cloud environments.
– Managed and owned by the organization.
• Benefits
– An enterprise data center can give better security because you manage risks internally.
You can customize the data center to meet your requirements.
• Limitations
– It is costly to set up your own data center and manage ongoing staffing and running
costs. You also need multiple data centers because just one can become a single high-risk
point of failure.
Continue…
– Managed Services Data Centers
• Facilities where a third-party provider manages and
operates the IT infrastructure for clients.
• serving customers directly.
• Characteristics:
– Managed service providers (MSPs) offer a range of services,
including server management, security, and monitoring.
– Clients outsource IT operations to focus on core business
activities.
Continue…
– Colocation data centers
• Colocation facilities are large data center facilities in which you can
rent space to store your servers, racks, and other computing hardware
provided and owned by different organizations.
• Characteristics:
– Tenants own and manage their servers and IT equipment.
– Shared infrastructure, with cost savings achieved through economies of scale.
– Colocation providers offer services like security, cooling, and network
connectivity.
• Benefits :
– Colocation facilities reduce ongoing maintenance costs and provide fixed
monthly costs to house your hardware. You can also geographically distribute
hardware to minimize latency and to be closer to your end users
• Limitations
– It can be challenging to source colocation facilities across the globe and in
different geographical areas you target. Costs could also add up quickly as you
expand.
Continue…
– Cloud Data Centers
• Infrastructure hosted by cloud service providers and
accessible over the internet..
• Characteristics:
– Provides scalable computing resources on a pay-as-you-go
model.
– Offers various services, including Infrastructure as a Service
(IaaS), Platform as a Service (PaaS), and Software as a Service
(SaaS).
• Benefits :
– A cloud data center reduces both hardware investment and
the ongoing maintenance cost of any infrastructure. It gives
greater flexibility in terms of usage options, resource sharing,
availability, and redundancy.
Continue…
– High-Performance Computing (HPC) Data Centers:
• Facilities optimized for handling complex and resource-
intensive computations.
• Characteristics:
– Used for scientific research, simulations, and advanced analytics .
– Requires specialized hardware for parallel processing.
– Scalability :
• Cloud data center networks should be designed to scale horizontally, accommodating the
increasing demand for resources. This involves adding more servers, storage, and
networking components as needed.
• Scalability ensures that the network can handle a growing number of users, applications,
and data without compromising performance
Continue…
– Software-Defined Networking (SDN) :
• Centralizes the control plane to make network management more flexible and
programmable.
• SDN enables dynamic configuration, efficient traffic management, and easier
implementation of network policies.
– Security :
• Security is a critical consideration in cloud data center networking. Virtual
Private Clouds (VPCs) and security groups are commonly used to isolate and
secure different parts of the network.
• Encryption, firewalls, and intrusion detection/prevention systems are
deployed to safeguard data in transit and protect against malicious activities.
– High Availability:
• Cloud data centers aim for high availability by designing redundant systems
and implementing failover mechanisms.
• This ensures that services remain accessible even in the event of hardware
failures or other disruptions.
• Redundant networking components, such as multiple network paths and
switches, contribute to a more resilient infrastructure.
Continue…
– Interconnectivity :
• Cloud data centers may span multiple geographical regions.
Interconnecting these regions efficiently is crucial for data
transfer and communication between different parts of the
cloud infrastructure.
• Content Delivery Networks (CDNs) may also be employed to
cache and deliver content from locations closer to end-users,
reducing latency.
– Elasticity :
• Elasticity in cloud data center networking refers to the ability
to dynamically scale resources up or down based on
demand. This is closely tied to the concept of on-demand
provisioning and de-provisioning of resources.
Continue…
• Advantages of Data Center Networking
– Scalability :
• Cloud data center networking allows for easy scalability, enabling organizations
to quickly and efficiently increase or decrease resources based on demand.
This flexibility is crucial for businesses with varying workloads.
– Cost Efficiency :
• Cloud computing eliminates the need for organizations to invest heavily in
physical infrastructure.
• Instead, they can leverage a pay-as-you-go model, paying only for the
resources they consume. This can result in significant cost savings.
– Resource Utilization :
• Virtualization and resource pooling in cloud data centers lead to better
utilization of computing resources. Multiple virtual machines can run on a
single physical server, optimizing hardware usage.
– Global Accessibility :
• Cloud data centers can be distributed globally, allowing users to access
applications and data from anywhere with an internet connection. This is
especially beneficial for businesses with a global presence or remote
workforce.
Continue…
– Automation and Orchestration :
• Cloud data center networking often incorporates automation and
orchestration tools. This streamlines repetitive tasks, accelerates deployment,
and reduces the likelihood of human errors in network configuration.
– High Availability and Reliability :
• Cloud providers design their data centers for high availability and reliability.
Redundaney, failover mechanisms. and distributed architectures contribute to
continuous service availability.
– Security Measures :
• Cloud providers invest heavily in security infrastructure, including firewalls,
encryption, and identity management. Centralized security controls and
regular updates enhance the overall security posture of cloud data center
networks.
– Rapid Deployment :
• Cloud data center networking allows for rapid deployment of new applications
and services. This agility is essential for businesses looking to quickly respond
to changing market conditions or roll out innovative solutions.
Continue…
• Disadvantages of Data Center Networking
– Dependence on Internet Connectivity :
• Cloud-based services rely on internet connectivity. If there are network
outages or slow internet connections, it can impact access to cloud resources.
– Security Concerns :
• While cloud providers implement robust security measures, some
organizations may have concerns about the security of their data in a shared
environment. Compliance with regulatory standards may also be a
consideration.
– Limited Customization and Control :
• Cloud users may have limited control over the underlying infrastructure. This
lack of customization can be a challenge for organizations with specific
requirements or those accustomed to managing every aspect of their network.
– Data Transfer Costs :
• While storing data in the cloud can be cost-effective, transferring large
volumes of data in and out of the cloud may incur additional charges. This can
be a consideration for businesses with significant data transfer needs.
Continue…
– Downtime During Maintenance :
• Cloud providers periodically perform maintenance on their data centers. While
efforts are made to minimize downtime, some services may be temporarily
unavailable during maintenance windows.
– Potential Vendor Lock-in :
• Moving data and applications between different cloud providers or
transitioning back to an on-premises environment can be challenging.
Organizations may face vendor lock-in, limiting their ability to switch providers
easily.
– Latency Concerns :
• Latency may be a concern for applications that require extremely low
response times. While cloud providers aim to minimize latency, certain
workloads may be better suited to on-premises solutions.
– Data Sovereignty and Compliance :
• Some organizations, especially in highly regulated industries, may have
concerns about data sovereignty and compliance. Knowing where data is
stored and ensuring compliance with regional regulations can be challenging in
a cloud environment.
Continue…
• Real-world Application of data center networking
– E-commerce and Retail
– Finance and Banking
– Healthcare
– Education
– Manufacturing and Supply Chain
– Media and Entertainment
– Gaming Industry
– Government and Public Services
Data Center Networking
• A data center network topology refers to the physical layout and
interconnection of network devices within a data center.
• It encompasses the arrangement and interconnection of various
components like servers, storage devices, networking equipment, and
power sources.
• A well-designed topology ensures high availability, scalability, and fault
tolerance while minimizing latency and downtime.
• Some common data center network topologies used in cloud computing:
– Tree Topology
– Fat-Tree Topology
– Spine-and-leaf Topology
– Mesh Topology
– Hyperconverged Infrastructure
– Virtualized Overlay Network
– Interconnected Pods
– Edge Computing Topologies
Continue…
Access
• Tree Topology :
– In a tree topology, switches and routers are organized in a hierarchical tree-like structure.
– Tree topology is implemented with multiple layers of switches, including an access layer for
individual servers located at the leaves and are connected to the switches at the edge level an
aggregation layer to connect multiple access switches, and a core layer for high-speed data
transmission between different network segments.
– Tree is simple , but has multiple drawbacks, performance and reliability of switches should increase
as we go higher in the topology levels, which would require different switches for different levels.
Continue…
Access
Access
Layer
• Fat-Tree Topology :
– Fat tree is an extension to tree topologies and widely used in commercial data center networks.
– Fat trees were originally proposed by Leiserson in 1985.
– Fat trees also consist of levels: core, aggregation, edges, and access. And adds the POD to the
terminology, which consists of aggregation switches, edge switches, and servers.
– Each core switch is connected to the aggregation layer of each POD through one port.
– Each POD has two layers of switches. Switches in the upper layer (i.e., aggregation layer) are
connected to core switches using their ports. The remaining ports are used to connect to switches at
the lower layer (i.e., edge layer).
– Fat trees offer more flexibility and can be more cost effective compared to tree topologies
Continue…
• Spine-and-Leaf Topology:
– Spine-and-leaf topology is a modern network topology that has revolutionized data center layouts by
simplifying scalability and enhancing performance.
– It has two layers of switches: spine and leaf layer switches.
– Traditional three-layer network configurations consist of servers connected to access switches, which
are then connected to what are called aggregate switches. These aggregate switches provide
redundancy for the access switches. The aggregate switches then connect traffic to the core
switches, which can communicate with each other.
– This setup has two major issues: it is difficult to scale, and moving traffic between so many different
devices results in a lot of latency.
– Spine and leaf architecture solves this by eliminating an entire layer and connecting every leaf to
every component of the spine, meaning packets never take more than two hops.
– Reduced hop count reduces latency and improves overall performance
Continue…
• Hyperconverged Infrastructure(HCI):
– It combines storage, computing, and networking into a single system or cluster.
– Designed to reduce data center complexity. These systems make use of hypervisor for
virtualized computing and networking and for software-defined storage.
– All the components in Hyperconverged Infrastructure are managed centrally by the
defined software.
Continue…
• In summary, scalability and elasticity are critical for achieving a responsive, cost-effective, and
efficient infrastructure in cloud data centers. They provide the flexibility needed to handle varying
workloads and ensure optimal resource utilization in dynamic and ever-changing environments.
Differences between scalability and
elasticity
Infrastructure as Code (IaC) and
automation tools
• Infrastructure as Code (IaC) and automation tools play a crucial role in
managing and provisioning resources in cloud data centers efficiently.
They help streamline the deployment process, ensure consistency, and
enable version control.
• Infrastructure as Code (IaC):
– Infrastructure as Code (IaC) is a method of managing and provisioning IT
infrastructure using code rather than manual configuration.
– It allows teams to automate the setup and management of their
infrastructure, making it more efficient and consistent. This is particularly
useful in the DevOps environment, where teams constantly update and deploy
software.
– Manual infrastructure management is time-consuming and prone to error—
especially when we manage applications at scale. Infrastructure as code lets
us define our infrastructure's desired state without including all the steps to
get to that state. It automates infrastructure management so developers can
focus on building and improving applications instead of managing
environments. Organizations use infrastructure as code to control costs,
reduce risks, and respond with speed to new business opportunities.
Continue…
Continue…
• How does infrastructure as code work?
– An infrastructure architecture contains resources such as servers, networking,
operating systems, and storage. IaC controls virtualized resources by treating
configuration files like source code files. we can use it to manage
infrastructure in a codified, repeatable way.
– IaC configuration management tools use different language specifications. We
can develop IaC similar to application code in Python or Java. We also write
the IaC in an integrated development environment (IDE) with built-in error
checking. And we can maintain it under source control with commits at each
code change.
• Approaches to IaC
– There are two different approaches to infrastructure as code.
– Declarative (we define the desired state of the final solution)
• also known as the functional approach
• This method involves defining the desired state of the infrastructure, but not the steps to
get there. The IaC tooling then automatically makes the necessary changes to achieve
this state.
Continue…
• Puppet:
– Puppet is an open-source cloud automation tool .
– It helps automate the provisioning, configuration, and management of servers.
– Configuration management tool using its own declarative language.
– Uses a client-server model for managing infrastructure nodes.
– Scalable and suitable for complex environments.
• Chef:
– Chef is an open-source cloud automation platform .
– It streamlines the configuration and management of infrastructure across networks.
– Configuration management tool using a Ruby-based DSL.
– Utilizes a client-server model or a standalone mode.
– Supports the creation of reusable configurations (cookbooks).
• Jenkins:
– open-source cloud automation server
– Automation server for building, testing, and deploying software.
– Integrates with various plugins and supports CI/CD pipelines.
– Provides a web-based interface for easy configuration.
Continue…
• SaltStack:
– Uses a master-minion architecture for configuration management.
– Employs a YAML-based language for configuration files.
– Known for speed and scalability in managing large infrastructure.
• Nomad:
– Orchestrator for deploying and managing applications.
– Developed by HashiCorp, part of the HashiCorp ecosystem.
– Focuses on simplicity and ease of use.
• These tools, whether IaC or automation tools, contribute to the broader goal of
achieving efficient, scalable, and consistent infrastructure management in both on-
premises and cloud environments. The choice of tools often depends on specific
use cases, organizational preferences, and the underlying infrastructure.
References
• https://aws.amazon.com/what-is/data-center/
• https://www.geeksforgeeks.org/what-is-data-center-in-cloud-computing/
• https://www.checkpoint.com/cyber-hub/cyber-security/what-is-data-
center/
• https://www.cisco.com/c/en/us/solutions/data-center-
virtualization/what-is-a-data-center.html
• https://prasenjitmanna.com/writing/2022-02-26-dc-topologies/
• https://www.geeksforgeeks.org/what-is-hyperconvergence/
• https://community.fs.com/encyclopedia/overlay-network.html
• https://www.netapp.com/data-storage/what-is-data-center-automation/
• https://www.geeksforgeeks.org/what-is-infrastructure-as-code-iac/
• https://www.tpointtech.com/scaling-in-cloud-computing
• https://k21academy.com/terraform-iac/infrastructure-as-code-iac/
• Cloud and Data Center Technology, Atul Prakashan.