0% found this document useful (0 votes)

409 views30 pages

Capacity Planning For Active Directory Domain Services

This document provides guidance on capacity planning for Active Directory Domain Services (AD DS). It discusses goals of capacity planning, baseline requirements, and a three-step process for capacity planning. The document evaluates capacity needs for RAM, network bandwidth, storage, processing, and virtualization considerations. It also includes appendices on CPU sizing criteria and fundamentals of storage and operating systems.

Uploaded by

cladic4366

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

409 views30 pages

Capacity Planning For Active Directory Domain Services

Uploaded by

cladic4366

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 30

Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Capacity Planning for Active Directory Domain Services

This topic is originally written by Ken Brumfield, Senior Premier Field Engineer at Microsoft. This version on the TechNet Wiki allows users to supplement the content
with information based on their own experiences and share other valuable insights with the rest of the community. Effort will be made to keep the two versions in
sync, but there is no guarantee that the two versions will remain identical. If you need to reference a version that is supported by Microsoft, refer to the TechNet library
version.

This topic provides recommendations for capacity planning for Active Directory Domain Services (AD DS).

1 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Table of Contents
Goals of Capacity Planning
Baseline Requirements for Capacity Planning Guidance
Three-Step Process for the Capacity Planning Cycle
Applying the process
Data Collection Summary Tables
New environment
High-Level Evaluation Criteria
Planning
RAM
Evaluating
Virtualization Considerations for RAM
Calculation Summary Example
Network
Evaluating
Bandwidth Needs
Virtualization Considerations for Network Bandwidth
Calculation Summary Example
Storage
Sizing
Evaluating for Storage
Virtualization Considerations for Storage
Calculation Summary Example
Storage Performance
Evaluating Performance of Storage
Evaluating the results
Virtualization Considerations for Performance
Calculation Summary Example
Processing
Evaluating Active Directory Processor Usage
Introduction
Target Site Behavior Profile
Calculating CPU demands
When to Tune LDAP Weights
Virtualization Considerations for Processing
Calculation Summary Example
Cross-Trust Client Authentication Load for NTLM
Evaluating Cross-Trust Client Authentication Load
Planning
Virtualization Considerations
Calculation Summary Example
Monitoring For Compliance With Capacity Planning Goals
Appendix A: CPU Sizing Criteria
Important Concepts
Definitions
Thread-Level Parallelism
Data-Level Parallelism
CPU Speed vs. Multiple-Core Considerations
Response Time/How the System Busyness Impacts Performance
Applying the concepts to capacity planning
Appendix B: Considerations Regarding Different Processor Speeds, and the
Effect of Processor Power Management on Processor Speeds
Appendix C: Fundamentals Regarding the Operating System Interacting with
Storage
Operating System Architecture Considerations
Introducing simple storage subsystems
Introducing RAID

2 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

↑ Back to top

Goals of Capacity Planning

Capacity planning is not the same as troubleshooting performance incidents. They are closely related, but quite different. The goals of capacity planning are:
Properly implement and operate an environment
Minimize the time spent troubleshooting performance issues.

In capacity planning, an organization might have a baseline target of 40% processor utilization during peak periods in order to meet client performance requirements and accommodate
the time necessary to upgrade the hardware in the datacenter. Whereas, to be notified of abnormal performance incidents, a monitoring alert threshold might be set at 90% over a 5
minute interval.

The difference is that when a capacity management threshold is continually exceeded (a one-time event is not a concern), adding capacity (that is, adding in more or faster processors)
would be a solution or scaling the service across multiple servers would be a solution. Performance alert thresholds indicate that client experience is currently suffering and immediate
steps are needed to address the issue.

As an analogy: capacity management is about preventing a car accident (defensive driving, making sure the brakes are working properly, and so on) whereas performance
troubleshooting is what the police, fire department, and emergency medical professionals do after an accident. This is about “defensive driving,” Active Directory-style.

Over the last several years, capacity planning guidance for scale-up systems has changed dramatically. The following changes in system architectures have challenged fundamental
assumptions about designing and scaling a service:
64-bit server platforms
Virtualization
Increased attention to power consumption
SSD storage
Cloud scenarios

Additionally, the approach is shifting from a server-based capacity planning exercise to a service-based capacity planning exercise. Active Directory Domain Services (AD DS), a mature
distributed service that many Microsoft and third-party products use as a backend, becomes one the most critical products to plan correctly to ensure the necessary capacity for other
applications to run.

Baseline Requirements for Capacity Planning Guidance

Throughout this article, the following baseline requirements are expected:

Readers have read and are familiar with Performance Tuning Guidelines for Windows Server 2012 R2 .
The Windows Server platform is an x64 based architecture. But even if your Active Directory environment is installed on Windows Server 2003 x86 (now nearing
the end of the support lifecycle) and has a DIT less 1.5 GB in size that can easily be held in memory, the guidelines from this article are still applicable.
Capacity planning is a continuous process and you should regularly review how well the environment is meeting expectations.
Optimization will occur over multiple hardware lifecycles as hardware costs change. For example, memory becomes cheaper, the cost per core decreases, or the
price of different storage options change.
Plan for the peak busy period of the day. It is recommended to look at this in either 30 minute or hour intervals. Anything greater may hide the actual peaks
and anything less may be distorted by “transient spikes.”
Plan for growth over the course of the hardware lifecycle for the enterprise. This may include a strategy of upgrading or adding hardware in a staggered
fashion, or a complete refresh every 3 to 5 years. Each will require a “guess” as how much the load on Active Directory will grow. Historical data, if collected, will
help with this assessment.
Plan for fault tolerance. Once an estimate N is derived, plan for scenarios that include N -1, N -2, N - x .
Add in additional servers according to organizational need to ensure that the loss of a single or multiple servers does not exceed maximum peak
capacity estimates.
Also consider that the growth plan and fault tolerance plan need to be integrated. For example, if one DC is required to support the load, but the
estimate is that the load will be doubled in the next year and require two DCs total, there will not be enough capacity to support fault tolerance. The
solution would be to start with three DCs. One could also plan to add the third DC after 3 or 6 months if budgets are tight.
Note
Adding Active Directory-aware applications might have a noticeable impact on the DC load, whether the load is coming from the application servers or
clients.

Three-Step Process for the Capacity Planning Cycle

In capacity planning, first decide what quality of service is needed. For example, a core datacenter supports a higher level of concurrency and requires more consistent experience for
users and consuming applications, which requires greater attention to redundancy and minimizing system and infrastructure bottlenecks. In contrast, a satellite location with a handful of
users does not need the same level of concurrency or fault tolerance. Thus, the satellite office might not need as much attention to optimizing the underlying hardware and
infrastructure, which may lead to cost savings. All recommendations and guidance herein are for optimal performance, and can be selectively relaxed for scenarios with less demanding
requirements.

The next question is: virtualized or physical? From a capacity planning perspective, there is no right or wrong answer; there is only a different set of variables to work with. Virtualization
scenarios come down to one of two options:

“Direct mapping” with one guest per host (where virtualization exists solely to abstract the physical hardware from the server)

3 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

“Shared host”

Testing and production scenarios indicate that the “direct mapping” scenario can be treated identically to a physical host. “Shared host,” however, introduces a number of considerations
spelled out in more detail later. The “shared host” scenario means that AD DS is also competing for resources, and there are penalties and tuning considerations for doing so.
With these considerations in mind, the capacity planning cycle is an iterative three-step process:
1.   Measure the existing environment, determine where the system bottlenecks currently are, and get environmental basics necessary to plan the amount of capacity needed.
2.   Determine the hardware needed according to the criteria outlined in step 1.
3.   Monitor and validate that the infrastructure as implemented is operating within specifications. Some data collected in this step becomes the baseline for the next cycle of
capacity planning.

Applying the process

To optimize performance, ensure these major components are correctly selected and tuned to the application loads:
1. Memory
2. Network
3. Storage
4. Processor
5. Net Logon

The basic storage requirements of AD DS and the general behavior of well written client software allow environments with as many as 10,000 to 20,000 users to forego
heavy investment in capacity planning with regards to physical hardware, as almost any modern server class system will handle the load. That said, the following table
summarizes how to evaluate an existing environment in order to select the right hardware. Each component is analyzed in detail in subsequent sections to help AD DS
administrators evaluate their infrastructure using baseline recommendations and environment-specific principals.

In general:

Any sizing based on current data will only be accurate for the current environment.
For any estimates, expect demand to grow over the lifecycle of the hardware.
Determine whether to oversize today and grow into the larger environment, or add capacity over the lifecycle.
For virtualization, all the same capacity planning principals and methodologies apply, except that the overhead of the virtualization needs to be added to
anything domain related
Capacity planning, like anything that attempts to predict, is NOT an accurate science. Do not expect things to calculate perfectly and with 100% accuracy. The
guidance here is the leanest recommendation; add in capacity for additional safety and continuously validate that the environment remains on target.

Data Collection Summary Tables

New environment

Component Es mates

Storage/Database Size 40KB to 60KB for each user

RAM Database Size

Base operating system recommendations

Third-party applications

Network 1 Gb

CPU 1000 concurrent users for each core

High-Level Evaluation Criteria

Component Evalua on Criteria Planning Considera ons

Storage/Database The section entitled “To activate logging of disk space that is
Size freed by defragmentation” in Storage Limits

Storage/ Database LogicalDisk(<NTDS Database Drive>)\Avg Disk Storage has two concerns to address
Performance sec/Read, LogicalDisk(<NTDS Database Drive>)\Avg Space available, which with the size of today’s spindle based and SSD
Disk sec/Write, LogicalDisk(<NTDS Database based storage is irrelevant for most AD environments.
Drive>)\Avg Disk sec/Transfer Input/Output (IO) Operations available – In many environments, this is
LogicalDisk(<NTDS Database Drive>)\Reads/sec, often overlooked. But it is important to evaluate only environments
LogicalDisk(<NTDS Database Drive>)\Writes/sec, where there is not enough RAM to load the entire NTDS Database into
LogicalDisk(<NTDS Database memory.
Drive>)\Transfers/sec Storage can be a complex topic and should involve hardware vendor expertise
for proper sizing. Particularly with more complex scenarios such as SAN, NAS,
and iSCSI scenarios. However, in general, cost per Gigabyte of storage is often

4 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

in direct opposition to cost per IO:

RAID 5 has lower cost per Gigabyte than Raid 1, but Raid 1 has lower
cost per IO
Spindle-based hard drives have lower cost per Gigabyte, but SSDs
have a lower cost per IO
After a restart of the computer or the Active Directory Domain Services
service, the Extensible Storage Engine (ESE) cache is empty and performance
will be disk bound while the cache warms.
In most environments AD is read intensive IO in a random pattern to disks,
negating much of the benefit of caching and read optimization strategies.
Plus, AD has a way larger cache in memory than most storage system caches.

RAM Database size Storage is the slowest component in a computer. The more that can
be resident in RAM, the less it is necessary to go to disk.
Base operating system recommendations
Ensure enough RAM is allocated to store the operating system, Agents
Third-party applications (antivirus, backup, monitoring), NTDS Database and growth over time.
For environments where maximizing the amount of RAM is not cost
effective (such as a satellite locations) or not feasible (DIT is too large),
reference the Storage section to ensure that storage is properly sized.

Network “Network Interface(*)\Bytes Received/sec” In general, traffic sent from a DC far exceeds traffic sent to a DC.
As a switched Ethernet connection is full-duplex, inbound and
“Network Interface(*)\Bytes Sent/sec” outbound network traffic need to be sized independently.
Consolidating the number of DCs will increase the amount of
bandwidth used to send responses back to client requests for each
DC, but will be close enough to linear for the site as a whole.
If removing satellite location DCs, don’t forget to add the bandwidth
for the satellite DC into the hub DCs as well as use that to evaluate
how much WAN traffic there will be.

CPU “Logical Disk(<NTDS Database Drive>)\Avg Disk sec/Read” After eliminating storage as a bottleneck, address the amount of
compute power needed.
“Process(lsass)\% Processor Time” While not perfectly linear, the number of processor cores consumed
across all servers within a specific scope (such as a site) can be used
to gauge how many processors are necessary to support the total
client load. Add the minimum necessary to maintain the current level
of service across all the systems within the scope.
Changes in processor speed, including power management related
changes, impact numbers derived from the current environment.
Generally, it is impossible to precisely evaluate how going from a 2.5
GHz processor to a 3 GHz processor will reduce the number of CPUs
needed.

NETLOGON “Netlogon(*)\Semaphore Acquires” Net Logon Secure Channel/MaxConcurrentAPI only affects

environments with NTLM authentications and/or PAC Validation. PAC
“Netlogon(*)\Semaphore Timeouts” Validation is on by default in operating system versions before
Windows Server 2008. This is a client setting, so the DCs will be
“Netlogon(*)\Average Semaphore Hold Time”
impacted until this is turned off on all client systems.
Environments with significant cross trust authentication, which
includes intraforest trusts, have greater risk if not sized properly.
Server consolidations will increase concurrency of cross-trust
authentication.
Surges need to be accommodated, such as cluster fail-overs, as users
re-authenticate en masse to the new cluster node.
Individual client systems (such as a cluster) might need tuning too.

↑ Back to top

Planning
For a long time, the community’s recommendation for sizing AD DS has been to “put in as much RAM as the database size.” For the most part, that recommendation is
all that most environments needed to be concerned about. But the ecosystem consuming AD DS has gotten much bigger, as have the AD DS environments
themselves, since its introduction in 1999. Although the increase in compute power and the switch from x86 architectures to x64 architectures has made the subtler
aspects of sizing for performance irrelevant to a larger set of customers running AD DS on physical hardware, the growth of virtualization has reintroduced the tuning
concerns to a larger audience than before.

5 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

The following guidance is thus about how to determine and plan for the demands of Active Directory as a service regardless of whether it is deployed in a physical, a
virtual/physical mix, or a purely virtualized scenario. As such, we will break down the evaluation to each of the four main components: storage, memory, network, and
processor. In short, in order to maximize performance on AD DS, the goal is to get as close to processor bound as possible.

↑ Back to top

RAM
Simply, the more that can be cached in RAM, the less it is necessary to go to disk. To maximize the scalability of the server the minimum amount of RAM should be the
sum of the current database size, the total SYSVOL size, the operating system recommended amount, and the vendor recommendations for the agents (antivirus,
monitoring, backup, and so on). An additional amount should be added to accommodate growth over the lifetime of the server. This will be environmentally subjective
based on estimates of database growth based on environmental changes.

For environments where maximizing the amount of RAM is not cost effective (such as a satellite locations) or not feasible (DIT is too large), reference the Storage
section to ensure that storage is properly designed.

A corollary that comes up in the general context in sizing memory is sizing of the page file. In the same context as everything else memory related, the goal is to
minimize going to the much slower disk. Thus the question should go from, “how should the page file be sized?” to “how much RAM is needed minimize paging?” The
answer to the latter question is outlined in the rest of this section. This leaves most of the discussion for sizing the page file to the realm of general operating system
recommendations and the need to configure the system for memory dumps, which are unrelated to AD DS performance.

↑ Back to top

Evaluating
The amount of RAM that a domain controller (DC) needs is actually a complex exercise for these reasons:

High potential for error when trying to use an existing system to gauge how much RAM is needed as LSASS will trim under memory pressure conditions,
artificially deflating the need.
The subjective fact that an individual DC only needs to cache what is “interesting” to its clients. This means that the data that needs to be cached on a DC in a
site with only an Exchange server will be very different than the data that needs to be cached on a DC that only authenticates users.
The labor to evaluate RAM for each DC on a case-by-case basis is prohibitive and changes as the environment changes.
The criteria behind the recommendation will help to make informed decisions:
The more that can be cached in RAM, the less it is necessary to go to disk.
Storage is by far the slowest component of a computer. Access to data on spindle-based and SSD storage media is on the order of 1,000,000x slower than
access to data in RAM.

Thus, in order to maximize the scalability of the server, the minimum amount of RAM is the sum of the current database size, the total SYSVOL size, the operating
system recommended amount, and the vendor recommendations for the agents (antivirus, monitoring, backup, and so on). Add additional amounts to accommodate
growth over the lifetime of the server. This will be environmentally subjective based on estimates of database growth. However, for satellite locations with a small set
of end users, these requirements can be relaxed as these sites will not need to cache as much to service most of the requests.
For environments where maximizing the amount of RAM is not cost effective (such as a satellite locations) or not feasible (DIT is too large), reference the Storage
section to ensure that storage is properly sized.
Note: A corollary while sizing memory is sizing of the page file. Because the goal is to minimize going to the much slower disk, the question goes from “how should
the page file be sized?” to “how much RAM is needed to minimize paging?” The answer to the latter question is outlined in the rest of this section. This leaves most of
the discussion for sizing the page file to the realm of general operating system recommendations and the need to configure the system for memory dumps, which are
unrelated to AD DS performance.

↑ Back to top

Virtualization Considerations for RAM

Avoid memory overcommit at the host. The fundamental goal behind optimizing the amount of RAM is to minimize the amount of time spent going to disk. In
virtualization scenarios, the concept of memory overcommit exists where more RAM is allocated to the guests then exists on the physical machine. This in and of itself
is not a problem. It becomes a problem when the total memory actively used by all the guests exceeds the amount of RAM on the host and the underlying host starts
paging. Performance becomes disk-bound in cases where the domain controller is going to the NTDS.DIT to get data, or the domain controller is going to the page
file to get data, or the host is going to disk to get data that the guest thinks is in RAM.

↑ Back to top

Calculation Summary Example

6 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Component Es mated Memory (example)

Base Operating System Recommended RAM (Windows Server 2008) 2 GB

LSASS internal tasks 200 MB

Monitoring Agent 100 MB

Antivirus 100 MB

Database (Global Catalog) 8.5 GB are you sure ???

Cushion for backup to run, administrators to log on without impact 1 GB

Total 12 GB

RECOMMENDED: 16 GB

Over time, the assumption can be made that more data will be added to the database and the server will probably be in production for 3 to 5 years. Based on an
estimate of growth of 33%, 16 GB would be a reasonable amount of RAM to put in a physical server. In a virtual machine, given the ease with which settings can be
modified and RAM can be added to the VM, starting at 12 GB with the plan to monitor and upgrade in the future is reasonable.

↑ Back to top

Network
Evaluating
This section is less about evaluating the demands regarding replication traffic, which is focused on traffic traversing the WAN and is thoroughly covered in Active
Directory Replication Traffic , than it is about evaluating total bandwidth and network capacity needed, inclusive of client queries, Group Policy applications, and so
on. For existing environments, this can be collected by using performance counters “Network Interface(*)\Bytes Received/sec,” and “Network Interface(*)\Bytes
Sent/sec.”, Sample intervals for Network Interface counters in either 15, 30, or 60 minutes. Anything less will generally be too volatile for good measurements; anything
greater will smooth out daily peeks excessively.

Note
Generally, the majority of network traffic on a DC is outbound as the DC responds to client queries. This is the reason for the focus on outbound traffic, though it is
recommended to evaluate each environment for inbound traffic also. The same approaches can be used to address and review inbound network traffic requirements.
For more information, see Knowledge Base article 929851: The default dynamic port range for TCP/IP has changed in Windows Vista and in Windows Server 2008 .

↑ Back to top

Bandwidth Needs
Planning for network scalability covers two distinct categories: the amount of traffic and the CPU load from the network traffic. Each of these scenarios is straight-
forward compared to some of the other topics in this article.

In evaluating how much traffic must be supported, there are two unique categories of capacity planning for AD DS in terms of network traffic. The first is replication
traffic that traverses between domain controllers and is covered thoroughly in the reference Active Directory Replication Traffic and is still relevant to current
versions of AD DS. The second is the intrasite client-to-server traffic. One of the simpler scenarios to plan for, intrasite traffic predominantly receives small requests
from clients relative to the large amounts of data sent back to the clients. 100 Mb will generally be adequate in environments up to 5,000 users per server, in a site.
Using a 1 Gb network adapter and Receive Side Scaling (RSS) support is recommended for anything above 5,000 users. To validate this scenario, particularly in the case
of server consolidation scenarios, look at Network Interface(*)\Bytes/sec across all the DCs in a site, add them together, and divide by the target number of domain
controllers to ensure that there is adequate capacity. The easiest way to do this is to use the “Stacked Area” view in Windows Reliability and Performance Monitor
(formerly known as Perfmon), making sure all of the counters are scaled the same.

Consider the following example (also known as, a really, really complex way to validate that the general rule is applicable to a specific environment). The following
assumptions are made:

The goal is to reduce the footprint to as few servers as possible. Ideally, one server will carry the load and an additional server is deployed for redundancy (N+1
scenario).
In this scenario, the current network adapter supports only 100 Mb and is in a switched environment.

7 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

The maximum target network bandwidth utilization is 60% in an N scenario (loss of a DC).

Each server has about 10,000 clients connected to it.

Knowledge gained from the data in the chart (NetworkInterface(*)\Bytes Sent/sec):

1. The business day starts ramping up around 5:30 and winds down at 7:00 PM
2. The peak busiest period is from 8:00 AM to 8:15, with greater than 25 Bytes sent/sec on the busiest DC. (Note: all performance data is historical. So the peak
data point at 8:15 indicates the load from 8:00 to 8:15).
3. There are spikes before 4:00 AM, with more than 20 Bytes sent/sec on the busiest DC, which could indicate either load from different time zones or background
infrastructure activity, such as backups. Since the peak at 8:00 AM exceeds this activity, it is not relevant.
4. There are 5 Domain Controllers in the site.
5. The max load is about 5.5 MB/s per DC, which represents 44% of the 100 Mb connection. Using this data, it can be estimated that the total bandwidth needed
between 8:00 AM and 8:15 AM is 28 MB/s.
Note
Be careful with the fact that Network Interface sent/receive counters are in bytes and network bandwidth is measured in bits. 100 Mb / 8 = 12.5 MB, 1 Gb / 8 =
128 MB.

Conclusions:

1. This current environment does meet the N+1 level of fault tolerance at 60% target utilization. Taking one system offline will shift the bandwidth per server from
about 5.5 MB/s (44%) to about 7 MB/s (56%).
2. Based on the previously stated goal of consolidating to one server, this both exceeds the maximum target utilization and theoretically the possible utilization of
a 100 Mb connection.
3. With a 1 Gb connection this will represent 22% of the total capacity.
4. Under normal operating conditions in the N+1 scenario, client load will be relatively evenly distributed at about 14 MB/s per server or 11% of total capacity.
5. To ensure that capacity is adequate during unavailability of a DC, the normal operating targets per server would be about 30% network utilization or 38 MB/s
per server. Failover targets would be 60% network utilization or 72 MB/s per server.

In short, the final deployment of systems must have a 1 Gb network adapter and be connected to a network infrastructure that will support said load. A further note is
that given the amount of network traffic generated, the CPU load from network communications can have a significant impact and limit the maximum scalability of AD
DS. This same process can be used to estimate the amount of inbound communication to the DC. But given the predominance of outbound traffic relative to inbound
traffic, it is an academic exercise for most environments. Ensuring hardware support for RSS is important in environments with greater than 5,000 users per server. For
scenarios with high network traffic, balancing of interrupt load can be a bottleneck. This can be detected by Processor(*)\% Interrupt Time being unevenly distributed
across CPUs. RSS enabled NICs can mitigate this limitation and increase scalability.

Note
A similar approach can be used to estimate the additional capacity necessary when consolidating data centers, or retiring a domain controller in a satellite location.
Simply collect the outbound and inbound traffic to clients and that will be the amount of traffic that will now be present on the WAN links.

In some cases, you might experience more traffic than expected because traffic is slower, such as when certificate checking fails to meet aggressive time-outs on the
WAN. For this reason, WAN sizing and utilization should be an iterative, ongoing process.

↑ Back to top

Virtualization Considerations for Network Bandwidth

It is easy to make recommendations for a physical server: 1 Gb for servers supporting greater than 5000 users. Once multiple guests start sharing an underlying virtual
switch infrastructure, additional attention is necessary to ensure that the host has adequate network bandwidth to support all the guests on the system, and thus
requires the additional rigor. This is nothing more than an extension of ensuring the network infrastructure into the host machine. This is regardless whether the
network is inclusive of the domain controller running as a virtual machine guest on a host with the network traffic going over a virtual switch, or whether connected
directly to a physical switch. The virtual switch is just one more component where the uplink needs to support the amount of data being transmitted. Thus the physical
host physical network adapter linked to the switch should be able to support the DC load plus all other guests sharing the virtual switch connected to the physical
network adapter.

↑ Back to top

Calculation Summary Example

System Peak Bandwidth

Domain controller 1 6.5 MB/s

8 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Domain controller 2 6.25 MB/s

Domain controller 3 6.25 MB/s

Domain controller 4 5.75 MB/s

Domain controller 5 4.75 MB/s

Total 28.5 MB/s

RECOMMENDED: 72 MB/s (28.5 MB/s divided by 40%)

Target System(s) Count Total Bandwidth (From Above)

2 28.5 MB/s

Resulting Normal Behavior 28.5 / 2 = 14.25 MB/s

As always, over time the assumption can be made that client load will increase and this growth should be planned for as best as possible. The recommended amount
to plan for would allow for an estimated growth in network traffic of 50%.

↑ Back to top

Storage
Planning storage constitutes two components:

Capacity, or storage size

Performance

A great amount of time and documentation is spent on planning capacity, leaving performance often completely overlooked. With current hardware costs, most
environments are not large enough that either of these is actually a concern, and the recommendation to “put in as much RAM as the database size” usually covers the
rest, though it may be overkill for satellite locations in larger environments.

Sizing
Evaluating for Storage
Compared to 13 years ago when Active Directory was introduced, a time when 4 GB and 9 GB drives were the most common drive sizes, sizing for Active Directory is
not even a consideration for all but the largest environments. With the smallest available hard drive sizes in the 180GB range, the entire operating system, SYSVOL, and
NTDS.DIT can easily fit on one drive. As such, it is recommended to deprecate heavy investment in this area.

The only recommendation for consideration is to ensure that 110% of the NTDS.DIT size is available in order to enable defrag. Additionally, accommodations for
growth over the life of the hardware should be made.

The first and most important consideration is evaluating how large the NTDS.DIT and SYSVOL will be. These measurements will lead into sizing both fixed disk and
RAM allocation. Due to the (relatively) low cost of these components, the math does not need to be rigorous and precise. Content about how to evaluate this for both
existing and new environments can be found in the Data Storage series of articles. Specifically, refer to the following articles:

For existing environments: The section titled “To activate logging of disk space that is freed by defragmentation” in the article Storage Limits .
For new environments: The article titled Growth Estimates for Active Directory Users and Organizational Units .

Note
The articles are based on data size estimates made at the time of the release of Active Directory in Windows 2000. Use object sizes that reflect the actual size of
objects in your environment.
When reviewing existing environments with multiple domains, there may be variations in database sizes. Where this is true, use the smallest global catalog (GC) and
non-GC sizes.

9 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

The database size can vary between operating system versions. DCs that run earlier operating systems such as Windows Server 2003 have a smaller database size than
a DC that runs a later operating system such as Windows Server 2008 R2, especially when features such Active Directory Recycle Bin or Credential Roaming are
enabled.
Note

For new environments, notice that the estimates in Growth Estimates for Active Directory Users and Organizational Units indicate that 100,000 users (in the
same domain) consume about 450 MB of space. Please note that the attributes populated can have a huge impact on the total amount. Attributes will be
populated on many objects by both third-party and Microsoft products, including Microsoft Exchange Server and Lync. An evaluation based on the portfolio of
the products in the environment is preferred, but the exercise of detailing out the math and testing for precise estimates for all but the largest environments
may not actually be worth significant time and effort.

Ensure that 110% of the NTDS.DIT size is available as free space in order to enable offline defrag, and plan for growth over a 3 to 5 year hardware lifespan.
Given how cheap storage is, estimating storage at 300% the size of the DIT as storage allocation is safe to accommodate growth and the potential need for
offline defrag.

↑ Back to top

Virtualization Considerations for Storage

In a scenario where multiple Virtual Disk files are being allocated on a single volume use a fixed sized disk of at least 210% (100% of the DIT + 110% free space) the
size of the DIT to ensure that there is adequate space reserved.

↑ Back to top

Calculation Summary Example

Data Collected From Evalua on Phase

NTDS.DIT size 35 GB

Modifier to allow for offline defrag 2.1

Total Storage Needed 73.5 GB

Note
This storage needed is in addition to the storage needed for SYSVOL, operating system, page file, temporary files, local cached data (such as installer files), and
applications.

↑ Back to top

Storage Performance
Evaluating Performance of Storage
As the slowest component within any computer, storage can have the biggest adverse impact on client experience. For those environments large enough for which the
RAM sizing recommendations are not feasible, the consequences of overlooking planning storage for performance can be devastating. Also, the complexities and
varieties of storage technology further increase the risk of failure as the relevance of long standing best practices of “put operating system, logs, and database” on
separate physical disks is limited in it’s useful scenarios. This is because the long standing best practice is based on the assumption that is that a “disk” is a dedicated
spindle and this allowed IO to be isolated. This assumptions that make this true are no longer relevant with the introduction of:

new storage types and virtualized and shared storage scenarios

shared spindles on a Storage Area Network (SAN)
virtual disk file on a SAN or network-attached storage
Solid State Drives
tiered storage architectures (i.e. SSD storage tier caching larger spindle based storage)

In short, the end goal of all storage performance efforts, regardless of underlying storage architecture and design, is to ensure the needed amount of Input/Output
Operations per Second (IOPS) are available and that those IOPS happen within an acceptable time frame. This section covers how to evaluate what AD DS demands of
the underlying storage in order to ensure storage solutions are properly designed. Given the variability of today’s storage technologies, it is best to work with the
storage vendors to ensure adequate IOPS. For those scenarios with locally attached storage, reference the Appendix C for the basics in how to design traditional local
storage scenarios. This principals are generally applicable to more complex storage tiers and will also help in dialog with the vendors supporting backend storage
solutions.

Given the wide breadth of storage options available, it is recommended to engage the expertise of hardware support teams or vendors to ensure that the
specific solution meets the needs of AD DS. The following numbers are the information that would be provided to the storage specialists.

For environments where the database is too large to be held in RAM, use the performance counters to determine how much IO needs to be supported:

LogicalDisk(*)\Avg Disk sec/Read (for example, if NTDS.dit is stored on the D:/ drive, the full path would be LogicalDisk(D:)\Avg Disk sec/Read)

10 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

LogicalDisk(*)\Avg Disk sec/Write

LogicalDisk(*)\Avg Disk sec/Transfer
LogicalDisk(*)\Reads/sec
LogicalDisk(*)\Writes/sec
LogicalDisk(*)\Transfers/sec

These should be sampled in 15/30/60 minute intervals to benchmark the demands of the current environment.

↑ Back to top

Evaluating the results

Note: the focus is on reads from the database as this is usually the most demanding component, the same logic can be applied to writes to the log file by substituting
LogicalDisk(<NTDS Log>)\Avg Disk sec/Write and LogicalDisk(<NTDS Log)\Writes/sec):

LogicalDisk(<NTDS>)\Avg Disk sec/Read indicates whether or not the current storage is adequately sized. If the results are roughly equal to the Disk Access
Time for the disk type, LogicalDisk(<NTDS>)\Reads/sec is a valid measure. Check the manufacturer specifications for the storage on the back end, but good
ranges for LogicalDisk(<NTDS>)\Avg Disk sec/Read would roughly be:
7200 – 9 to 12.5 milliseconds (ms)
10,000 – 6 to 10 ms
15,000 – 4 to 6 ms
SSD – 1 to 3 ms
NOTE: Recommendations exist stating that storage performance is degraded at 15ms to 20ms (depending on source). The difference between the
above values and the other guidance is that the above values are the normal operating range. The other recommendations are troubleshooting
guidance to identify when client experience significantly degrades and becomes noticeable. Reference Appendix C for a deeper explanation.
LogicalDisk(<NTDS>)\Reads/sec is the amount of IO that is being performed.
If LogicalDisk(<NTDS>)\Avg Disk sec/Read is within the optimal range for the backend storage, LogicalDisk(<NTDS>)\Reads/sec can be used directly to
size the storage.
If LogicalDisk(<NTDS>)\Avg Disk sec/Read is not within the optimal range for the backend storage, additional IO is needed according to the following
formula:

(LogicalDisk(NTDS)\Avg Disk sec/Read)/(Physical Media Disk Access Time)(LogicalDisk(NTDS)\Avg Disk sec/Read)

Considerations:

Note that if the server is configured with a sub-optimal amount of RAM, these values will be inaccurate for planning purposes. They will be erroneously on the
high side and can still be used as a worst case scenario.
Adding/Optimizing RAM specifically will drive a decrease in the amount of read IO (LogicalDisk(<NTDS>)\Reads/Sec. This means the storage solution may not
have to be as robust as initially calculated. Unfortunately, anything more specific then this general statement is environmentally dependent on client load and
general guidance cannot be provided. The best option is to adjust storage sizing after optimizing RAM.

↑ Back to top

Virtualization Considerations for Performance

Similar to all of the preceding virtualization discussions, the key here is to ensure that the underlying shared infrastructure can support the DC load plus the other
resources using the underlying shared media and all pathways to it. This is true whether a physical domain controller is sharing the same underlying media on a SAN,
NAS, or iSCSI infrastructure as other servers or applications, whether it is a guest using pass through access to a SAN, NAS, or iSCSI infrastructure that shares the
underlying media, or if the guest is using a virtual disk file that resides on shared media locally or a SAN, NAS, or iSCSI infrastructure. The planning exercise is all about
making sure that the underlying media can support the total load of all consumers.

Also, from a guest perspective, as there are additional code paths that must be traversed, there is a performance impact to having to go through a host to access any
storage. Not surprisingly, storage performance testing indicates that the virtualizing has an impact on throughput that is subjective to the processor utilization of the
host system (see Appendix A: CPU Sizing Criteria), which is obviously influenced by the resources of the host demanded by the guest. This contributes to the
virtualization considerations regarding processing needs in a virtualized scenario (see Virtualization Considerations for Processing).

Making this more complex is that there are a variety of different storage options that are available that all have different performance impacts. As a safe estimate when
migrating from physical to virtual, use a multiplier of 1.10 to adjust for different storage options for virtualized guests on Hyper-V, such as pass-through storage, SCSI
Adapter, or IDE. The adjustments that need to be made when transferring between the different storage scenarios are irrelevant as to whether the storage is local,
SAN, NAS, or iSCSI.

↑ Back to top

Calculation Summary Example

Determining the amount of IO needed for a healthy system under normal operating conditions:

LogicalDisk(<NTDS Database Drive>)\Transfers/sec during the peak period 15 minute period

To determine the amount of IO needed for storage where the capacity of the underlying storage is exceeded:
Needed IOPS = (LogicalDisk(< NTDS Database Drive >)\Avg Disk sec/Read/ <Target Avg Disk sec/Read>) * LogicalDisk(< NTDS Database Drive >)\Read/sec

Counter Value

11 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Actual LogicalDisk(< NTDS Database Drive >)\Avg Disk sec/Transfer .02 seconds (20 milliseconds)

Target LogicalDisk(< NTDS Database Drive >)\Avg Disk sec/Transfer .01 seconds

Multiplier for Change in Available IO 0.02 / 0.01 = 2

Value Name Value

LogicalDisk(< NTDS Database Drive >)\Transfers/sec 400

Multiplier for Change in Available IO 2

Total IOPS needed during peak period 800

To determine the rate at which the cache is desired to be warmed:

Determine the maximum acceptable time to warm the cache. It is either the amount of time that it should take to load the entire database from disk, or for
scenarios where the entire database cannot be loaded in RAM, this would be the maximum time to fill RAM.
Determine the size of the database, excluding white space. For more information see, Evaluating Active Directory Storage.
Divide the database size by 8 KB; that will be the total IOs necessary to load the database.
Divide the total IOs by the number of seconds in the defined time frame.

Note that the rate calculated, while accurate, will not be exact because previously loaded pages are evicted if ESE is not configured to have a fixed cache size, and AD
DS by default uses variable cache size.

Data Points to Collect Values

Maximum Acceptable Time to Warm 10 minutes (600 seconds)

Database Size 2 GB

Calcula on Step Formula Result

Calculate size of database in pages (2GB * 1024 * 1024) = Size of database in KB 2,097,152 KB

Calculate number of pages in database 2,097,152 KB / 8 KB = Number of Pages 262,144 pages

Calculate IOPS necessary to fully warm the cache 262,144 pages / 600 seconds = IOPS needed 437 IOPS

↑ Back to top

Processing
Evaluating Active Directory Processor Usage
For most environments, after storage, RAM, and networking are properly tuned as described in the Planning section, managing the amount of processing capacity will
be the component that deserves the most attention. There are two challenges in evaluating CPU capacity needed:

Whether or not the applications in the environment are being well-behaved in a shared services infrastructure, and is discussed in the section titled “Tracking
Expensive and Inefficient Searches” in the article Creating More Efficient Microsoft Active Directory-Enabled Applications or migrating away from down-level
SAM calls to LDAP calls.

12 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

In larger environments, the reason this is important is that poorly coded applications can drive volatility in CPU load, “steal” an inordinate amount of CPU time
from other applications, artificially drive up capacity needs, and unevenly distribute load against the DCs.
As AD DS is a distributed environment with a large variety of potential clients, estimating the expense of a “single client” is environmentally subjective due to
usage patterns and the type or quantity of applications leveraging AD DS. In short, much like the networking section, for broad applicability, this is better
approached from the perspective of evaluating the total capacity needed in the environment.

For existing environments, as storage sizing was discussed previously, the assumption is made that storage is now properly sized and thus the data regarding
processor load is valid. To reiterate, it is critical to ensure that the bottleneck in the system is not the performance of the storage. When a bottleneck exists and the
processor is waiting, there are idle states that will go away once the bottleneck is removed. As processor wait states are removed, by definition, CPU utilization
increases as it no longer has to wait on the data. Thus, collect performance counters “Logical Disk(<NTDS Database Drive>)\Avg Disk sec/Read” and “Process(lsass)\%
Processor Time”. The data in “Process(lsass)\% Processor Time” will be artificially low if “Logical Disk(<NTDS Database Drive>)\Avg Disk sec/Read” exceeds 10 to 15 ms,
which is a general threshold that Microsoft support uses for troubleshooting storage-related performance issues. As before, it is recommended that sample intervals
be either 15, 30, or 60 minutes. Anything less will generally be too volatile for good measurements; anything greater will smooth out daily peeks excessively.

↑ Back to top

Introduction
In order to plan capacity planning for domain controllers, processing power requires the most attention and understanding. When sizing systems to ensure maximum
performance, there is always a component that is the bottleneck and in a properly sized Domain Controller this will be the processor.
Similar to the networking section where the demand of the environment is reviewed on a site-by-site basis, the same must be done for the compute capacity
demanded. Unlike the networking section, where the available networking technologies far exceed the normal demand, pay more attention to sizing CPU capacity. As
any environment of even moderate size; anything over a few thousand concurrent users can put significant load on the CPU.
Unfortunately, due to the huge variability of client applications that leverage AD, a general estimate of users per CPU is woefully inapplicable to all environments.
Specifically, the compute demands are subject to user behavior and application profile. Therefore, each environment needs to be individually sized.

↑ Back to top

Target Site Behavior Profile

As mentioned previously, when planning capacity for an entire site, the goal is to target a design with an N+1 capacity design, such that failure of one system during
the peak period will allow for continuation of service at a reasonable level of quality. That means that in an “N” scenario, load across all the boxes should be less than
100% (better yet, less than 80%) during the peak periods.

Additionally, if the applications and clients in the site are using best practices for locating domain controllers (that is, using the DsGetDcName function ), the clients
should be relatively evenly distributed with minor transient spikes due to any number of factors.

In the next example, the following assumptions are made:

Each of the five DCs in the site has the same quantity (4) of CPUs.
Total target CPU usage during business hours is 40% under normal operating conditions (“N+1”) and 60% otherwise (“N”). During non-business hours, the
target CPU usage is 80% because backup software and other maintenance are expected to consume all available resources.

13 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Analyzing the data in the chart (Processor Information (_Total)\% Processor Utility) for each of the DCs:

For the most part, the load is relatively evenly distributed which is what would be expected when clients use DC locator and have well written searches.
There are a number of 5-minute spikes of 10%, with some as large as 20%. Generally, unless they cause the capacity plan target to be exceeded, investigating
these is not worthwhile.
The peak period for all systems is between about 8:00 AM and 9:15 AM. With the smooth transition from about 5:00 AM through about 5:00 PM, this is
generally indicative of the business cycle. The more randomized spikes of CPU usage on a box-by-box scenario between 5:00 PM and 4:00 AM would be outside
of the capacity planning concerns.
Note
On a well-managed system, said spikes are might be backup software running, full system antivirus scans, hardware or software inventory, software or patch
deployment, and so on. Because they fall outside the peak user business cycle, the targets are not exceeded.

As each system is about 40% and all systems have the same numbers of CPUs, should one fail or be taken offline, the remaining systems would run at an
estimated 53% (System D’s 40% load is evenly split and added to System A and System C’s existing 40% load). For a number of reasons, this linear assumption is
NOT perfectly accurate, but provides enough accuracy to gauge.

Alternate scenario – Two domain controllers running at 40%: One domain controller fails, estimated CPU on the remaining one would be an estimated 80%. This
far exceeds the thresholds outlined above for capacity plan and also starts to severely limit the amount of head room for the 10% to 20% seen in the load
profile above, which means that the spikes would drive the DC to 90% to 100% during the “N” scenario and definitely degrade responsiveness.

↑ Back to top

Calculating CPU demands

The “Process\% Processor Time” performance object counter sums the total amount of time that all of the threads of an application spend on the CPU and divides by
the total amount of system time that has passed. The effect of this is that a multi-threaded application on a multi-CPU system can exceed 100% CPU time, and would
be interpreted VERY differently than “Processor Information\% Processor Utility”. In practice the “Process(lsass)\% Processor Time” can be viewed as the count of CPUs
running at 100% that are necessary to support the process’s demands. A value of 200% means that 2 CPUs, each at 100%, are needed to support the full AD DS load.
Although a CPU running at 100% capacity is the most cost efficient from the perspective of money spent on CPUs and power and energy consumption, for a number
of reasons detailed in Appendix A, better responsiveness on a multi-threaded system occurs when the system is not running at 100%.

To accommodate transient spikes in client load, it is recommended to target a peak period CPU of between 40% and 60% of system capacity. Working with the
example above, that would mean that between 3.33 (60% target) and 5 (40% target) CPUs would be needed for the AD DS (lsass process) load. Additional capacity
should be added in according to the demands of the base operating system and other agents required (such as antivirus, backup, monitoring, and so on). Although
the impact of agents needs to be evaluated on a per environment basis, an estimate of between 5% and 10% of a single CPU can be made. In the current example, this
would suggest that between 3.43 (60% target) and 5.1 (40% target) CPUs are necessary during peak periods.

14 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

The easiest way to do this is to use the “Stacked Area” view in Windows Reliability and Performance Monitor (perfmon), making sure all of the counters are scaled the
same.

Assumptions:

Goal is to reduce footprint to as few servers as possible. Ideally, 1 server would carry the load and an additional server added for redundancy (N+1 scenario).

Knowledge gained from the data in the chart (Process(lsass)\% Processor Time):

The business day starts ramping up around 7:00 and decreases at 5:00 PM.
The peak busiest period is from 9:30 AM to 11:00 AM. (Note: all performance data is historical. The peak data point at 9:15 indicates the load from 9:00 to 9:15).
There are spikes before 7:00 AM which could indicate either load from different time zones or background infrastructure activity, such as backups. Because the
peak at 9:30 AM exceeds this activity, it is not relevant.
There are 3 domain controllers in the site.

At max load, lsass consumes about 485% of 1 CPU, or 4.85 CPUs running at 100%. As per the math earlier, this means the site needs about 12.25 CPUs for AD DS. Add
in the above suggestions of 5% to 10% for background processes and that means replacing the server today would need approximately 12.30 to 12.35 CPUs to
support the same load. An environmental estimate for growth now needs to be factored in.

↑ Back to top

When to Tune LDAP Weights

There are several scenarios where tuning LdapSrvWeight should be considered. Within the context of capacity planning, this would be done when the application or
user loads are not evenly balanced, or the underlying systems are not evenly balanced in terms of capability. Reasons to do so beyond capacity planning are outside of
the scope of this article.

There are two common reasons to tune LDAP Weights:

The PDC emulator is an example that affects every environment for which user or application load behavior is not evenly distributed. As certain tools and
actions target the PDC emulator, such as the Group Policy management tools, second attempts in the case of authentication failures, trust establishment, and so
on, CPU resources on the PDC emulator may be more heavily demanded than elsewhere in the site.
It is only useful to tune this if there is a noticable difference in CPU utilization in order to reduce the load on the PDC emulator and increase the load on
other domain controllers will allow a more even distribution of load.
Inthis case, set LDAPSrvWeight between 50 to 75 for the PDC emulator.
Servers with differing counts of CPUs (and speeds) in a site. For example, say there are two 8 core servers and one 4 core servers. The last server has half the
processors of the other two servers. This means that a well distributed client load will increase the average CPU load on the 4 core box to roughly 2x of the 8
core boxes.
For example the two 8 core boxes would be running at 40% and the 4 core box would be running at 80%.
Also, consider the impact of loss of one 8 core box in this scenario, specifically the fact that the 4 core box would now be overloaded.

15 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Example 1 - PDC

Es mated New
U liza on with Defaults New LdapSrvWeight
U liza on

DC1 (PDC emulator) 53% 57 40%

DC2 33% 100 40%

DC3 33% 100 40%

The catch here is that if the PDC emulator role is transferred or seized, particularly to another domain controller in the site, there will be a dramatic increase on the new
PDC emulator.

Using the example from the section Target Site Behavior Profile , an assumption was made that all 3 domain controllers in the site had 4 CPUs. What should happen,
under normal conditions, if one of the domain controllers had 8 CPUs? There would be two domain controllers at 40% utilization and one at 20% utilization. While this
is not bad, there is an opportunity to balance the load a little bit better. Leverage LDAP weights to accomplish this. An example scenario would be:

Example 2 - Differing CPU counts:

Processor Informa on(_Total)\% Processor U lity

New LdapSrvWeight Es mated New U liza on
U liza on with Defaults

4 CPU DC1 40 100 30%

4 CPU DC2 40 100 30%

8 CPU DC3 20 200 30%

Be very careful with these scenarios though. As can be seen above, the math looks really nice and pretty on paper. But throughout this article, planning for an “N+1”
scenario is of paramount importance. The impact of one DC going offline must be calculated for every scenario. In the immediately preceding scenario where the load
distribution is even, in order to ensure a 60% load during an “N” scenario, with the load balanced evenly across all servers, the distribution will be fine as the ratios stay
consistent. Looking at the PDC emulator tuning scenario, and in general any scenario where user or application load is unbalanced, the effect is very different:

Es mated New
Tuned U liza on New LdapSrvWeight
U liza on

DC1 (PDC emulator) 40% 85 47%

DC2 40% 100 53%

DC3 40% 100 53%

↑ Back to top

Virtualization Considerations for Processing

There are two layers of capacity planning that need to be done in a virtualized environment. At the host level, similar to the identification of the business cycle outlined
for the domain controller processing previously, thresholds during the peak period need to be identified. Because the underlying principals are the same for a host
machine scheduling guest threads on the CPU as for getting AD DS threads on the CPU on a physical machine, the same goal of 40% to 60% on the underlying host
are recommended. At the next layer, the guest layer, since the principals of thread scheduling have not changed, the goal within the guest remains in the 40% to 60%
range.

In a direct mapped scenario, one guest per host, all the capacity planning done to this point needs to be added to the requirements (RAM, disk, network) of the
underlying host operating system. In a shared host scenario, testing indicates that there is 10% impact on the efficiency of the underlying processors. That means if a
site needs 10 CPUs at a target of 40%, the recommended amount of virtual CPUs to allocate across all the “N” guests would be 11. In a site with a mixed distribution of
physical servers and virtual servers, the modifier only applies to the VMs. For example, if a site has an “N+1” scenario, 1 physical or direct-mapped server with 10 CPUs

16 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

would be about equivalent to 1 guest with 11 CPUs on a host, with 11 CPUs reserved for the domain controller.

Throughout the analysis and calculation of the CPU quantities necessary to support AD DS load, the numbers of CPUs that map to what can be purchased in terms
physical hardware do not necessarily map cleanly. Virtualization eliminates the need to round up. Virtualization decreases the effort necessary to add compute capacity
to a site, given the ease with which a CPU can be added to a VM. It does not eliminate the need to accurately evaluate the compute power needed so that the
underlying hardware is available when additional CPUs need to be added to the guests. As always, remember to plan and monitor for growth in demand.

↑ Back to top

Calculation Summary Example

System Peak CPU

Domain controller 1 120%

Domain controller 2 147%

Domain controller 3 218%

Total CPUs being used 485%

Target System(s) Count Total Bandwidth (From Above)

CPUs needed at 40% target 4.85/.4 = 12.25

Repeating doue to the importance of this point, remember to plan for growth. Assuming 50% growth over the next 3 years, this environment will need 18.375 CPUs
(12.25 * 1.5) at the three year mark. An alternate plan would be to review after the first year and add in additional capacity as needed.

↑ Back to top

Cross-Trust Client Authentication Load for NTLM

Evaluating Cross-Trust Client Authentication Load
Many environments may have one or more domains connected by a trust. An authentication request for an identity in another domain that does not use Kerberos
authentication needs to traverse a trust using the domain controller’s secure channel to another domain controller either in the destination domain or the next domain
in the path to the destination domain. The number of concurrent calls using the secure channel that a domain controller can make to a domain controller in a trusted
domain is controlled by a setting known as MaxConcurrentAPI. For domain controllers, ensuring that the secure channel can handle the amount of load is
accomplished by one of two approaches: tuning MaxConcurrentAPI or, within a forest, creating shortcut trusts. To gauge the volume of traffic across an individual
trust, refer to How to do performance tuning for NTLM authentication by using the MaxConcurrentApi setting .
During data collection, this, as with all the other scenarios, must be collected during the peak busy periods of the day for the data to be useful.

Note
Intraforest and interforest scenarios may cause the authentication to traverse multiple trusts and each stage would need to be tuned.
Planning
There are a number of applications that use NTLM by default, or use it in a certain configuration scenario. Application servers grow in capacity and service an
increasing number of active clients. There is also a trend that clients keep sessions open for a limited time and rather reconnect on a regular basis (such as email pull
sync). Another common example for high NTLM load is web proxy servers that require authentication for Internet access.
These applications can cause a significant load for NTLM authentication, which can put significant stress on the DCs, especially when users and resources are in
different domains.

There are multiple approaches to managing cross-trust load, which in practice are used in conjunction rather than in an exclusive either/or scenario. The possible
options are:

Reduce cross-trust client authentication by locating the services that a user consumes in the same domain that the user is resident in.
Increase the number of secure-channels available. This is relevant to intraforest and cross-forest traffic and are known as shortcut trusts.
Tune the default settings for MaxConcurrentAPI.

For tuning MaxConcurrentAPI on an existing server, the equation is:

17 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

New_MaxConcurrentApi_setting >= (semaphore_acquires + semaphore_time-outs) * average_semaphore_hold_time / time_collection_length

For more information, see KB article 2688798: How to do performance tuning for NTLM authentication by using the MaxConcurrentApi setting .

↑ Back to top

Virtualization Considerations
None, this is an operating system tuning setting.

Calculation Summary Example

Data Type Value

Semaphore Acquires (Minimum) 6,161

Semaphore Acquires (Maximum) 6,762

Semaphore Timeouts 0

Average Semaphore Hold Time 0.012

Collection Duration (seconds) 1:11 minutes (71 seconds)

Formula (from KB 2688798) ((6762-6161) + 0) * 0.012 /

Minimum value for MaxConcurrentAPI ((6762-6161) + 0) * 0.012 / 71 = .101

For this system for this time period, the default values are acceptable.

↑ Back to top

Monitoring For Compliance With Capacity Planning Goals

Throughout this article, it has been discussed that planning and scaling go towards utilization targets. Here is a summary chart of the recommended thresholds that
must be monitored to ensure the systems are operating within adequate capacity thresholds. Keep in mind that these are not performance thresholds, but capacity
planning thresholds. A server operating in excess of these thresholds will work, but is time to start validating that all the applications are well behaved. If said
applications are well behaved, it is time to start evaluating hardware upgrades or other configuration changes.

Category Performance Counter Interval/Sampling Target Warning

Processor /p> Processor Information (_Total)\% Processor Utility 60 min 40% 60%

RAM (Windows Server 2008 R2 or earlier) Memory\Available MB < 100 MB N/A < 100 MB

RAM (Windows Server 2012) Memory\Long-Term Average Standby Cache Lifetime(s) 30 min Must be tested Must be tested

Network Network Interface(*)\Bytes Sent/sec 30 min 40% 60%

Network Interface(*)\Bytes Received/sec

18 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

↑ Back to top

Appendix A: CPU Sizing Criteria

Important Concepts

Definitions
Processor (microprocessor) – a component that reads and executes program instructions

CPU – Central Processing Unit

Multi-Core processor – multiple CPUs on the same integrated circuit

Multi-CPU – multiple CPUs, not on the same integrated circuit

Logical Processor – one logical computing engine from the perspective of the operating system. This includes hyper-threaded, one core on multi-core processor, or a
single core processor.

As today’s server systems have multiple processors, multiple multi-core processors, and hyper-threading, this information is generalized to cover both scenarios. As
such, the term logical processor will be used as it represents the operating system and application perspective of the available computing engines.

↑ Back to top

Thread-Level Parallelism
Each thread is an independent task, as each thread has its own stack and instructions. Because AD DS is multi-threaded and the number of available threads can be
tuned by using How to view and set LDAP policy in Active Directory by using Ntdsutil.exe , it scales well across multiple logical processors.

↑ Back to top

Data-Level Parallelism
This involves sharing data across multiple threads within one process (in the case of the AD DS process alone) and across multiple threads in multiple processes (in
general). With concern to over-simplifying the case, this means that any changes to data are reflected to all running threads in all the various levels of cache (L1, L2, L3)
across all cores running said threads as well as updating shared memory. Performance can degrade during write operations while all the various memory locations are
brought consistent before instruction processing can continue.

↑ Back to top

CPU Speed vs. Multiple-Core Considerations

The general rule of thumb is faster logical processors reduce the duration it takes to process a series of instructions, while more logical processors means that more
tasks can be run at the same time. These rules of thumb break down as the scenarios become inherently more complex with considerations of fetching data from
shared-memory, waiting on data-level parallelism, and the overhead of managing multiple threads. This is also why scalability in multi-core systems is not linear.

Consider the following analogies in these considerations: think of a highway, with each thread being an individual car, each lane being a core, and the speed limit
being the clock speed.

1. If there is only one car on the highway, it doesn’t matter if there are two lanes or 12 lanes. That car is only going to go as fast as the speed limit will allow.
2. Assume that the data the thread needs is not immediately available. The analogy would be that a segment of road is shutdown. If there is only one car on the
highway, it doesn’t matter what the speed limit is until the lane is reopened (data is fetched from memory).
3. As the number of cars increase, the overhead to manage the number of cars increases. Compare the experience of driving and the amount of attention
necessary when the road is practically empty (such as late evening) versus when the traffic is heavy (such as mid-afternoon, but not rush hour). Also, consider
the amount of attention necessary when driving on a two-lane highway, where there is only one other lane to worry about what the drivers are doing, versus a
6-lane highway where one has to worry about what a lot of other drivers are doing.
Note
The analogy about the rush hour scenario is extended in the next section: Response Time/How the System Busyness Impacts Performance .

19 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

As a result, specifics about more or faster processors become highly subjective to application behavior, which in the case of AD DS is very environmentally specific and
even varies from server to server within an environment. This is why the references earlier in the article do not invest heavily in being overly precise, and a margin of
safety is included in the calculations. When making budget-driven purchasing decisions, it is recommended that optimizing usage of the processors at 40% (or the
desired number for the environment) occurs first, before considering buying faster processors. The increased synchronization across more processors reduces the true
benefit of more processors from the linear progression (2x the number of processors provides less than 2x available additional compute power).

Note
Amdahl’s Law and Gustafson’s Law are the relevant concepts here.

↑ Back to top

Response Time/How the System Busyness Impacts Performance

Queuing theory is the mathematical study of waiting lines (queues). ). In queuing theory, the Utilization Law is represented by the equation:

Uk=B/T

Where U k is the utilization percentage, B is the amount of time busy, and T is the total time the system was observed. Translated into the context of Windows, this
means the number of 100 nanosecond (ns) interval threads that are in a Running state divided by how many 100 ns intervals were available in given time interval. This
is exactly the formula for calculating % Processor Utility(reference Processor Object and PERF_100NSEC_TIMER_INV ).

Queuing theory also provides the formula: N = U k/1- U k to estimate the number of waiting items based on utilization ( N is the length of the queue). Charting this
over all utilization intervals provides the following estimates to how long the queue to get on the processor is at any given CPU load.

It is observed that after 50% CPU load, on average there is always a wait of 1 other item in the queue, with a noticeably rapid increase after about 70% CPU utilization.

Returning to the driving analogy used earlier in this section:

The busy times of “mid-afternoon” would, hypothetically, fall somewhere into the 40% to 70% range. There is enough traffic such that one’s ability to pick any
lane is not majorly restricted, and the chance of another driver being in the way, while high, does not require the level of effort to “find” a safe gap between
other cars on the road.
One will notice that as traffic approaches rush hour, the road system approaches 100% capacity. Changing lanes can become very challenging because cars are
so close together that increased caution must be exercised to do so.

This is why the long term averages for capacity conservatively estimated at 40% allows for head room for abnormal spikes in load, whether said spikes transitory (such
as poorly coded queries that run for a few minutes) or abnormal bursts in general load (the morning of the first day after a long weekend).

The above statement regards % Processor Time calculation being the same as the Utilization Law is a bit of a simplification for the ease of the general reader. For
those more mathematically rigorous:

Translating the PERF_100NSEC_TIMER_INV

B = The number of 100 ns intervals “Idle” thread spends on the logical processor. The change in the “X” variable in the PERF_100NSEC_TIMER_INV
calculation
T = the total number of 100 ns intervals in a given time range. The change in the “Y” variable in the PERF_100NSEC_TIMER_INV calculation.
U k = The utilization percentage of the logical processor by the “Idle Thread” or % Idle Time.
Working out the math:
U k = 1 - %Processor Time
%Processor Time = 1 - U k
%Processor Time = 1 - B / T
%Processor Time = 1 – X1 – X0 / Y1-Y0

20 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

↑ Back to top

Applying the concepts to capacity planning

The preceding math may make determinations about the number of logical processors needed in a system seem overwhelmingly complex. This is why the approach to
sizing the systems is focused on determining maximum target utilization based on current load and calculating the number of logical processors required to get there.
Additionally, while logical processor speeds will have a significant impact on performance, cache efficiencies, memory coherence requirements, thread scheduling and
synchronization, and imperfectly balanced client load will all have significant impacts on performance that will vary on a server-by-server basis. With the relatively
cheap cost of compute power, attempting to analyze and determine the perfect number of CPUs needed becomes more an academic exercise than it does provide
business value.

40% is not a hard and fast requirement, it is a reasonable start. Various consumers of Active Directory require various levels of responsiveness. There may be scenarios
where environments can run at 80% or 90% utilization as a sustained average, as the increased wait times for access to the processor will not noticeably impact client
performance. It is important to re-iterate that there are many areas in the system that are much slower than the logical processor in the system, including access to
RAM, access to disk, and transmitting the response over the network. All of these items need to be tuned in conjunction. Examples:

Adding more processors to a system running 90% that is disk-bound is probably not going to significantly improve performance. Deeper analysis of the system
will probably identify that there are a lot of threads that are not even getting on the processor because they are waiting on IO to complete.
Resolving the disk-bound issues potentially means that threads that were previously spending a lot of time in a waiting state will no longer be in a waiting state
for IO and there will be more competition for CPU time, meaning that the 90% utilization in the previous example will go to 100% (because it can not go
higher). Both components need to be tuned in conjunction.
Note
Processor Information(*)\% Processor Utility can exceed 100% with systems that have a "Turbo" mode. This is where the CPU exceeds the rated processor
speed for short periods. Reference CPU manufacturers documentation and description of the counter for greater insight.

Discussing whole system utilization considerations also brings into the conversation domain controllers as virtualized guests. Response Time/How the System Busyness
Impacts Performance applies to both the host and the guest in a virtualized scenario. This is why in a host with only one guest, a domain controller (and generally any
system) has near the same performance it does on physical hardware. Adding additional guests to the hosts increases the utilization of the underlying host, thereby
increasing the wait times to get access to the processors as explained previously. In short, logical processor utilization needs to be managed at both the host and at
the guest levels.

Extending the previous analogies, leaving the highway as the physical hardware, the guest VM will be analogized with a bus (an express bus that goes straight to the
destination the rider wants). Imagine the following four scenarios:

It is off hours, a rider gets on a bus that is nearly empty, and the bus gets on a road that is also nearly empty. As there is no traffic to contend with, the rider has
a nice easy ride and gets there just as fast as if the rider had driven instead. The rider’s travel times are still constrained by the speed limit.
It is off hours so the bus is nearly empty but most of the lanes on the road are closed, so the highway is still congested. The rider is on an almost-empty bus on
a congested road. While the rider does not have a lot of competition in the bus for where to sit, the total trip time is still dictated by the rest of the traffic
outside.
It is rush hour so the highway and the bus are congested. Not only does the trip take longer, but getting on and off the bus is a nightmare because people are
shoulder to shoulder and the highway is not much better. Adding more busses (logical processors to the guest) does not mean they can fit on the road any
more easily, or that the trip will be shortened.
The final scenario, though it may be stretching the analogy a little, is where the bus is full, but the road is not congested. While the rider will still have trouble
getting on and off the bus, the trip will be efficient after the bus is on the road. This is the only scenario where adding more busses (logical processors to the
guest) will improve guest performance.

From there it is relatively easy to extrapolate that there are a number of scenarios in between the 0% utilized and the 100% utilized state of the road and the 0% and
100% utilized state of the bus that have varying degrees of impact.

Applying the principals above of 40% CPU as reasonable target for the host as well as the guest is a reasonable start for the same reasoning as above, the amount of
queuing.

↑ Back to top

Appendix B: Considerations Regarding Different Processor

Speeds, and the Effect of Processor Power Management
on Processor Speeds
Throughout the sections on processor selection the assumption is made that the processor is running at 100% of clock speed the entire time the data is being

21 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

collected and that the replacement systems will have the same speed processors. Despite both assumptions in practice being false, particularly with Windows Server
2008 R2 and later, where the default power plan is Balanced , the methodology still stands as it is the conservative approach. While the potential error rate may
increase, it only increases the margin of safety as processor speeds increase.

For example, in a scenario where 11.25 CPUs are demanded, if the processors were running at half speed when the data was collected, the more accurate
estimate might be 5.125/2.
It is impossible to guarantee that doubling the clock speeds would double the amount of processing that happens for a given time period. This is due to the
fact the amount of time that the processors spend waiting on RAM or other system components could stay the same. The net effect is that the faster processors
might spend a greater percentage of the time idle while waiting on data to be fetched. Again, it is recommended to stick with the lowest common denominator,
being conservative, and avoid trying to calculate a potentially false level of accuracy by assuming a linear comparison between processor speeds.

Alternatively, if processor speeds in replacement hardware are lower than current hardware, it would be safe to increase the estimate of processors needed by a
proportionate amount. For example, it is calculated that 10 processors are needed to sustain the load in a site, and the current processors are running at 3.3 Ghz and
replacement processors will run at 2.6 Ghz, this is a 21% decrease in speed. In this case, 12 processors would be the recommended amount.

That said, this variability would not change the Capacity Management processor utilization targets. As processor clock speeds will be adjusted dynamically based on
the load demanded, running the system under higher loads will generate a scenario where the CPU spends more time in a higher clock speed state, making the
ultimate goal to be at 40% utilization in a 100% clock speed state at peak. Anything less than that will generate power savings as CPU speeds will be throttled back
during off peak scenarios.

Note
An option would be to turn off power management on the processors (setting the power plan to High Performance ) while data is collected. That would give a more
accurate representation of the CPU consumption on the target server.

To adjust estimates for different processors, it used to be safe, excluding other system bottlenecks outlined above, to assume that doubling processor speeds doubled
the amount of processing that could be performed. Today, the internal architecture of processors is different enough between processors, that a safer way to gauge
the effects of using different processors than data was taken from is to leverage the SPECint_rate2006 benchmark from Standard Performance Evaluation Corporation.

1. Find the SPECint_rate2006 scores for the processor that are in use and that plan to be used.
a. On the website of the Standard Performance Evaluation Corporation, select Results, highlight CPU2006, and select Search all SPECint_rate2006 results.
b. Under Simple Request, enter the search criteria for the target processor, for example Processor Matches E5-2630 (baselinetarget) and Processor
Matches E5-2650 (baseline).
c. Find the server and processor configuration to be used (or something close, if an exact match is not available) and note the value in the Result and #
Cores columns.
2. To determine the modifier use the following equation:
((Target platform per-core score value)×(MHz per-core of baseline platform))/((Baseline per-core score value)×(MHz per-core of target platform))
Using the above example:
(35.83×2000)/(33.75×2300)=0.92
3. Multiply the estimated number of processors by the modifier. In the above case to go from the E5-2650 processor to the E5-2630 multiply the calculated 11.25
CPUs * 0.92 = 10.35 processors needed.

↑ Back to top

Appendix C: Fundamentals Regarding the Operating

System Interacting with Storage
The queuing theory concepts outlined in Response Time/How the System Busyness Impacts Performance are also applicable to storage. Having a familiarity of how
the operating system handles I/O is necessary to apply these concepts. In the Microsoft Windows operating system, a queue to hold the I/O requests is created for
each physical disk. However, a clarification on physical disk needs to be made. Array controllers and SANs present aggregations of spindles to the operating system as
single physical disks. Additionally, array controllers and SANs can aggregate multiple disks into one array set and then split this array set into multiple “partitions”,
which is in turn presented to the operating system as multiple physical disks (ref. figure).

In this figure the two spindles are mirrored and split into logical areas for data storage (Data 1 and Data 2).
These logical areas are viewed by the operating system as separate physical disks.

Although this can be highly confusing, the following terminology is used throughout this appendix to identify the different entities:

22 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

Spindle – the device that is physically installed in the server.

Array – a collection of spindles aggregated by controller.
Array partition – a partitioning of the aggregated array
LUN – an array, used when referring to SANs
Disk – What the operating system observes to be a single physical disk.
Partition – a logical partitioning of what the operating system perceives as a physical disk.

↑ Back to top

Operating System Architecture Considerations

The operating system creates a First In/First Out (FIFO) I/O queue for each disk that is observed; this disk may be representing a spindle, an array, or an array partition.
From the operating system perspective, with regard to handling I/O, the more active queues the better. As a FIFO queue is serialized, meaning that all I/Os issued to
the storage subsystem must be processed in the order the request arrived. By correlating each disk observed by the operating system with a spindle/array, the
operating system now maintains an I/O queue for each unique set of disks, thereby eliminating contention for scarce I/O resources across disks and isolating I/O
demand to a single disk. As an exception, Windows Server 2008 introduces the concept of IO prioritization, and applications designed to use the “Low” priority fall out
of this normal order and take a back seat. Applications not specifically coded to leverage the “Low” priority default to “Normal.”

↑ Back to top

Introducing simple storage subsystems

Starting with a simple example (a single hard drive inside a computer) a component-by-component analysis will be given. Breaking this down into the major storage
subsystem components, the system consists of:

1. 1 – 10,000 RPM Ultra Fast SCSI HD (Ultra Fast SCSI has a 20 MB/s transfer rate)
2. 1 – SCSI Bus (the cable)
3. 1 – Ultra Fast SCSI Adapter
4. 1 – 32-bit 33 MHz PCI bus

Once the components are identified, an idea of how much data can transit the system, or how much I/O can be handled, can be calculated. Note that the amount of
I/O and quantity of data that can transit the system is correlated, but not the same. This correlation depends on whether the disk I/O is random or sequential and the
block size. (All data is written to the disk as a block, but different applications using different block sizes.) On a component-by-component basis:

The hard drive – The average 10,000-RPM hard drive has a 7-millisecond (ms) seek time and a 3 ms access time. Seek time is the average amount of time it
takes the read/write head to move to a location on the platter. Access time is the average amount of time it takes to read or write the data to disk, once the
head is in the correct location. Thus, the average time for reading a unique block of data in a 10,000-RPM HD constitutes a seek and an access, for a total of
approximately 10 ms (or .010 seconds) per block of data.

When every disk access requires movement of the head to a new location on the disk, the read/write behavior is referred to as “random.” Thus, when all I/O is
random, a 10,000-RPM HD can handle approximately 100 I/O per second (IOPS) (the formula is 1000 ms per second divided by 10 ms per I/O or 1000/10=100
IOPS).

Alternatively, when all I/O occurs from adjacent sectors on the HD, this is referred to as sequential I/O. Sequential I/O has no seek time because when the first
I/O is complete, the read/write head is at the start of where the next block of data is stored on the HD. Thus a 10,000-RPM HD is capable of handling
approximately 333 I/O per second (1000 ms per second divided by 3 ms per I/O).

Note
This example does not reflect the disk cache, where the data of one cylinder is typically kept. In this case, the 10 ms are needed on the first IO and the disk
reads the whole cylinder. All other sequential IO is satisfied from the cache. As a result, in-disk caches might improve sequential IO performance.

So far, the transfer rate of the hard drive has been irrelevant. Whether the hard drive is 20 MB/s Ultra Wide or an Ultra3 160 MB/s, the actual amount of IOPS
the can be handled by the 10,000-RPM HD is ~100 random or ~300 sequential I/O. As block sizes change based on the application writing to the drive, the
amount of data that is pulled per I/O is different. For example, if the block size is 8 KB, 100 I/O operations will read from or write to the hard drive a total of 800
KB. However, if the block size is 32 KB, 100 I/O will read/write 3,200 KB (3.2 MB) to the hard drive. As long as the SCSI transfer rate is in excess of the total
amount of data transferred, getting a “faster” transfer rate drive will gain nothing. See the following tables for comparison.

7200 RPM 10,000 RPM 15,000 RPM

9ms seek, 4ms access 7ms seek, 3ms access 4ms seek, 2ms access

Random I/O 80 100 150

Sequential I/O 250 300 500

23 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

10,000 RPM drive 8KB Block size (Ac ve Directory Jet)

Random I/O 800 KB/s

Sequential I/O 2400 KB/s

Understanding how the “SCSI Backplane (bus)”, or in this scenario the ribbon cable, impacts throughput of the storage subsystem depends on knowledge of the
block size. Essentially the question would be, how much I/O can the bus handle if the I/O is in 8 KB blocks? In this scenario, the SCSI bus is 20 MB/s, or 20480
KB/s. 20480 KB/s divided by 8 KB blocks yields a maximum of approximately 2500 IOPS supported by the SCSI bus.
Note
The figures in the following table represent an example. Most attached storage devices currently use PCI Express, which provides much higher throughput.

8KB Block size

2KB Block size
I/O supported by SCSI bus per block size (AD Jet)

(SQL 7.0/2000)

20 MB/s 10,000 2,500

40 MB/s 20,000 5,000

128 MB/s 65,536 16,384

320 MB/s 160,000 40,000

As can be determined from this chart, in the scenario presented, no matter what the use, the bus will never be a bottleneck, as the spindle maximum is 100 I/O,
well below any of the above thresholds.
Note
This assumes that the SCSI bus is 100% efficient.

SCSI Adapter – For determining the amount of I/O that this can handle, the manufacturer’s specifications need to be checked. Directing I/O requests to the
appropriate device requires processing of some sort, thus the amount of I/O that can be handled is dependent on the SCSI adapter (or array controller)
processor.

In this example, the assumption that 1,000 I/O can be handled will be made.

PCI bus – This is an often overlooked component. In this example, this will not be the bottleneck; however as systems scale up, it can become a bottleneck. For
reference, a 32 bit PCI bus operating at 33Mhz can in theory transfer 133 MB/s of data. Following is the equation: 32 bits/8 bits per byte * 33 MHz = 133 MB/s.
Note that is the theoretical limit; in reality only about 50% of the maximum is actually reached, although in certain burst scenarios, 75% efficiency can be
obtained for short periods.
A 66Mhz 64 bit PCI bus can support a theoretical maximum of (64 bits / 8 bits per byte * 66 Mhz =) 528 MB/sec. Additionally, any other device (such as the
network adapter, second SCSI controller, and so on) will reduce the bandwidth available as the bandwidth is shared and the devices will contend for the limited
resources.

After analysis of the components of this storage subsystem, the spindle is the limiting factor in the amount of I/O that can be requested, and consequently the amount
of data that can transit the system. Specifically, in an AD DS scenario, this is 100 random I/O per second in 8KB increments, for a total of 800 KB per second when
accessing the Jet database. Alternatively, the maximum throughput for a spindle that is exclusively allocated to log files would suffer the following limitations: 300
sequential I/O per second in 8KB increments, for a total of 2400KB (2.4MB) per second.

Now, having analyzed a simple configuration, the following table demonstrates where the bottleneck will occur as components in the storage subsystem are changed
or added.

BOTTLENECK
Notes Disk Bus Adapter PCI bus
ANALYSIS

This is the domain controller Add 1 disk (Total=2) 200 I/Os

configuration after adding a second total. 800
disk. The disk configuration represents I/O is random KB/s total.
the bottleneck at 800 KB/s.
4 KB block size

24 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

10,000 RPM HD

After adding 7 disks, the disk Add 7 disks 800 I/Os

configuration still represents the (Total=8) total. 3200
bottleneck at 3200 KB/s. KB/s total
I/O is random

4 KB block size

10,000 RPM HD

After changing I/O to sequential, the Add 7 disks (Total=8) 2400 I/O sec can be
network adapter becomes the read/written to disk,
bottleneck because it is limited to 1000 I/O is sequential controller limited to 1000
IOPS. IOPS
4 KB block size

10,000 RPM HD

After replacing the network adapter Add 7 disks (Total=8) 800 I/Os
with a SCSI adapter that supports total. 3,200
10,000 IOPS, the bottleneck returns to I/O Random KB/s total
the disk configuration.
4 KB block size

10,000 RPM HD

Upgrade SCSI
Adapter (now
supports 10,000
I/O)

After increasing the block size to 32KB, Add 7 disks (Total=8) 800 I/Os total. 25,600
the bus becomes the bottleneck KB/s (25 MB/s) can be
because it only supports 20 MB/s. I/O Random read/written to disk.
32KB block size The bus only supports
20 MB/s
10,000 RPM HD

After upgrading the bus and adding Add 13 disks 2800 I/Os
more disks, the disk remains the (Total=14)
bottleneck. 11,200 KB/s
Add 2nd SCSI Adapter (10.9 Mb/s)
w/14 disks

I/O Random

4 KB block size

10,000 RPM HD

Upgrade to 320
MB/s SCSI bus

After changing I/O to sequential, the Add 13 disks 8,400 I/Os

disk remains the bottleneck. (Total=14)
33,600 KB\s
Add 2nd SCSI Adapter
w/14 disks (32.8 MB\s)

I/O Sequential

4 KB block size

10,000 RPM HD

Upgrade to 320 MB/s

SCSI bus

After adding faster hard drives, the Add 13 disks 14,000 I/Os
disk remains the bottleneck. (Total=14)
56,000 KB/s
Add 2nd SCSI Adapter
w/14 disks (54.7 MB/s)

I/O Sequential

25 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

4 KB block size

15,000 RPM HD

Upgrade to 320 MB/s

SCSI bus

After increasing the block size to 32KB, Add 13 disks 14,000 I/Os
the PCI bus becomes the bottleneck. (Total=14)
448,000 KB/s
Add 2nd SCSI Adapter
w/14 disks (437 MB/s) is the
read/write limit to the
I/O Sequential spindle.

32KB block size The PCI bus supports a

theoretical maximum of
15,000 RPM HD 133 MB/s (75% efficient
at best).
Upgrade to 320 MB/s
SCSI bus

↑ Back to top

Introducing RAID
The nature of a storage subsystem does not change dramatically when an array controller is introduced; it just replaces the SCSI adapter in the calculations. What does
change is the cost of reading and writing data to the disk when using the various array levels (such as RAID 0, RAID 1, or RAID 5).

In RAID 0, the data is striped across all the disks in the RAID set. This means that during a read or a write operation, a portion of the data is pulled from or pushed to
each disk, increasing the amount of data that can transit the system during the same time period. Thus, in one second, on each spindle (again assuming 10,000-RPM
drives), 100 I/O operations can be performed. The total amount of I/O that can be supported is N spindles times 100 I/O per second per spindle (yields 100*N I/O per
second).

In RAID 1, the data is mirrored (duplicated) across a pair of spindles for redundancy. Thus, when a read I/O operation is performed, data can be read from both of the
spindles in the set. This effectively makes the I/O capacity from both disks available during a read operation. The caveat is that write operations gain no performance
advantage in a RAID 1. This is because the same data needs to be written to both drives for the sake of redundancy. Though it does not take any longer, as the write of
data occurs concurrently on both spindles, because both spindles are occupied duplicating the data, a write I/O operation in essence prevents 2 read operations from
occurring. Thus, every write I/O costs 2 read I/O. A formula can be created from that information to determine the total number of I/O operations that are occurring:
Read I/O + 2 * Write I/O = Total available disk I/O consumed. When the ratio of reads to writes and the number of spindles are known, the following equation can be
derived from the above equation to identify the maximum I/O that can be supported by the array: Maximum IOPS per spindle * 2 spindles * [(%Reads + %Writes) /
(%Reads + 2 * %Writes)] = Total IOPS.

RAID 1+ 0, behaves exactly the same as RAID 1 regarding the expense of reading and writing. However, the I/O is now striped across each mirrored set. If Maximum
IOPS per spindle * 2 spindles * [(%Reads + %Writes) / (%Reads + 2 * %Writes)] = Total I/O in a RAID 1 set, when a multiplicity (N) of RAID 1 sets are striped, the Total
I/O that can be processed becomes N * I/O per RAID 1 set: N * {Maximum IOPS per spindle * 2 spindles * [(%Reads + %Writes) / (%Reads + 2 * %Writes)] } = Total
IOPS

In RAID 5, sometimes referred to as N+1 RAID, the data is striped across N spindles and parity information is written to the “+1” spindle. However, RAID 5 is much
more expensive when performing a write I/O than RAID 1 or 1+0. RAID 5 performs the following process every time a write I/O is submitted to the array:

1. Read the old data

2. Read the old parity
3. Write the new data
4. Write the new parity

As every write I/O request that is submitted to the array controller by the operating system requires 4 I/O operations to complete, write requests submitted take 4
times as long to complete as a single read I/O. To derive a formula to translate I/O requests from the operating system perspective to that experienced by the spindles:
Read I/O + 4 * Write I/O = Total I/O. Similarly in a RAID 1 set, when the ratio of reads to writes and the number of spindles are known, the following equation can be
derived from the above equation to identify the maximum I/O that can be supported by the array (Note that total number of spindles does not include the “drive” lost
to parity): IOPS per Spindle * (Spindles – 1) * [(%Reads + %Writes) / (%Reads + 4 * %Writes)] = Total IOPS

↑ Back to top

Introducing SANs
Expanding the complexity of the storage subsystem, when a SAN is introduced into the environment, the basic principles outlined do not change, however I/O
behavior for all of the systems connected to the SAN needs to be taken into account. As one of the major advantages in using a SAN is an additional amount of

26 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

redundancy over internally or externally attached storage, capacity planning now needs to take into account fault tolerance needs. Also, more components are
introduced that need to be evaluated. Breaking a SAN down into the component parts:

1. SCSI or Fibre Channel Hard Drive

2. Storage unit channel backplane
3. Storage units
4. Storage controller module
5. SAN switch(es)
6. HBA(s)
7. The PCI bus

When designing any system for redundancy, additional components are included to accommodate the potential of failure. It is very important, when capacity planning,
to exclude the redundant component from available resources. For example, if the SAN has two controller modules, the I/O capacity of one controller module is all
that should be used for total I/O throughput available to the system. This is due to the fact that if one controller fails, the entire I/O load demanded by all connected
systems will need to be processed by the remaining controller. As all capacity planning is done for peak usage periods, redundant components should not be factored
into the available resources and planned peak utilization should not exceed 80% saturation of the system (in order to accommodate bursts or anomalous system
behavior). Similarly, the redundant SAN switch, storage unit, and spindles should not be factored into the I/O calculations.

When analyzing the behavior of the SCSI or Fibre Channel Hard Drive, the method of analyzing the behavior as outlined previously does not change. Although there
are certain advantages and disadvantages to each protocol, the limiting factor on a per disk basis is the mechanical limitation of the hard drive.

Analyzing the channel on the storage unit is exactly the same as calculating the resources available on the SCSI bus, or bandwidth (such as 20 MB/s) divided by block
size (such as 8 KB). Where this deviates from the simple previous example is in the aggregation of multiple channels. For example, if there are 6 channels, each
supporting 20 MB/s maximum transfer rate, the total amount of I/O and data transfer that is available is 100 MB/s (this is correct, it is not 120 MB/s). Again, fault
tolerance is a major player in this calculation, in the event of the loss of an entire channel, the system is only left with 5 functioning channels. Thus, to ensure
continuing to meet performance expectations in the event of failure, total throughput for all of the storage channels should not exceed 100 MB/s (this assumes load
and fault tolerance is evenly distributed across all channels). Turning this into an I/O profile is dependent on the behavior of the application. In the case of Active
Directory Jet I/O, this would correlate to approximately 12,500 I/O per second (100 MB/s / 8 KB per I/O).

Next, obtaining the manufacturer’s specifications for the controller modules is required in order to gain an understanding of the throughput each module can support.
In this example, the SAN has two controller modules that support 7,500 I/O each. The total throughput of the system may be 15,000 IOPS if redundancy is not desired.
In calculating maximum throughput in the case of failure, the limitation is the throughput of one controller, or 7,500 IOPS. This threshold is well below the 12,500 IOPS
(assuming 4 KB block size) maximum that can be supported by all of the storage channels, and thus, is currently the bottleneck in the analysis. Still for planning
purposes, the desired maximum I/O to be planned for would be 10,400 I/O.

When the data exits the controller module, it transits a Fibre Channel connection rated at 1 Gb/s (or 1 Gigabit per second). To correlate this with the other metrics, 1
Gb/s turns into 128 MB/s (1 Gb/s divided by 8 bits/byte). As this is in excess of the total bandwidth across all channels in the storage unit (100 MB/s), this will not
bottleneck the system. Additionally, as this is only one of the two channels (the additional 1 Gb/s Fibre Channel connection being for redundancy), if one connection
fails, the remaining connection still has enough capacity to handle all the data transfer demanded.

En route to the server, the data will most likely transit a SAN switch. As the SAN switch has to process the incoming I/O request and forward it out the appropriate
port, the switch will have a limit to the amount of I/O that can be handled, however, manufacturers specifications will be required to determine what that limit is. For
example, if there are two switches and each switch can handle 10,000 IOPS, the total throughput will be 20,000 IOPS. Again, fault tolerance being a concern, if one
switch fails, the total throughput of the system will be 10,000 IOPS. As it is desired not to exceed 80% utilization in normal operation, using no more than 8000 I/O
should be the target.

Finally, the HBA installed in the server would also have a limit to the amount of I/O that it can handle. Usually, a second HBA is installed for redundancy, but just like
with the SAN switch, when calculating maximum I/O that can be handled, the total throughput of N-1 HBAs is what the maximum scalability of the system is.

↑ Back to top

Caching Considerations
Caches are one of the components that can significantly impact the overall performance at any point in the storage system. Detailed analysis about caching algorithms
is beyond the scope of this article; however, some basic statements about caching on disk subsystems are worth illuminating:

Caching does improved sustained sequential write I/O as it can buffer many smaller write operations into larger I/O blocks and de-stage to storage in fewer, but
larger block sizes. This will reduce total random I/O and total sequential I/O, thus providing more resource availability for other I/O.
Caching does not improve sustained write I/O throughput of the storage subsystem. It only allows for the writes to be buffered until the spindles are available
to commit the data. When all the available I/O of the spindles in the storage subsystem is saturated for long periods, the cache will eventually fill up. In order to
empty the cache, enough time between bursts, or extra spindles, need to be allotted in order to provide enough I/O to allow the cache to flush.

Larger caches only allow for more data to be buffered. This means longer periods of saturation can be accommodated.

In a normally operating storage subsystem, the operating system will experience improved write performance as the data only needs to be written to cache.
Once the underlying media is saturated with I/O, the cache will fill and write performance will return to disk speed.

When caching read I/O, the scenario where the cache is most advantageous is when the data is stored sequentially on the disk, and the cache can read-ahead
(it makes the assumption that the next sector contains the data that will be requested next).
When read I/O is random, caching at the drive controller is unlikely to provide any enhancement to the amount of data that can be read from the disk. Any
enhancement is non-existent if the operating system or application-based cache size is greater than the hardware-based cache size.

In the case of Active Directory, the cache is only limited by the amount of RAM.

27 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

↑ Back to top

SSD Considerations
SSDs are a completely different animal than spindle based hard disks. Yet the two key criteria remain: “How many IOPS can it handle?” and “What is the latency for
those IOPS?” In comparison to spindle based Hard Disks, SSDs can handle higher volumes of IO and can have lower latencies. In general and as of this writing, while
SSDs are still expensive in a cost-per-Gigabyte comparison, they are very cheap in terms of cost-per-IO and deserve significant consideration in terms of storage
performance.

Considerations:

Both IOPS and latencies are very subjective to the manufacturer designs and in some cases have been observed to be poorer performing than spindle based
technologies. In short, it is more important to review and validate the manufacturer specs drive by drive and not assume any generalities.
IOPS types can have very different numbers depending on whether it is read or write. AD DS services, in general, being predominantly read-based, will be less
affected than some other application scenarios.
“Write endurance” – this is the concept that SSD cells will eventually wear out. Various manufacturers deal with this challenge different fashions. At least for the
database drive, the predominantly read IO profile allows for downplaying the significance of this concern as the data is not highly volatile.

↑ Back to top

Summary
One way to think about storage is picturing household plumbing. Imagine the IOPS of the media that the data is stored on is the household main drain. When this is
clogged (such as roots in the pipe) or limited (it is collapsed or too small), all the sinks in the household back up when too much water is being used (too many
guests). This is perfectly analogous to a shared environment where one or more systems are leveraging shared storage on an SAN/NAS/iSCSI with the same underlying
media. Different approaches can be taken to resolve the different scenarios:

A collapsed or undersized drain requires a full scale replacement and fix. This would be similar to adding in new hardware or redistributing the systems using
the shared storage throughout the infrastructure.
A “clogged” pipe usually means identification of one or more offending problems and removal of those problems. In a storage scenario this could be storage or
system level backups, synchronized antivirus scans across all servers, and synchronized defragmentation software running during peak periods.

In any plumbing design, multiple drains feed into the main drain. If anything stops up one of those drains or a junction point, only the things behind that junction
point back up. In a storage scenario, this could be an overloaded switch (SAN/NAS/iSCSI scenario), driver compatibility issues (wrong driver/HBA Firmware/storport.sys
combination), or backup/antivirus/defragmentation. To determine if the storage “pipe” is big enough, IOPS and IO size needs to be measured. At each joint add them
together to ensure adequate “pipe diameter.”

↑ Back to top

Appendix D - Discussion on storage troubleshooting -

Environments where providing at least as much RAM as
the database size is not a viable option
It is helpful to understand why these recommendations exist so that the changes in storage technology can be accommodated. These recommendations exist for two
reasons. The first is isolation of IO, such that performance issues (that is, paging) on the operating system spindle do not impact performance of the database and IO
profiles. The second is that log files for AD DS (and most databases) are sequential in nature, and spindle-based hard drives and caches have a huge performance
benefit when used with sequential IO as compared to the more random IO patterns of the operating system and almost purely random IO patterns of the AD DS
database drive. By isolating the sequential IO to a separate physical drive, throughput can be increased. The challenge presented by today’s storage options is that the
fundamental assumptions behind these recommendations are no longer true. In many virtualized storage scenarios, such as iSCSI, SAN, NAS, and Virtual Disk image
files, the underlying storage media is shared across multiple hosts, thus completely negating both the “isolation of IO” and the “sequential IO optimization” aspects. In
fact these scenarios add an additional layer of complexity in that other hosts accessing the shared media can degrade responsiveness to the domain controller.

In planning storage performance, there are three categories to consider: cold cache state, warmed cache state, and backup/restore. The cold cache state occurs in
scenarios such as when the domain controller is initially rebooted or the Active Directory service is restarted and there is no Active Directory data in RAM. Warm cache
state is where the domain controller is in a steady state and the database is cached. These are important to note as they will drive very different performance profiles,
and having enough RAM to cache the entire database does not help performance when the cache is cold. One can consider performance design for these two
scenarios with the following analogy, warming the cold cache is a “sprint” and running a server with a warm cache is a “marathon.”

For both the cold cache and warm cache scenario, the question becomes how fast the storage can move the data from disk into memory. Warming the cache is a

28 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

scenario where, over time, performance improves as more queries reuse data, the cache hit rate increases, and the frequency of needing to go to disk decreases. As a
result the adverse performance impact of going to disk decreases. Any degradation in performance is only transient while waiting for the cache to warm and grow to
the maximum, system-dependent allowed size. The conversation can be simplified to how quickly the data can be gotten off of disk, and is a simple measure of the
IOPS available to Active Directory, which is subjective to IOPS available from the underlying storage. From a planning perspective, because warming the cache and
backup/restore scenarios happen on an exceptional basis, normally occur off hours, and are subjective to the load of the DC, general recommendations do not exist
except in that these activities be scheduled for non-peak hours.

AD DS, in most scenarios, is predominantly read IO, usually a ratio of 90% read/10% write. Read IO often tends to be the bottleneck for user experience, and with write
IO, causes write performance to degrade. As IO to the NTDS.DIT is predominantly random, caches tend to provide minimal benefit to read IO, making it that much
more important to configure the storage for read IO profile correctly.

For normal operating conditions, the storage planning goal is minimize the wait times for a request from AD DS to be returned from disk. This essentially means that
the number of outstanding and pending IO is less than or equivalent to the number of pathways to the disk. There are a variety of ways to measure this. In a
performance monitoring scenario, the general recommendation is that LogicalDisk(< NTDS Database Drive >)\Avg Disk sec/Read be less than 20 ms. The desired
operating threshold must be much lower, preferably as close to the speed of the storage as possible, in the 2 to 6 millisecond (.002 to .006 second) range depending
on the type of storage.

Example:

Analyzing the chart:

Green oval on the left – The latency remains consistent at 10 ms. The load increases from 800 IOPS to 2400 IOPS. This is the absolute floor to how quickly an IO
request can be processed by the underlying storage. This is subject to the specifics of the storage solution.
Burgundy oval on the right – The throughput remains flat from the exit of the green circle through to the end of the data collection while the latency continues
to increase. This is demonstrating that when the request volumes exceed the physical limitations of the underlying storage, the longer the requests spend
sitting in the queue waiting to be sent out to the storage subsystem.

Applying this knowledge:

Impact to a user querying membership of a large group - Assume this requires reading 1MB of data from the disk, the amount of IO and how long it takes can
be evaluated as follows:
Active Directory database pages are 8 KB in size.
A minimum of 128 pages need to be read in from disk.
Assuming nothing is cached, at the floor (10 ms) this is going to take a minimum 1.28 seconds to load the data from disk in order to return it to the
client. At 20 ms, where the throughput on storage has long since maxed out and is also the recommended maximum, it will take 2.5 seconds to get the

29 of 30 3/20/2017 12:05 PM
Capacity Planning for Active Directory Domain Services - TechNet Artic... https://social.technet.microsoft.com/wiki/contents/articles/14355.capacity...

data from disk in order to return it to the end user.

At what rate will the cache be warmed – Making the assumption that the client load is going to maximize the throughput on this storage example, the cache
will warm at a rate of 2400 IOPS * 8 KB per IO. Or, approximately 20 MB/s per second, loading about 1 GB of database into RAM every 53 seconds.

Note
It is normal for short periods to observe the latencies climb when components aggressively read or write to disk, such as when the system is being backed up or
when AD DS is running garbage collection. Additional head room on top of the calculations should be provided to accommodate these periodic events. The goal
being to provide enough throughput to accommodate these scenarios without impacting normal function.

As can be seen, there is a physical limit based on the storage design to how quickly the cache can possibly warm. What will warm the cache are incoming client
requests up to the rate that the underlying storage can provide. Running scripts to “pre-warm” the cache during peak hours will provide competition to load driven by
real client requests. That can adversely affect delivering data that clients need first because, by design, it will generate competition for scarce disk resources as artificial
attempts to warm the cache will load data that is not relevant to the clients contacting the DC.

30 of 30 3/20/2017 12:05 PM

Opmanager Userguide
No ratings yet
Opmanager Userguide
385 pages
Bike Rental System
79% (39)
Bike Rental System
31 pages
FortiManager 7.2.4 Administration Guide
No ratings yet
FortiManager 7.2.4 Administration Guide
916 pages
Admin 2
No ratings yet
Admin 2
336 pages
Best Practices For Cisco IOS Devices Hardening
No ratings yet
Best Practices For Cisco IOS Devices Hardening
54 pages
AD Issues & Solutions
No ratings yet
AD Issues & Solutions
90 pages
The Ultimate Guide of SSL
No ratings yet
The Ultimate Guide of SSL
93 pages
Windows Server 2008 Group Policy Settings
100% (4)
Windows Server 2008 Group Policy Settings
344 pages
Microsoft Official Course: Designing and Implementing An Active Directory Domain Services Topology
No ratings yet
Microsoft Official Course: Designing and Implementing An Active Directory Domain Services Topology
46 pages
Day 8 AD III
No ratings yet
Day 8 AD III
10 pages
Microsoft Official Course: Maintaining Active Directory® Domain Services
No ratings yet
Microsoft Official Course: Maintaining Active Directory® Domain Services
39 pages
I05 HyperV TShoot
No ratings yet
I05 HyperV TShoot
65 pages
Company's Infrastructure FULL DONE
No ratings yet
Company's Infrastructure FULL DONE
34 pages
End Point Manager
No ratings yet
End Point Manager
31 pages
Microsoft Official Course: Designing and Maintaining An IP Configuration and Address Management Solution
No ratings yet
Microsoft Official Course: Designing and Maintaining An IP Configuration and Address Management Solution
34 pages
Group Policy The Ultimate Guide - Active Directory Pro
No ratings yet
Group Policy The Ultimate Guide - Active Directory Pro
42 pages
Veeam Backup 8 Userguide Vmware
100% (2)
Veeam Backup 8 Userguide Vmware
783 pages
Step-By-Step Guide For Upgrading SYSVOL Replication To DFSR (Distributed File System Replication) - RebelAdmin
No ratings yet
Step-By-Step Guide For Upgrading SYSVOL Replication To DFSR (Distributed File System Replication) - RebelAdmin
16 pages
Deploying Active Directory in Windows Azure: Name Title Microsoft
No ratings yet
Deploying Active Directory in Windows Azure: Name Title Microsoft
24 pages
System Design
No ratings yet
System Design
11 pages
Exchange 2013 Step by Step
100% (1)
Exchange 2013 Step by Step
26 pages
Microsoft Dynamics NAV 2013 R2 Sizing Guidelines For On-Premises Single Tenant Deployments
No ratings yet
Microsoft Dynamics NAV 2013 R2 Sizing Guidelines For On-Premises Single Tenant Deployments
20 pages
ADMT Guide: Migrating and Restructuring Active Directory Domains
No ratings yet
ADMT Guide: Migrating and Restructuring Active Directory Domains
281 pages
How To Rebuild The SYSVOL Tree and Its Content in A Domain - Windows Server - Microsoft Learn
No ratings yet
How To Rebuild The SYSVOL Tree and Its Content in A Domain - Windows Server - Microsoft Learn
14 pages
Abbreviations/Acronyms Expansion
No ratings yet
Abbreviations/Acronyms Expansion
7 pages
Identity Management Design Guide
No ratings yet
Identity Management Design Guide
436 pages
IAM Expert Interview Q&A
No ratings yet
IAM Expert Interview Q&A
2 pages
Module 4: Administering and Troubleshooting Exchange Online Lab: Administering and Troubleshooting Exchange Online
No ratings yet
Module 4: Administering and Troubleshooting Exchange Online Lab: Administering and Troubleshooting Exchange Online
9 pages
Exchange Server 2013 - Functional Test Plan Ver1 3
100% (1)
Exchange Server 2013 - Functional Test Plan Ver1 3
52 pages
LEP Network Firewall Implementation Policy
No ratings yet
LEP Network Firewall Implementation Policy
4 pages
Adfs O365
No ratings yet
Adfs O365
10 pages
Deploying and Managing Certificates
No ratings yet
Deploying and Managing Certificates
34 pages
gopas-goc-166-01-ADFS A WAP PDF
No ratings yet
gopas-goc-166-01-ADFS A WAP PDF
99 pages
Active Directory Improvements
No ratings yet
Active Directory Improvements
3 pages
Metadata Cleanup of A Domain Controller: Active Directory Users and Computers
No ratings yet
Metadata Cleanup of A Domain Controller: Active Directory Users and Computers
5 pages
Offline Assessment For Active Directory: Prerequisites
No ratings yet
Offline Assessment For Active Directory: Prerequisites
14 pages
LAPS OperationsGuide
No ratings yet
LAPS OperationsGuide
24 pages
Step by Step Radius Windows
100% (1)
Step by Step Radius Windows
55 pages
Creating and Managing Active Directory Groups and Organizational Units
No ratings yet
Creating and Managing Active Directory Groups and Organizational Units
9 pages
(Win2k16) Certificate Autoenrollment in Windows Server 2016
No ratings yet
(Win2k16) Certificate Autoenrollment in Windows Server 2016
45 pages
Configuration of FSSO With DC AD PDF
No ratings yet
Configuration of FSSO With DC AD PDF
2 pages
Upgrading Windows Server 2003 2008 Active Directory To Windows Server 2012 R2
No ratings yet
Upgrading Windows Server 2003 2008 Active Directory To Windows Server 2012 R2
58 pages
Big-Ip Virtual Edition Setup Guide For Microsoft Hyper-V
No ratings yet
Big-Ip Virtual Edition Setup Guide For Microsoft Hyper-V
26 pages
Computer Studies
No ratings yet
Computer Studies
242 pages
How To Install An SSL Certificate From A Commercial Certificate Authority - DigitalOcean
100% (1)
How To Install An SSL Certificate From A Commercial Certificate Authority - DigitalOcean
27 pages
Group Policy Planning and Deployment Guide
No ratings yet
Group Policy Planning and Deployment Guide
84 pages
Ad Health
No ratings yet
Ad Health
27 pages
TechNet Group Policy Tips and Tricks
No ratings yet
TechNet Group Policy Tips and Tricks
30 pages
Active Directory Planning and Design
No ratings yet
Active Directory Planning and Design
4 pages
How To - Integrate OpManager With Cyberoam
No ratings yet
How To - Integrate OpManager With Cyberoam
7 pages
Troubleshooting Active Directory Replication Problems
No ratings yet
Troubleshooting Active Directory Replication Problems
54 pages
Lecture:15-17 Memory Interfacing in 8085: Course Instructor: Dr. Khurai
No ratings yet
Lecture:15-17 Memory Interfacing in 8085: Course Instructor: Dr. Khurai
24 pages
Active Directory Interview Questions
No ratings yet
Active Directory Interview Questions
3 pages
Aws Odd Interview Questions
No ratings yet
Aws Odd Interview Questions
36 pages
Computer Science Solved Past Paper For IGCSE EDEXCEL
No ratings yet
Computer Science Solved Past Paper For IGCSE EDEXCEL
4 pages
TOOLBOX II Version 4.10 Installation Manual: © 2008 by Siemens AG
100% (1)
TOOLBOX II Version 4.10 Installation Manual: © 2008 by Siemens AG
18 pages
Oracle Applications - FNDLOAD
No ratings yet
Oracle Applications - FNDLOAD
8 pages
Module 2 Nosql
No ratings yet
Module 2 Nosql
31 pages
Geetha MPMC Term Work
No ratings yet
Geetha MPMC Term Work
25 pages
List of NetBackup Blueprints
No ratings yet
List of NetBackup Blueprints
12 pages
Proxy Server
No ratings yet
Proxy Server
23 pages
Zones Creatoin Under Vcs
No ratings yet
Zones Creatoin Under Vcs
22 pages
01-07 Key GPON Techniques
No ratings yet
01-07 Key GPON Techniques
10 pages
FM24C02A Ds Eng
No ratings yet
FM24C02A Ds Eng
20 pages
Membuat Lampu Traffic Light Dengan AVR Dan Bascom Atmega 16
No ratings yet
Membuat Lampu Traffic Light Dengan AVR Dan Bascom Atmega 16
14 pages
Unite28093v Character Arrays Strings File
No ratings yet
Unite28093v Character Arrays Strings File
18 pages
CS301 Midterm Solved Mega File With Reference
No ratings yet
CS301 Midterm Solved Mega File With Reference
30 pages
List Dokumentasi Aplikasi v4 (Rafif's Changes 2020-07-29)
No ratings yet
List Dokumentasi Aplikasi v4 (Rafif's Changes 2020-07-29)
49 pages
Computer College - Calamba Campus: Basics of Information Systems Information Concepts
No ratings yet
Computer College - Calamba Campus: Basics of Information Systems Information Concepts
18 pages
Book2 (KARA Solution Introduction v2.1401.03)
No ratings yet
Book2 (KARA Solution Introduction v2.1401.03)
20 pages
Windows Registry Foresics
100% (1)
Windows Registry Foresics
25 pages
Session Tracking
No ratings yet
Session Tracking
38 pages
Communication Networks NETW 501: Presented by
No ratings yet
Communication Networks NETW 501: Presented by
32 pages
Database and Web Database Systems
No ratings yet
Database and Web Database Systems
36 pages
C Pattern Program
No ratings yet
C Pattern Program
31 pages
Upnp Design by Example: Michael Jeronimo Jack Weast
No ratings yet
Upnp Design by Example: Michael Jeronimo Jack Weast
19 pages
File Menu: Results List
No ratings yet
File Menu: Results List
2 pages
Mapping Test
No ratings yet
Mapping Test
2 pages
Sap Predictive Analytics Certification Training
No ratings yet
Sap Predictive Analytics Certification Training
7 pages
Sap PP Tables
No ratings yet
Sap PP Tables
1 page
Ultimate Microsoft Intune for Administrators: Master Enterprise Endpoint Security and Manage Devices, Apps, and Cloud Security with Expert Microsoft Intune Strategies (English Edition)
From Everand
Ultimate Microsoft Intune for Administrators: Master Enterprise Endpoint Security and Manage Devices, Apps, and Cloud Security with Expert Microsoft Intune Strategies (English Edition)
Paul Winstanley
No ratings yet
Ultimate Azure IaaS for Infrastructure Management
From Everand
Ultimate Azure IaaS for Infrastructure Management
Dean Cefola
No ratings yet
Mastering Microsoft 365 ENTRA ID - 100 Practical Guides For Secure Identity and Access Management: Mastering Microsoft 365, #122
From Everand
Mastering Microsoft 365 ENTRA ID - 100 Practical Guides For Secure Identity and Access Management: Mastering Microsoft 365, #122
Openshelves
No ratings yet
Mastering Active Directory
From Everand
Mastering Active Directory
VICTOR P HENDERSON
No ratings yet
Microsoft AZURE® AZ-104 Administrator Practice Tests
From Everand
Microsoft AZURE® AZ-104 Administrator Practice Tests
iCertify Training
No ratings yet
Microsoft Exchange 2013 Cookbook
From Everand
Microsoft Exchange 2013 Cookbook
Michael Van Horenbeeck
No ratings yet
26 Ways to Save on Your Utility Bills!: 26 Ways, #1
From Everand
26 Ways to Save on Your Utility Bills!: 26 Ways, #1
Kimberly Peters
No ratings yet
Enterprise Mobile Device Management Complete Self-Assessment Guide
From Everand
Enterprise Mobile Device Management Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Active Directory Rights Management Services A Clear and Concise Reference
From Everand
Active Directory Rights Management Services A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet
AppDynamics Third Edition
From Everand
AppDynamics Third Edition
Gerardus Blokdyk
No ratings yet