Pure Storage FlashArray m20 Microsoft SQL Server Data Warehouse Fast Track Reference Architecture
Pure Storage FlashArray m20 Microsoft SQL Server Data Warehouse Fast Track Reference Architecture
Audience ..................................................................................................................................... 5
FlashArray//m Specifications.................................................................................................................................. 7
Pure1 ........................................................................................................................................................................8
About the SQL Server Data Warehouse Fast Track Program ..................................................... 9
Hardware............................................................................................................................................................... 13
Software................................................................................................................................................................ 13
Certification ............................................................................................................................... 14
Summary ................................................................................................................................... 16
References ................................................................................................................................ 16
Our Pure Storage FlashArray//m, on its cost-effective, entry-level model -- the //m20 -- provides the storage
prowess for this solution. On the compute side, we leverage an off-the-shelf, 2 rack-unit, dual CPU socket
Cisco C240 M4 server, running SQL Server 2014 Enterprise Edition Service Pack 1 on top of Windows Server
2012 R2 Standard Edition.
Audience
This reference architecture is targeted toward Enterprise Architects, CIOs, CTOs, technical managers, and
data professionals such as BI Architects, BI Administrators and Database Administrators.
• 6 cables
• Up to 9 GB/s bandwidth
Modular Scale—Scale FlashArray//m inside and outside of the chassis for generations
The FlashArray//m expands upon the FlashArray’s modular, stateless architecture, designed to enable
expandability and upgradability for generations. The FlashArray//m leverages a chassis-based design with
customizable modules, enabling both capacity and performance to be independently improved over time
with advances in compute and flash, to meet your business’ needs today and tomorrow.
Accelerating Databases and Applications Speed transactions by 10x with consistent low latency, enable online
data analytics across wide datasets, and mix production, analytics, dev/test, and backup workloads without
fear.
Virtualizing and Consolidating Workloads Easily accommodate the most IO-hungry Tier 1 workloads, increase
consolidation rates (thereby reducing servers), simplify VI administration, and accelerate common
administrative tasks.
Delivering the Ultimate Virtual Desktop Experience Support demanding users with better performance than
physical desktops, scale without disruption from pilot to >1000’s of users, and experience all-flash
performance for under $100/desktop.
Protecting and Recovering Vital Data Assets Provide an always-on protection for business-critical data,
maintain performance even under failure conditions, and recover instantly with FlashRecover.
Pure Storage FlashArray sets the benchmark for all-flash enterprise storage arrays. It delivers:
Less Cost than Disk Inline de-duplication and compression deliver 5 – 10x space savings across a broad set of
I/O workloads including Databases, Virtual Machines and Virtual Desktop Infrastructure.
Mission-Critical Resiliency FlashArray delivers >99.999% proven availability, as measured across the Pure
Storage installed base and does so with non-disruptive everything without performance impact.
Disaster Recovery Built-In FlashArray offers native, fully-integrated, data reduction-optimized backup and
disaster recovery at no additional cost. Setup disaster recovery with policy-based automation within minutes.
And, recover instantly from local, space-efficient snapshots or remote replicas.
Simplicity Built-In FlashArray offers game-changing management simplicity that makes storage installation,
configuration, provisioning and migration a snap. No more managing performance, RAID, tiers or caching.
Achieve optimal application performance without any tuning at any layer. Manage the FlashArray the way
you like it: Web-based GUI, CLI, VMware vCenter, Rest API, or OpenStack.
FlashArray//m Specifications
//m20 //m50 //m70
Capacity • Up to 120+ TBs effective capacity* • Up to 250+ TBs effective capacity* • Up to 400+ TBs effective capacity*
• 5 – 40TBs raw capacity • 30 – 88TBs raw capacity (w/shelves) • 44 – 136TBs raw capacity
(base chassis) (w/shelves)
Performance • Up to 150,000 32K IOPS** • Up to 220,000 32K IOPS** • Up to 300,000 32K IOPS**
• <1ms average latency • <1ms average latency • <1ms average latency
• Up to 5 GB/s bandwidth • Up to 7 GB/s bandwidth • Up to 9 GB/s bandwidth
Connectivity • 8 Gb/s Fibre Channel • 16 Gb/s Fibre Channel • 16 Gb/s Fibre Channel
• 10 Gb/s Ethernet iSCSI • 10 Gb/s Ethernet iSCSI • 10 Gb/s Ethernet iSCSI
• Management and Replication ports • Management and Replication ports • Management and Replication ports
* Effective capacity assumes HA, RAID, and metadata overhead, GB-to-GiB conversion, and includes the benefit of data
reduction with always-on inline deduplication, compression, and pattern removal. Average data reduction is calculated at 5-to-1,
below the global average of the FlashArray user base.
** Why does Pure Storage quote 32K, not 4K IOPS? The industry commonly markets 4K IOPS, but real-world environments are dominated by
IO sizes of 32K or larger. FlashArray//m adapts automatically to 512B-32KB IO for superior performance, scalability, and data reduction.
Storage Software Built for Flash—The FlashCare technology virtualizes the entire pool of flash within the
FlashArray, and allows Purity to both extend the life and ensure the maximum performance of consumer-
grade MLC flash.
Granular and Adaptive—Purity Core is based upon a 512-byte variable block size metadata layer. This fine-
grain metadata enables all of Purity’s data and flash management services to operate at the highest
efficiency.
Best Data Reduction Available—FlashReduce implements five forms of inline and post-process data
reduction to offer the most complete data reduction in the industry. Data reduction operates at a 512-byte
aligned variable block size, to enable effective reduction across a wide range of mixed workloads without
tuning.
Highly Available and Resilient—FlashProtect implements high availability, dual-parity RAID-3D, non-
disruptive upgrades, and encryption, all of which are designed to deliver full performance to the FlashArray
during any failure or maintenance event.
Backup and Disaster Recovery Built In—FlashRecover combines space-saving snapshots, replication, and
protection policies into an end-to-end data protection and recovery solution that protects data against loss
locally and globally. All FlashProtect services are fully-integrated in the FlashArray and leverage the native
data reduction capabilities.
Pure1
Pure1 Manage—By combining local web-based management with cloud-based monitoring, Pure1 Manage
allows you to manage your FlashArray wherever you are – with just a web browser.
Pure1 Connect—A rich set of APIs, plugin-is, application connectors, and automation toolkits enable you to
connect FlashArray//m to all your data center and cloud monitoring, management, and orchestration tools.
Pure1 Support—FlashArray//m is constantly cloud- connected, enabling Pure Storage to deliver the most
proactive support experience possible. Highly trained staff combined with big data analytics help resolve
problems before they start.
• Reduce Total Cost of Ownership (TCO), since the architecture has been pre-tested and adheres to
Microsoft’s guidelines for predictable, consistent performance.
The array is connected directly to a Cisco C240 M4 server via eight (8) 8 gigabit-per-second fibre channel
ports on four (4) dual-port host bus adapters*. This maximizes throughput available, which is essential to
data warehousing workloads.
The server used in the solution is an off-the-shelf one, with 256GB of RAM. Cost-effective CPUs were used in
this solution (Intel Xeon E5-2670 v3). The server has dual CPU sockets, and each CPU has 24 hyper threaded
cores, for a total of 48 logical processors in the system.
The entire solution fits in 5 rack units: 3 units for the FlashArray//m20, 2 rack units for the Cisco C240 M4
server.
Storage Layout
The storage layout for this solution is extremely simple. As our reference server had no direct attached
storage, we configured it to boot from SAN on a FlashArray volume. We also created four volumes for it, in
addition to the boot volume:
1. Anchor (10GB) – a small volume that serves as the root and contains mount points for all other
volumes
2. tempdb (1TB) – placed all tempdb database files and transaction log file on this volume
3. Data and log files (8TB) – placed all user database files and transaction log files on this volume
4. Backup (2TB) – created a volume for backups (not required by the DWFT certification, however we do
suggest backups be split onto their own volume per our best practices)
Similarly, the ColumnStore-based version of the database reduces nicely to 1.7:1, occupying 913GB of space in
the array.
3. Database-specific settings
Maximum Degree of 24, however the Fast Track utility can push MAXDOP as high as
Parallelism (MAXDOP) the number of logical cores in the system
Database-Specific Settings
In this solution none of these compression settings were tuned or modified in any way, shape, or form. Pure
Storage FlashArrays data-reduction arrays can deliver excellent performance on both compressed and
uncompressed data.
Hardware
Cisco C240 M4 server with 2x Intel Xeon E5-2670 v3 CPUs, 256GB of RAM
Pure Storage FlashArray//m20, 20TB raw capacity (2x 10TB Capacity Packs), 4x Dual-port 8Gbps FC HBAs
Software
Purity Operating Environment 4.5.8
UCSC-RAILB-M4 1 Ball Bearing Rail Kit for C220 M4 and C240 M4 rack sewers
CAB-9K12A-NA 2 Power Cord, 125VAC 13A NEMA 5-15 Plug, North America
Certification
The figure below shows the Microsoft SQL Server Data Warehouse Fast Track certification awarded to our
FlashArray//m20-based solution described in this document. Of particular interest, the columns “Measured
Scan Rate Physical” and “Measured I/O Throughput”.
References
In this section you will find some relevant links for further information on this solution:
• SQL Server on Pure Storage: Best Practices (Pure Storage Community, login required)
http://community.purestorage.com/t5/Interoperability-Best-Practice/Microsoft-SQL-Server-Best-
Practices-Community-Page/ta-p/6097
• Fast Track Data Warehouse Reference Guide for SQL Server 2012 – Microsoft
https://msdn.microsoft.com/en-us/library/hh918452.aspx
• Windows Server on Pure Storage: Best Practices (Pure Storage Community, login required)
http://community.purestorage.com/t5/Interoperability-Best-Practice/Windows-Server-Best-
Practices/ta-p/166
Argenis has been a Microsoft Most Valuable Professional (MVP) for Data
Platform (previously known as SQL Server MVP) since 2014 – an award
given to highly influential technologists in the data management and
analytics space.
Argenis is a SQL community enthusiast and speaks frequently at major SQL Server conferences, including the
PASS Summit, IT/Dev Connections, SQLBits, and TechEd. He has also spoken at many SQLSaturday events
around the globe. He is also a Microsoft Certified Master (MCM), the highest technical certification Microsoft
ever offered, with less than 200 certified individuals worldwide. In 2013 he founded the Security Virtual
Chapter for the Professional Association for SQL Server (PASS) (http://security.sqlpass.org) .
Blog: http://blog.purestorage.com/blog/author/argenis
Twitter: @DBArgenis
T: 650-290-6088
F: 650-625-9667
Sales: [email protected]
Support: [email protected]
Media: [email protected]
General: [email protected]
© 2015 Pure Storage, Inc. All rights reserved. Pure Storage, Pure1, and the P Logo are trademarks of Pure Storage, Inc. All other trademarks are the property of their respective owners.