d ell poweredge r910 sql oltp virtualization studymedia.community.dell.com/en/dtc/attach/r910...

14
Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel ® Xeon E7 ® Processors and Low-Voltage Memory A Dell Technical White Paper Dell, Inc. Waseem Raja Systems Performance Analysis (OCTO) John Beckett Systems Performance Analysis (OCTO)

Upload: lamtuyen

Post on 10-Mar-2018

233 views

Category:

Documents


2 download

TRANSCRIPT

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel

® Xeon E7

® Processors and Low-Voltage Memory

A Dell Technical White Paper

Dell, Inc.

Waseem Raja Systems Performance Analysis (OCTO)

John Beckett Systems Performance Analysis (OCTO)

PowerVault™ DL Backup to Disk Appliance: Key Factors Impacting Performance

Page ii

THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL

ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS OR

IMPLIED WARRANTIES OF ANY KIND.

Dell, the DELL logo, and the DELL badge, PowerEdge, PowerConnect, EqualLogic, and PowerVault are

trademarks of Dell Inc. Microsoft, Windows, Windows Server, SQL Server, Exchange, and Active

Directory are either trademarks or registered trademarks of Microsoft Corporation in the United States

and/or other countries. VMware, VSphere, VCenter, and ESX are either trademarks or registered

trademarks of VMware, Inc. Intel and Xeon are registered trademarks of Intel Corporation. Other

trademarks and trade names may be used in this document to refer to either the entities claiming the

marks and names or their products. Dell Inc. disclaims any proprietary interest in trademarks and trade

names other than its own.

April 2011

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 1

Contents Executive Summary ....................................................................................................... 2

Introduction ................................................................................................................ 2

Test Description ........................................................................................................... 3

Sample Environment Characterization .............................................................................. 4

Test Methodology .......................................................................................................... 6

Overview ................................................................................................................. 6

Data Collection Overview ............................................................................................. 6

Test Results................................................................................................................. 7

Performance Comparison Between Nehalem-EX and Westmere-EX ........................................... 7

Power Measurements at 100% and Idle ............................................................................. 7

OPM Per Watt (OPM/watt) ............................................................................................ 9

Conclusion ............................................................................................................. 10

Appendix A: Additional Configuration Details ..................................................................... 11

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 2

Executive Summary Dell’s Systems Performance Analysis team was commissioned by Server Outbound Marketing to measure

performance and performance/watt in a virtualized environment to compare the Intel® Xeon® X7560

(Nehalem-EX) processors on the Dell™ PowerEdge™ R910 against the new Intel Xeon E7-4870

(Westmere-EX) processors. The configuration was aimed at exploiting the strengths of the PowerEdge

R910 by equipping the system with four processors, a 10Gb network connection to backend SAN, and

sufficient memory to run the workload at 100% CPU utilization for both processor architectures tested.

The testing indicates that the Xeon E7 processors offer significant performance gains within the same

power envelope, while the addition of LV RDIMMs actually lowers system power draw at both idle and

100% CPU utilization.

Key Findings:

The Dell PowerEdge R910 with Intel Xeon E7-4870 processors provided 34% higher virtualized

OLTP performance than the PowerEdge R910 with Intel Xeon X7560 processors. The R910

configuration with Intel Xeon E7-4870 processors obtained 727,049 orders per minute (OPM) in

a virtualized environment, as compared to the R910 configuration with Intel Xeon X7560

processors, which scored 541,971 OPM.

The Dell PowerEdge R910 with Intel Xeon E7-4870 processors obtained 50% higher

performance/watt than the PowerEdge R910 with Intel Xeon X7560 processors. The R910

configuration with Intel Xeon E7-4870 processors obtained a performance/watt score of 825 in

a virtualized environment, as compared to the R910 configuration with Intel Xeon X7560

processors, which achieved a performance/watt score of 551.

The Dell PowerEdge R910 with Intel Xeon E7-4870 processors used 10% less overall power

while running the virtualized OLTP workload than the PowerEdge R910 with Intel Xeon X7560

processors. The R910 configuration with Intel Xeon E7-4870 processors used an average of 881

watts, as compared to the R910 configuration with Intel Xeon X7560 processors which used an

average of 984 watts.

The Dell PowerEdge R910 with Intel Xeon E7-4870 processors used 2% less overall power at

idle than the PowerEdge R910 with Intel Xeon X7560 processors. The R910 configuration with

Intel Xeon E7-4870 processors used an average of 552 watts at idle, as compared to the R910

configuration with Intel Xeon X7560 processors which used an average of 563 watts.

Introduction In order to assist customers to understand virtualization performance, this study was aimed primarily at

characterizing specific application performance in a virtualized environment to characterize the Dell

PowerEdge R910 with two generations of Intel Xeon processors. Many other virtualization benchmarks

exist, such as VMmark and SPECvirt, but this study is intended to be a means to answer a common

question which cannot be answered by other benchmarks; specifically, performance of a virtualized

Microsoft® SQL Server® scenario on a robust quad-socket Dell server.

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 3

Considerations include:

Comparing Microsoft SQL Server virtualized performance between the Dell PowerEdge R910 that launched in March of 2010 with Intel Nehalem-EX processors against the second version of the Dell PowerEdge R910 that launched in April 2011 equipped with Intel Westmere-EX processors.

Comparing the power utilization of the Dell PowerEdge R910 with Nehalem-EX processors against the Dell PowerEdge R910 with Westmere-EX processors at idle and 100% CPU utilization levels.

Comparing the performance/watt of the Dell PowerEdge R910 with Nehalem-EX processors against the Dell PowerEdge R910 with Westmere-EX processors.

This study is not intended to be a comprehensive analysis of virtualization performance between Dell

PowerEdge platforms, for only the R910 was chosen to characterize the relative gains in performance

and power efficiency by comparing two generations of processors. In addition, new supported memory

types and other elements of the R910 have been updated during the April 2011 launch of the

Westmere-EX processor lineup, so this plays a role as well in affecting the comparison.

Test Description DVD Store Version 2 (DS2) is a complete online e-commerce test application, with a back-end database

component, a Web application layer, and driver programs. The goal in designing the database

component as well as the mid-tier application was to utilize many advanced database features (e.g.,

transactions, stored procedures, triggers, and referential integrity) while keeping the database easy to

install and understand. The DS2 workload may be used to test databases or as a stress tool for any

purpose.

The distribution includes code for Microsoft SQL Server, Oracle, and MySQL databases. Included in the

release are data generation programs, shell scripts to build data for 10 MB, 1 GB, and 100 GB versions

of the DVD Store; database build scripts and stored procedures, PHP Web pages; and a C# driver

program.1

Database workload sizes can be small, medium, or large, depending on environment setup. See

Table 1 for details.

Table 1 DVD Store Database Size Description

Database Orders Size Products Customers

Small 10 MB 20,000 1,000/month 10,000

Medium 1 GB 2,000,000 100,000/month 100,000

Large 100 GB 200,000,000 10,000,000/month 1,000,000

1 Test Description information from Dell Enterprise Technology Center website: http://www.delltechcenter.com/page/DVD+Store+Readme

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 4

Sample Environment Characterization To measure consolidated VM infrastructure host performance in a simulated e-commerce scenario, a

full end-to-end solution was implemented. This environment consists of a physical server (host) running

VMware® ESX™ 4.1. This is referred to as the system under test (SUT) in the diagram.

In addition to the SUT, the environment contains an environment controller system and a workload

driver system. The environment controller is running Windows Server 2008 SP1. The intent of this

system is to manage the VMware® vCenter™/vSphere™ client and act as a logging device for the power

analyzer. The workload driver system is running Windows Server® 2008 R2 Enterprise. This system is

used to create simultaneous SQL user workload instances on the VMs that are being tested.

In addition to these three physical systems, the environment contains a 10Gb Ethernet PowerConnect™

8024F switch, four EqualLogic™ PS6010 SAN, and a Yokogawa WT210 power analyzer. The

PowerConnect switch enables communications between the VMs and controller systems (workload

driver and environment controller), and also between the system under test (SUT) and the iSCSI-based

EqualLogic storage backend. The power analyzer is connected to the environment controller system

through a serial cable. The power analyzer takes one-second snapshots of all power readings and the

controller logs this data during both the idle and active portions of the benchmark runs.

Figure 1 Test Environment

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 5

Virtualized SQL SUT Configurations Configurations used in the test environment are shown in Table 2 and Table 3.

Table 2 SUT Configurations

Platform R910 Nehalem-EX R910 Westmere-EX

Operating System VMware ESX 4.1

Processor Model 4 x Intel Xeon X7560 4 x Intel Xeon E7-4870

Processor Frequency 2267 MHz 2400 MHz

Memory Details 32 x 4GB DR 1.5V 32 x 8GB QR 1.35V

Memory Frequency 1066 MHz 1066 MHz

Internal Storage 2x 2.5” 146GB 15k RPM SAS (RAID 1)

BIOS Power Management Setting Maximum Performance

Physical Cores 32 40

Logical Processors 64 80

# 2-vCPU VMs 32 40

Table 3 Supporting Infrastructure Configurations

Environment Controller System SQL Workload Driver System

PowerEdge 1950 PowerEdge R905

Windows Server 2008 Enterprise SP1 Windows Server 2008 R2 Enterprise

2 x Intel Xeon processor E5410, 2.33 GHz 4 x AMD Opteron 8435, 2.60 GHz

Memory: 16 GB RAM Memory: 96 GB RAM

Disk Configuration: 2x 73 GB SAS (RAID 1) Disk Configuration: 2 x 146 GB SAS (RAID 1)

Networking Power Analyzer

PowerConnect 8024F Monitors SUT only

Utilizing 10 Gigabit connection rate Measures all power data during active portion of the benchmark

1 network port used per EqualLogic SAN (4 total)

Measurements recorded during official run:

1 network port used for SUT Average watts during active period

1 port used as management port (via vSphere) Average watts during idle period

Jumbo frames enabled on switch and on SANs

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 6

Test Methodology

Overview The following methodology was used in the testing procedure:

Determined overall system performance at an equal CPU utilization (100%) across all logical

processors with predetermined configurations and a methodology that produces consistent and

repeatable data.

Exercised all enabled VMs for a 10 minute run time.

Recorded Performance metric, SUT Performance counters, and Power Measurements.

Note: The metric recorded is OPM (orders per minute—a measure of the number of orders processed

during the benchmark runtime).

Data Collection Overview The following steps were used in collecting data:

1. Determined the number of VMs to enable during testing, keeping in mind that the VM count should be half the number of physical processor cores available. (Each VM is a two vCPU VM).

2. Built a medium-size SQL database on each VM using DVD Store scripts.

3. Backed up each SQL database to a restore file, allowing the database to be restored from a pristine state, and then deleted all VM databases.

4. Restored all databases from the backup restore file. 5. Rebooted all VMs. 6. Performed a warm-up run to prepare databases for the official run, allowing each

database to warm up individually and allowing a 10-minute idle period after warm up to allow VMs to return to a steady state.

7. Performed the official run starting all VMs simultaneously using a script. The power analyzer began measuring the system power at the start of the official run and an ESX host was triggered to start esxtop batch session during the official run. (Note: The VMware® esxtop tool gathered system-related performance counters for profiling CPU usage during official run.)

8. Adjusted workload by adding additional VMs after each run to lower or raise CPU usage and then restarted official run to get an official measurement as close to 100% CPU utilization on each.

9. Checked the official run data for performance stability, collecting data if stable and restarting the process if high variation was present.

Data collected during the official run included:

Average watts used during the official run (power metric)

Average of the total esxtop CPU counters during the official run (CPU percentage)

VM throughput in OPM (Orders per Minute) (metric used in this study is an aggregate OPM measurement of all enabled VMs and is represented as performance in the test results)

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 7

Test Results

Performance Comparison Between Nehalem-EX and Westmere-EX Quad-socket performance measurements on the Dell PowerEdge R910 show the increase in

performance between the two generations of Intel processors. The Intel Xeon X7560 2.26GHz eight-

core processor in the PowerEdge R910 was compared to the Intel Xeon E7-4870 2.4 GHz ten-core

processor. The addition of 16 additional logical processors in a Hyperthreaded 4P configuration using

the Intel E7-4870 processors necessitated adding 8 more VMs to the configuration in order to saturate

the system’s processing capabilities while running this benchmark. Additionally, the memory was

doubled from 128 GB with the Intel X7560 to 256GB with the Intel E7-4870 to provide sufficient memory

to run all VMs.

The PowerEdge R910 configuration using the Intel Xeon E7-4870 processor provided 34% more

performance, as recorded in aggregate orders per minute (OPM) across all VMs, as compared to the

R910 configuration equipped with the Intel Xeon X7560 processor.

Figure 2 Virtualized SQL Performance

Power Measurements at 100% and Idle Power consumption of the SUT during the active states of the benchmark was also measured. The

power analyzer used reports 1-second snapshots of the average power being consumed at any given

point of the measured sample. The average of these snapshots over the active period of the benchmark

was used as the basis for the average power consumed.

The R910 configuration using Intel Xeon E7-4870 processors used 10% less power at 100% utilization

than the R910 configuration using Intel Xeon X7560 processors. The R910 with E7-4870 processors used

an average of 881 watts, and the R910 with X7560 processors used an average of 984 watts. Some of

the power delta can be attributed to the capability of the E7-4870 configuration to use Low Voltage

DIMMs, even though the memory configuration was twice as large as the X7560 configuration.

Dell PowerEdge R910Intel Xeon X7560

Dell PowerEdge R910Intel Xeon E7-4870

541971 727049

Per

form

ance

(O

PM

) at

10

0%

Virtualized SQL Performance Orders per Minute at 100%

(Higher Is Better)

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 8

At idle, the configuration using Intel Xeon E7-4870 processors used an average of 2% less power than

the configuration using Intel Xeon X7560 processors. The configuration with E7-4870 processors used an

average of 552 Watts at idle, while the configuration with X7560 processors used an average of 563W.

Even lower idle power readings would be expected if the BIOS Setup Program Power Management policy

was modified to Dell Active Power Controller from the Maximum Performance mode used in this study.

Figure 3 Average Power at 100% (Lower Is Better)

Figure 4 Average Power at Idle (Lower is Better)

Dell PowerEdge R910Intel Xeon X7560

Dell PowerEdge R910Intel Xeon E7-4870

984

881

Ave

rage

Po

wer

(W

atts

)

Virtualized SQL Power Utilization Average Power at 100%

(Lower Is Better)

Dell PowerEdge R910Intel Xeon X7560

Dell PowerEdge R910Intel Xeon E7-4870

563

552

Ave

rage

Po

wer

(W

atts

)

Virtualized SQL Power Utilization Average Power at Idle

(Lower Is Better)

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 9

OPM Per Watt (OPM/watt) The overall benchmark score (OPM) is divided by the average power consumed to generate the OPM per

watt (OPM/watt) metric shown in Figure 5. This value is considered to be the measure of performance

per increment of power used. Higher is better in this metric.

Formula: Aggregate Performance (OPM) / Average Power Consumed = OPM per Watt (OPM/watt)

In this comparison, the PowerEdge R910 configuration with Intel Xeon E7-4870 processors achieved the

highest result with a score of 825 OPM/watt – a difference of 50% over the PowerEdge R910

configuration with Intel Xeon X7560 processors. The PowerEdge R910 configuration with X7560

processors scored 551 OPM/watt.

Figure 5 Performance Per Watt (Higher is Better)

Dell PowerEdge R910Intel Xeon X7560

Dell PowerEdge R910Intel Xeon E7-4870

551

825

Per

form

ance

/Wat

t at

10

0%

Virtualized SQL Performance/Watt OPM / Average Watts at 100%

(Higher Is Better)

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 10

Conclusion The goal of this study was to characterize the performance and power impacts of the transition of the

Dell PowerEdge R910 platform from Nehalem-EX support at initial platform launch in March 2010 to

Westmere-EX support in April 2011 using a virtualized database workload. It should be noted that

actual customer environments are unlikely to be running at 100% processor utilization levels, but the

relative scaling achieved with the processor transition to Westmere-EX on the R910 platform in this test

should still be indicative of the merits of the new incarnation of Intel Xeon processors on the Dell

PowerEdge R910.

Based on the data contained in this study, the PowerEdge R910 with E7-4870 processors provides

considerable performance and performance/watt advantages while reducing the overall system power

at idle and at the 100% utilization level over the previous generation Intel Xeon processors in the

PowerEdge R910. The PowerEdge R910 with Intel Xeon E7-4870 processors would be the platform of

choice for maximum performance and expandability with 64 DIMM slots and 10 PCIe slots, as well as the

ability to use low-voltage DIMMs to help reduce overall system power consumption.

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 11

Appendix A: Additional Configuration Details Configuration as Tested

Platform PowerEdge R910 PowerEdge R910

Processor Type Intel Xeon X7560 Intel Xeon E7-4870

# Processors Populated 4 4

Clock Speed (GHz) 2.26 2.40

Total Cores/System 32 40

Total Logical Processors 64 80

L3 Cache (MB) 24 30

Memory 128 GB (32 x 4GB 2R) 256 GB (32 x 8GB 4R LV)

# Memory Risers 8 8

# HDD/Size 2 x 146GB 15K 2.5" 6GBbps SAS 2 x 146GB 15K 2.5" 6GBbps SAS

RAID Type RAID 1 RAID 1

HBA PERC H700 PERC H700

BIOS Version 1.2.2 2.1.0

iDRAC Version 1.57.12 1.70.20

Total NIC ports used 1 x 10Gb, 1 x 1Gb 1 x 10Gb, 1 x 1Gb

# 10GB NICs 1 1

Type of 10Gb NICs 10Gb Intel XF PCI-E FC single port 10Gb Intel XF PCI-E FC single port

Total PCIe Slots Used 1 1

Power Supply Qty/Rating 2 x 1100W 2 x 1100W

Voltage Used for Testing 208v 208v

Dell™ PowerEdge™ R910 SQL OLTP Virtualization Study

Page 12

Virtualized Environment Details

VMware ESX Version 4.1.0 4.1.0

OS Version Installed/VM

Windows 2008 Enterprise R2 x64

Windows 2008 Enterprise R2 x64

SQL Version Installed per VM

Microsoft SQL 2008 R2 Microsoft SQL 2008 R2

#Virtual Machines 32 40

#Logical Processors per VM 2 2

EqualLogic Configuration

EqualLogic Systems in Use 4 4

EqualLogic System Type

4 x EqualLogic PS6010 (RAID-10)

4 x EqualLogic PS6010 (RAID-10)

Total Storage per EQL box 3.66TB 3.66TB

Size and # HDDs/EqualLogic 16 x 600GB 16 x 600GB

R910 BIOS settings used

Turbo Enabled Enabled

Prefetchers Both Enabled Both Enabled

C-states Enabled Enabled

C1E Enabled Enabled

Logical Processors Enabled Enabled

Node Interleave Disabled Disabled

Power Management Maximum Performance Maximum Performance