hs06 performance per watt and transition to sl6

34
HS06 performance per watt and transition to SL6 Michele Michelotto – INFN Padova 1

Upload: eytan

Post on 22-Feb-2016

41 views

Category:

Documents


0 download

DESCRIPTION

HS06 performance per watt and transition to SL6. Michele Michelotto – INFN Padova. The SL6 transition. Rumors of sizeable differences of HS06 across Scientific Linux distribution on same hardware I made detailed measurements on AMD Opteron 6272 SL5 with gcc 4.1.2 SL6 with gcc 4.4.0 - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: HS06 performance per watt and  transition  to SL6

HS06 performance per watt and transition to SL6

Michele Michelotto – INFN Padova

1

Page 2: HS06 performance per watt and  transition  to SL6

The SL6 transition

2

Rumors of sizeable differences of HS06 across Scientific Linux distribution on same hardware

I made detailed measurements on AMD Opteron 6272 SL5 with gcc 4.1.2 SL6 with gcc 4.4.0 SL6 with last compiler available at that time 4.7.0

End of August We started collecting SL6 results from WLCG sites Alessandra Forti asked to send results to me and

Manfred Manfred created a new page on the HEPiX site.

Page 3: HS06 performance per watt and  transition  to SL6

SL6 performance vs. SL5

3

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 360

50

100

150

200

250

300

SL5SL6

Page 4: HS06 performance per watt and  transition  to SL6

SL6+gcc4.7 vs. SL6 vs. SL5

4

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 360

50

100

150

200

250

300

AMD Opteron 6272 HS06 32bit

threads

HS

06

Page 5: HS06 performance per watt and  transition  to SL6

SL6+gcc4.7 and SL6 gcc 4.4Diff with SL5 and gcc 4.1.2

5

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 360.00%

5.00%

10.00%

15.00%

20.00%

25.00%

30.00%

35.00%

40.00%

AMD Opteron 6272 HS06 32bit

threads

HS

06

Page 6: HS06 performance per watt and  transition  to SL6

Let’s do it on Intel Xeon

6

Page 7: HS06 performance per watt and  transition  to SL6

Differences SL6 gcc4.7 and SL6 gcc4.4 wrt SL5 gcc4.1

7

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 380.00%

5.00%

10.00%

15.00%

20.00%

25.00%

30.00%

HEP-SPEC06eon E5 HS06 32bit

SL6 / SL5 Ratio

gcc4.7/SL5 ra-tio

threads

HE

P-S

PE

C06

Page 8: HS06 performance per watt and  transition  to SL6

HEP-SPEC06 site maintained mainly by Manfred

8

Page 9: HS06 performance per watt and  transition  to SL6

SL6 vs. SL5 from pair of similar worker node

9

AMD Opteron

6168

AMD Opteron

6174

AMD Opteron

6276

Intel Xeon 5520

Intel Xeon 5520

Intel Xeon

E5-2665

Intel Xeon

E5-2670

Intel Xeon

E5-2670

Intel Xeon

E5-2670

Intel Xeon

E5520

Intel Xeon

E5520

Intel Xeon

E5630

Intel Xeon

X5650

Intel Xeon

X5650

Intel Xeon

X5650

Intel Xeon

X5650

0.00%

2.00%

4.00%

6.00%

8.00%

10.00%

12.00%

14.00%

SL6/SL5

Page 10: HS06 performance per watt and  transition  to SL6

Ivy Bridge vs. Sandy Bridge

10

Page 11: HS06 performance per watt and  transition  to SL6

Ivy Bridge vs. Sandy Bridge 64 bit

11

Page 12: HS06 performance per watt and  transition  to SL6

Performances per clock

12

Page 13: HS06 performance per watt and  transition  to SL6

Mail from Manfred on Friday 25th

13

DELL C6620 (2U, 4nodes) Each node

2 x Intel Xeon E5-2670 v2 – 10 cores (20 Logical cpu) @ 2.5 GHz

64 GB (8x8 GB PC3-14900) 6x900 GB SAS

342 HS06 (20 copies) - 411 HS06 (40 copies)

Page 14: HS06 performance per watt and  transition  to SL6

Adding Manfred new beast

14

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 480

20

40

60

80

100

120

140

160

180

200

dual e5-2697v2 (2.7GHZ 24 cores 48 L-CPU) vs

dual E5-2660 (2.2 GHz 16 cores 32 L-CPU)Vs

dual E5-2670v2 (2.5 Ghz 20 cores 40 L-CPU)

2697v2 2.7 GHz 2660 2.2 GHz 2670v2 2.5 MHz

# concurrent runs

HS06

per

MHz

Page 15: HS06 performance per watt and  transition  to SL6

A new architecture: ARM

15

Page 16: HS06 performance per watt and  transition  to SL6

A new architecture

16

Exynos4412 Prime CPU

1.7 GHz Cortex -A9 quad core

2GB LP-DDR2 memory (512MB/core)

$89 each Fedora 18, armV7,

gcc4.8, ODROID kernel

Page 17: HS06 performance per watt and  transition  to SL6

Courtesy of Peter Elmer, Princeton Univ.

17

Page 18: HS06 performance per watt and  transition  to SL6

HS06 measured on ARM

18

0 1 2 3 4 50.00

2.00

4.00

6.00

8.00

10.00

12.00

14.00

HS06HS06/core

Page 19: HS06 performance per watt and  transition  to SL6

Measurements of power consumption

19

Measurements of voltage, amperage and power consumption

The power logger Measurements setup Single core Multicore 32 bit measurements. 64 bit measurements Collecting results from Manfred Measurements on ARM

Page 20: HS06 performance per watt and  transition  to SL6

Fluke 1735 Three-Phase Power Logger

20

Page 21: HS06 performance per watt and  transition  to SL6

Measurement setup for single phase

21

Page 22: HS06 performance per watt and  transition  to SL6

On display

22

Page 23: HS06 performance per watt and  transition  to SL6

Power logger sw

23

Idlecompile First run Second run Third run

Page 24: HS06 performance per watt and  transition  to SL6

Black average – Green min –Red Max

24

Page 25: HS06 performance per watt and  transition  to SL6

Power consumption (Watt) on Intel Xeon E5 2660

25

32 copies

30 copies

28 copies

26 copies

24 copies

22 copies

20 copies

18 copies

16 copies

14 copies

12 copies

10 copies

8 copies

6 copies

4 copies

3 copies

2 copies

1 copy gcc idle0.00

50.00

100.00

150.00

200.00

250.00

300.00

350.00

400.00

Intel Xeon E52660 - 2PSU

Page 26: HS06 performance per watt and  transition  to SL6

Min Average Max

26

32 copies

30 copies

28 copies

26 copies

24 copies

22 copies

20 copies

18 copies

16 copies

14 copies

12 copies

10 copies

8 copies

6 copies

4 copies

3 copies

2 copies

1 copy gcc idle0.00

50.00

100.00

150.00

200.00

250.00

300.00

350.00

400.00

450.00

Page 27: HS06 performance per watt and  transition  to SL6

Efficiency HS06/Watt

27

32 copies

30 copies

28 copies

26 copies

24 copies

22 copies

20 copies

18 copies

16 copies

14 copies

12 copies

10 copies

8 copies

6 copies

4 copies

3 copies

2 copies

1 copy gcc idle0.00

200.00

400.00

600.00

800.00

1000.00

1200.00

HS06/kWatt

Page 28: HS06 performance per watt and  transition  to SL6

Historical Trend from Manfred

28

Jan-04 May-05 Oct-06 Feb-08 Jul-09 Nov-10 Apr-12 Aug-130.00

0.20

0.40

0.60

0.80

1.00

1.20

HS06/WattXEON E5 26x0

XEON 54xx

XEON 51xxAMD 2xx

AMD 6168

Page 29: HS06 performance per watt and  transition  to SL6

HS06/Watt with ARM processor

29

Jan-04 May-05 Oct-06 Feb-08 Jul-09 Nov-10 Apr-12 Aug-13 Dec-140.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

HS06/Watt

XEON E5 26x0

ARM

XEON 54xxXEON 51xxAMD 2xx

AMD 6168

Page 30: HS06 performance per watt and  transition  to SL6

Mail from Manfred on Friday 25th

30

DELL C6620 (2U, 4nodes) Each node

2 x Intel Xeon E5-2670 v2 – 10 cores (20 Logical cpu) @ 2.5 GHz

64 GB (8x8 GB PC3-14900) 6x900 GB SAS

342 HS06 (20 copies) - 411 HS06 (40 copies) 1450 Watt on four nodes 362 Watt/node

Page 31: HS06 performance per watt and  transition  to SL6

Mail from Manfred on Friday 25th

31

DELL C6620 (2U, 4nodes) Each node

2 x Intel Xeon E5-2670 v2 – 10 cores (20 Logical cpu) @ 2.5 GHz

64 GB (8x8 GB PC3-14900) 6x900 GB SAS

342 HS06 (20 copies) - 411 HS06 (40 copies) 1450 Watt on four nodes 362 Watt/node

Page 32: HS06 performance per watt and  transition  to SL6

Jan-04 May-05 Oct-06 Feb-08 Jul-09 Nov-10 Apr-12 Aug-13 Dec-140.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

HS06/Watt

HS06/Watt +XeonE5v2670v2 (Manfred)

32

Page 33: HS06 performance per watt and  transition  to SL6

Future work

33

New Xeon E5 v2 very good performances Detailed measurements on Xeon E5 v2 in

HS06/watt New Intel server processor

Avoton New ARM processors

64bit processor will be available

Page 34: HS06 performance per watt and  transition  to SL6