scalable storage configuration for the physics database services luca canali, cern it lcg database...

22
Scalable Storage Scalable Storage Configuration for the Physics Configuration for the Physics Database Services Database Services Luca Canali Luca Canali , CERN IT , CERN IT LCG Database Deployment and LCG Database Deployment and Persistency Workshop Persistency Workshop October, 2005 October, 2005

Upload: nathanial-dabb

Post on 29-Mar-2015

225 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Scalable Storage Configuration for the Scalable Storage Configuration for the Physics Database ServicesPhysics Database Services

Luca CanaliLuca Canali, CERN IT, CERN IT

LCG Database Deployment and LCG Database Deployment and

Persistency WorkshopPersistency Workshop

October, 2005October, 2005

Page 2: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 22

OutlineOutline

• In this talk I will discussIn this talk I will discuss– Storage configuration for scalable database services

• Main challenges• Best practices

– An implementation of scalable storage• Impacts on DB logical to physical mapping• Performance and resource allocations

– How we can help you to size new database projects or to scale up existing applications

• Performance testing• Benchmark data

Page 3: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 33

Oracle Scalable ArchitectureOracle Scalable Architecture

Goal:Goal: A database infrastructure that provides the required system A database infrastructure that provides the required system resources to the end-users and applications.resources to the end-users and applications.

How:How: A modular architecture that can scale up to a large number of A modular architecture that can scale up to a large number of componentscomponents

Page 4: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 44

RAID 1: HA StorageRAID 1: HA Storage

• MirroringMirroring– 2-Way mirroring (RAID 1) protects against single point

of failures– Can be used to redistribute I/O load (performance)

Page 5: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 55

RAID 0: Scalable Performances RAID 0: Scalable Performances

• RAID 0 (Striping) automatically redistributes files across RAID 0 (Striping) automatically redistributes files across multiple disks.multiple disks.

• Performance and scalability are increasedPerformance and scalability are increased• Error resiliency is decreasedError resiliency is decreased

Unstriped Disks Striped Disks

Page 6: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 66

Mechanical and Geometrical constraintsMechanical and Geometrical constraints

• The external part of the disk providesThe external part of the disk provides– More throughput– Less latency

Page 7: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 77

S.A.M.E StrategyS.A.M.E Strategy

• Goal: optimize storage I/O utilization Goal: optimize storage I/O utilization • S.A.M.E. (Stripe And Mirror Everything) StrategyS.A.M.E. (Stripe And Mirror Everything) Strategy

– Built on the concepts of RAID 1 + 0– Proposed by J. Loaiza (Oracle) in 1999– Replaces “old recipes”: manual balancing across

volumes

• Need a Software or Hardware Volume ManagerNeed a Software or Hardware Volume Manager– ASM is Oracle’s solution with 10g “S.A.M.E. out of the

box”– Other solutions available from different vendors

require configuration

Page 8: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 88

Storage Configuration GuidelinesStorage Configuration Guidelines

• Use all available disk drives Use all available disk drives • Place frequently used data at outer half of diskPlace frequently used data at outer half of disk

– Fastest transfer rate– Minimize seek time

• Stripe data at 1MB extentsStripe data at 1MB extents– Distribute the workload across disks – Eliminate hot spots– Optimum sequential bandwidth gained with1MB I/O

• Stripe redo logs across multiple drivesStripe redo logs across multiple drives– Maximize write throughput for small writes– Smaller stripe size (128KB) and/or dedicated disks

• Use cache on the controller Use cache on the controller – ‘Write-back’ cache– Battery-backed cache

Page 9: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 99

Oracle’s ASM Main Features Oracle’s ASM Main Features

• Mirror protection: – 2-way and 3-way mirroring available.– Mirror on a per-file basis– Can mirror across storage arrays

• Data striping across the volume: – 1MB and 128KB stripes available

• Supports clustering and single instance• Dynamic data distribution

– A solution to avoid ‘hot spots’– On-line add/drop disk with minimal data relocation– Automatic database file management

• Database File System with performance of RAW I/O

Page 10: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1010

ASM’s Configuration – ExamplesASM’s Configuration – Examples

• ASM is a volume manager, its output are disk groups (DG) that ASM is a volume manager, its output are disk groups (DG) that Oracle databases can mount to allocate their filesOracle databases can mount to allocate their files

RECOVERY-DGDATA-DG

Config 1: Disk groups createdwith dedicated disks

Config 2: Disk groups createdby ‘horizontal’ slicing

Page 11: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1111

Proposed Storage ConfigurationProposed Storage Configuration

• Proposed storage configuration:Proposed storage configuration:– High availability - Allows backups to disk– High performance - Allows clusterware mirroring (10.2) – DBs have dedicated resources

StorageArrays

Oracle RAC Nodes

Data DG-2 Recovery DG-1 Data DG-1 Recovery DG-2 Disk Groups (ASM)

DB N.1 DB N.2

Page 12: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1212

FAQ 1: DatafilesFAQ 1: Datafiles

• Do I need to worry on the number and names of Do I need to worry on the number and names of the datafiles allocated for each tablespace?the datafiles allocated for each tablespace?

• ““Traditional” storage allocation across multiple volumes:Traditional” storage allocation across multiple volumes:– Requires a careful allocation of multiple datafiles across logical

volumes and/or filesystems– Datafile-to-filesystem and filesystem-to-physical storage mappings

have to be frequently tuned • S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access

across disksacross disks– There is NO NEED, for performance reasons, to allocate multiple

datafiles per tablespace.– 10g new feature “bigfile tablespace” allows for tablespaces with a

single datafile that can grow up to 32 TB (db_block_size=8k)

Page 13: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1313

FAQ 2: Data and Index TablespacesFAQ 2: Data and Index Tablespaces

• Do I need dedicated tablespaces for indexes and Do I need dedicated tablespaces for indexes and tables?tables?

• Separation of indexes and tables has often been advised to:Separation of indexes and tables has often been advised to:– Distribute I/O – Reduce fragmentation– Allow separate backup of tables and indexes

• S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disksacross disks– No performance gains are expected by using dedicated

tablespaces for INDEXes and TABLEs.• Additional Notes:Additional Notes:

– Tablespaces fragmentation has little impact when using locally managed TBSs and automatic segment space management (9i and 10g)

– Very large database can profit from using multiple tablespaces for admin purposes and logical separation of objects

Page 14: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1414

FAQ 3: Sizing StorageFAQ 3: Sizing Storage

• Storage sizing for database should not take size as the Storage sizing for database should not take size as the first requirementfirst requirement– Bandwidth and performance metrics are bound to the number of

disk spindles– Magnetic HD technology has improved the GByte/$ ratio– The rest of HD technology has not seen much improvements in

the last 5 years (since 15K rpms HDs)

• Sizing for storage requirements should Sizing for storage requirements should – Be based on stress test measurements– Past performance measurements on comparable systems– New projects can leverage benchmark data.

• Extra HD space is not wastedExtra HD space is not wasted– Can be used to strengthen the B&R policy with Disk Backups

Page 15: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1515

IO Benchmark MeasurementsIO Benchmark Measurements

• Benchmark data can be used forBenchmark data can be used for– Sizing of new projects and upgrades– Performance baseline, testing for new hardware

• The following metrics have been measured:The following metrics have been measured:– Sequential throughput (full scans)– Random access (indexed access)– I/O per second (indexed access)– Metrics are measured as a function of workload

• Other test detailsOther test details– Benchmark tool: Oracle’s ORION– Infortrend Storage array: 16 SATA x 400 GB disks, 1

controller and 1 GB cache

Page 16: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1616

IO Benchmark DataIO Benchmark Data

Sequential Workload Performance (Read Only Workload)

0

20

40

60

80

100

120

140

160

1 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32

Load (N# Outstanding asynch I/Os)

Th

rou

gh

pu

t (M

Bp

s)

12 LUNs, noraid

16 LUNs, no raid,external partition

Throughput:+ 25%

Page 17: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1717

IO Benchmark DataIO Benchmark Data

Latency for Small Random I/O (Read Only Workload)

0

20

40

60

80

100

120

140

160

1 2 3 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80

Load (N# Outstanding asynch I/Os)

Lat

ency

(m

s)

12 LUNs, noraid

16 LUNs, no raid,external partition

Latency:- 35%

Page 18: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1818

IO Benchmark DataIO Benchmark Data

Small Random I/O Performance (Read Only Workload)

0

100

200

300

400

500

600

700

800

900

1000

1 4 16 28 40 52 64 76

Load (N# Outstanding asynch I/Os)

I/O p

er s

eco

nd

12 LUNs, noraid

16 LUNs, no raid,external partition

I/O /sec:+60%

Page 19: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 1919

ConclusionsConclusions

• The Database Services for Physics can provideThe Database Services for Physics can provide– Scalable Database services

• Scalable on CPU and Memory resources• Scalable on Storage resources

– Sizing for new projects or upgrades• Stress/Performance testing • Integration and Validation Testing • Benchmark data for capacity planning

Page 20: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 2020

Additional Benchmark DataAdditional Benchmark Data

Sequential Workload Performance (Read Only Workload)

0

20

40

60

80

100

120

140

1 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32

Load (N# Outstanding asynch I/Os)

Th

rou

gh

pu

t (M

Bp

s)

7 LUNs, raid0

14 LUNs, no raid

14 LUNs, no raid, outerpartition used14 LUN, no raid, innerpartition used

Page 21: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 2121

Additional Benchmark DataAdditional Benchmark Data

Latency for Small Random I/O (Read Only Workload)

0

20

40

60

80

100

120

140

1 2 3 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80

Load (N# Outstanding asynch I/Os)

Lat

ency

(m

s)

7 LUNs, raid0

14 LUNs, no raid

14 LUNs, no raid, outerpartition used14 LUNS, no raid, innerpartition used

Page 22: Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005Database Workshop, October 2005 L.Canali, CERNL.Canali, CERN 2222

Additional Benchmark DataAdditional Benchmark Data

Small Random I/O Performance (Read Only Workload)

0

100

200

300

400

500

600

700

800

900

1 4 16 28 40 52 64 76

Load (N# Outstanding asynch I/Os)

I/O p

er

se

co

nd

7 LUNs, raid0

14 LUNs, no raid

14 LUNs, noraid, outerpartition used14 LUNS, noraid, innerpartition used