Data Storage Systems: A Survey
Abdullah Aldhamin
July 29, 2013
CMPT 880: Large-Scale Multimedia Systems and Cloud Computing
Course Project

Motivation
• Research interest in storage systems, specifically in solid-state drives (SSDs)

Outline
• Objective
• Overview
• Solid-State Drives Use Cases

Objective
• Storage system architectures in enterprise data centers
• What is cloud storage?
• Integrating flash-based solid-state drives in large-scale storage systems

Overview
• Different storage architectures in data centers:
  – Block I/O interface (DAS and SAN)
  – File I/O interface (NAS)
  – Is there a “better” solution?
  – Shortcomings for today’s computing…
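
The difference between the two interfaces can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `BlockDevice` and `TinyFileSystem` are hypothetical in-memory stand-ins, not the API of any real DAS, SAN, or NAS product, and the 512-byte block size is assumed. A block interface exposes fixed-size blocks addressed by number, while a file interface exposes named byte streams and hides block placement from the client.

```python
BLOCK_SIZE = 512  # bytes per block; an assumed value for illustration


class BlockDevice:
    """Block I/O: the client addresses fixed-size blocks by number."""

    def __init__(self, num_blocks):
        self.blocks = [bytes(BLOCK_SIZE) for _ in range(num_blocks)]

    def read_block(self, lba):
        return self.blocks[lba]

    def write_block(self, lba, data):
        if len(data) != BLOCK_SIZE:
            raise ValueError("block writes must be exactly BLOCK_SIZE bytes")
        self.blocks[lba] = bytes(data)


class TinyFileSystem:
    """File I/O: the client names a file; block placement is hidden."""

    def __init__(self, device):
        self.device = device
        self.table = {}     # file name -> (list of block numbers, byte length)
        self.next_free = 0  # naive allocator: blocks are never reclaimed

    def write_file(self, name, data):
        lbas = []
        for off in range(0, len(data), BLOCK_SIZE):
            # Pad the last chunk so every device write is a whole block.
            chunk = data[off:off + BLOCK_SIZE].ljust(BLOCK_SIZE, b"\0")
            self.device.write_block(self.next_free, chunk)
            lbas.append(self.next_free)
            self.next_free += 1
        self.table[name] = (lbas, len(data))

    def read_file(self, name):
        lbas, length = self.table[name]
        raw = b"".join(self.device.read_block(lba) for lba in lbas)
        return raw[:length]  # drop the padding added on write
```

With block I/O the client (for example, a host file system) decides which block holds what; with file I/O that decision moves to the storage side, which is what lets a NAS share files among many clients.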

Overview
• Cloud storage:
  – What is cloud storage?
    • Object-based storage
  – Example: Windows Azure Storage (WAS)
  – Some research problems
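
As a rough illustration of the object-based model, the sketch below stores whole objects under a flat (container, key) namespace together with user metadata. The `ObjectStore` class and its method names are hypothetical and chosen for this example only; this is not the Windows Azure Storage API.

```python
import hashlib


class ObjectStore:
    """Flat namespace: (container, key) -> whole object plus metadata."""

    def __init__(self):
        self._objects = {}

    def put(self, container, key, data, metadata=None):
        # Objects are written whole; a content hash serves as a version tag.
        tag = hashlib.sha256(data).hexdigest()
        self._objects[(container, key)] = (bytes(data), dict(metadata or {}), tag)
        return tag

    def get(self, container, key):
        data, metadata, _tag = self._objects[(container, key)]
        return data, metadata

    def list_keys(self, container, prefix=""):
        # "Directories" are only a naming convention over flat keys.
        return sorted(
            k for (c, k) in self._objects
            if c == container and k.startswith(prefix)
        )
```

Unlike the block and file interfaces, there is no in-place partial update and no directory tree here: clients put and get entire objects by key, which is one reason this model scales out well.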

Overview
• Flash-based solid-state drive:
  – What is it?
  – Pros and cons
  – How can we integrate it in large-scale storage systems?
• Future direction

Integrating SSDs in Large-Scale Storage Systems
• Considerations and facts:
  – Non-uniform read access latencies, correlated with the workload access pattern
  – Internal drive-specific operations impact performance
  – Internal fragmentation leads to performance degradation
  – More writes → write amplification → bad wear leveling
  – Performance vs. lifetime vs. cost
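
Write amplification is commonly expressed as the number of pages actually programmed to flash divided by the number of pages the host asked to write. The toy model below is a sketch under stated assumptions (erase happens per block, and still-valid pages must be copied out before a block is erased); the function name and the 64-pages-per-block geometry are assumptions for illustration, not figures from the slides.

```python
def write_amplification(host_pages, relocated_pages):
    """Flash pages programmed per host page written."""
    if host_pages <= 0:
        raise ValueError("host_pages must be positive")
    return (host_pages + relocated_pages) / host_pages


PAGES_PER_BLOCK = 64  # assumed geometry, not taken from the slides

# Worst case in this toy model: the host rewrites 4 pages of a full block and
# the other 60 still-valid pages must be copied before the block is erased.
worst = write_amplification(host_pages=4, relocated_pages=PAGES_PER_BLOCK - 4)

# Best case: nothing else in the block is valid, so nothing is relocated.
best = write_amplification(host_pages=4, relocated_pages=0)

print(worst, best)  # 16.0 1.0
```

Every relocated page consumes a program cycle without storing new host data, which is why a higher factor shortens drive lifetime for the same host workload.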

Integrating SSDs in Large-Scale Storage Systems
[Figure: SSD in storage system. SSD-only system; hybrid system, with the SSD as end-point or as accelerator (write buffer, read cache)]

Gordon: SSD-only HPC Cluster
• The first HPC cluster designed with SSD-only storage
• Optimized to exploit the high bandwidth of SSDs for data-intensive applications

Gordon (Cont’d)
• Design goals:
  – Reduce the performance gap between processor and I/O in large-scale data-intensive computing
  – Improve system performance
  – Lower power consumption

Gordon (Cont’d)
• How is the SSD integrated?
  – Replaced conventional hard disks with SSDs
  – Major device-level modification: new flash translation layer

Gordon (Cont’d)
• Costly $$$
• Not suitable for widespread adoption
• Requires major device-level modification
  – Optimized for specific workloads

Hybrid: Griffin
• Griffin hybrid storage system
  – SSD is the end-point store for the data
  – Uses HDDs as write-back buffers
    • Log-structured HDDs buffer incoming writes
    • Extends SSD lifetime
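
The idea of buffering writes in an HDD log before they reach the SSD can be sketched as follows. This is a minimal model, not Griffin's actual implementation: the `HybridStore` class, its in-memory "log", and the explicit `migrate()` call are assumptions made for illustration. The point it shows is that repeated overwrites of the same block are absorbed by the log, so only the latest version of each block is written to the SSD.

```python
class HybridStore:
    """Toy model: HDD log absorbs writes, SSD is the final store."""

    def __init__(self):
        self.hdd_log = []          # append-only list of (block, data)
        self.ssd = {}              # block number -> data
        self.ssd_block_writes = 0  # counts writes that reach the SSD

    def write(self, block, data):
        # Incoming writes are appended sequentially to the HDD log.
        self.hdd_log.append((block, data))

    def read(self, block):
        # Simplification: the newest buffered version wins, else read the SSD.
        for logged_block, data in reversed(self.hdd_log):
            if logged_block == block:
                return data
        return self.ssd.get(block)

    def migrate(self):
        # Keep only the last write per block, then flush those to the SSD.
        latest = {}
        for block, data in self.hdd_log:
            latest[block] = data
        for block, data in latest.items():
            self.ssd[block] = data
            self.ssd_block_writes += 1
        self.hdd_log.clear()


store = HybridStore()
for version in (b"v1", b"v2", b"v3"):
    store.write(7, version)   # three overwrites of the same block
store.write(9, b"x")
store.migrate()
print(store.ssd_block_writes)  # 2: four host writes became two SSD writes
```

Fewer writes reaching the flash means fewer program/erase cycles, which is how this arrangement extends SSD lifetime.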

Hybrid: Hystor
• Hybrid storage system
  – SSD is used to improve I/O performance
    • Read cache
    • Write-back buffer
  – Challenges:
    • What data should be cached to benefit from SSD performance?
    • Minimal system changes
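
One simple way to approach the "what to cache" question is to track how often each block is accessed and keep only the hottest blocks on the SSD. The sketch below uses plain access counts; this is an assumed, simplified policy for illustration and not necessarily the metric Hystor itself uses. The `HotBlockCache` class and its method names are hypothetical.

```python
from collections import Counter


class HotBlockCache:
    """Keep the most frequently accessed blocks in a fixed-size SSD cache."""

    def __init__(self, capacity):
        self.capacity = capacity  # number of blocks the SSD cache can hold
        self.counts = Counter()   # block number -> observed accesses
        self.cached = set()       # blocks currently placed on the SSD

    def record_access(self, block):
        self.counts[block] += 1

    def rebalance(self):
        # Rank by access count (ties broken by block number) and keep the top.
        ranked = sorted(self.counts, key=lambda b: (-self.counts[b], b))
        self.cached = set(ranked[:self.capacity])

    def is_hit(self, block):
        return block in self.cached


cache = HotBlockCache(capacity=2)
for block in [1, 1, 1, 2, 2, 3, 4, 1, 2]:
    cache.record_access(block)
cache.rebalance()
print(sorted(cache.cached))  # [1, 2]
```

Running the ranking periodically in the background, rather than on every request, is one way to keep changes to the existing I/O path small.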

Conclusion
• The choice of I/O interface determines which storage access features are available
• Cloud storage continues to grow to accommodate the ever-increasing volume of collected data
• Solid-state drives have become an instrumental player in storage systems, but how can we best use them?