Towards Performance and Reliability Enhancements of Enterprise Data Storage Systems

Loading...
Thumbnail Image

Persistent link to this item

Statistics
View Statistics

Journal Title

Journal ISSN

Volume Title

Title

Towards Performance and Reliability Enhancements of Enterprise Data Storage Systems

Published Date

2018-08

Publisher

Type

Thesis or Dissertation

Abstract

Enterprise data storage arrays are on the verge of transitioning to ultra-fast storage technologies like Storage Class Memories (SCM) and NVMe interfaces This transition demands simple and low-overhead softwares to drive and benchmark the new hardwares without sacrificing storage performance and reliability. In this dissertation, three methods have been proposed to enhance performance and reliability models for enterprise storage systems. First, we introduce new methods to replay intensive block I/O workloads more accurately. These methods can be used to reproduce realistic workloads for benchmarking of a high-performance block storage device/system. Reproduction of such an intensive workload with high accuracy without addressing the intrinsic performance variations in the I/O stack and considering the dependencies between block I/O requests is a great challenge. The I/O stack performance variations prevent to reproduce the captured workload on the same storage device accurately. We study the root cause of these performance variations in the Linux I/O stack and propose a high-fidelity replay method that replay an I/O workload with more accuracy on a similar (unscaled) storage device. Moreover, we have proposed a scalable replay method that can infer I/O dependencies and correctly propagating IO related performance gains along dependency chains during a workload replay on a faster (scaled) storage device. We evaluate our replay methods with multi-dimensional accuracy metrics and verify that it reproduces realistic I/O workloads with better accuracy compared to other replay tools. Second, we introduce TxRAID, a transactional crash recovery method for all-flash RAID arrays that can close the write-hole gap with a negligible overhead and prevent silent data corruptions after a crash. TxRAID can guarantee write atomicity without a non-volatile memory or extra journaling layer and simplifies the I/O stack in software and hardware RAID arrays. We propose limited enhancements in RAID, SSD firmware and SCSI interface to coordinate the crash recovery of individual drives with each other. During the recovery process, TxRAID rolls-back partially-written transactions to their last consistent state by leveraging the invalid data pages that co-exist on the SSDs. We have developed a trace-driven simulator to evaluate TxRAID with a wide range of synthetic and realistic workloads and compare it with existing recovery methods in the Linux kernel. TxRAID can plug the write-hole gap with a negligible impact on the I/O performance. Finally, we propose the Offline Write Buffer Policy (OWBP) which can be used for the performance evaluation of the write buffer in SSDs. Since flash memory can tolerate limited erase cycles until it wears-out, a write buffer is usually used on the top of flash memory in SSDs to coalesce write requests and reduce erase operations. OWBP can estimate a lower bound for these erase operation counts and determines how much further a practical (online) write-buffer replacement policy can reduce the flash write traffic.

Description

University of Minnesota Ph.D. dissertation. August 2018. Major: Computer and Information Sciences. Advisor: David Du. 1 computer file (PDF); viii, 104 pages.

Related to

Replaces

License

Collections

Series/Report Number

Funding information

Isbn identifier

Doi identifier

Previously Published Citation

Suggested citation

Haghdoost, Alireza. (2018). Towards Performance and Reliability Enhancements of Enterprise Data Storage Systems. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/201038.

Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.