Results 1 - 10
of
29
Venti: A New Approach to Archival Storage
, 2002
"... This paper describes a network storage system, called Venti, intended for archival data. In this system, a unique hash of a block's contents acts as the block identifier for read and write operations. This approach enforces a write-once policy, preventing accidental or malicious destruction of ..."
Abstract
-
Cited by 342 (0 self)
- Add to MetaCart
(Show Context)
This paper describes a network storage system, called Venti, intended for archival data. In this system, a unique hash of a block's contents acts as the block identifier for read and write operations. This approach enforces a write-once policy, preventing accidental or malicious destruction of data. In addition, duplicate copies of a block can be coalesced, reducing the consumption of storage and simplifying the implementation of clients. Venti is a building block for constructing a variety of storage applications such as logical backup, physical backup, and snapshot file systems.
Provable Data Possession at Untrusted Stores
, 2007
"... We introduce a model for provable data possession (PDP) that allows a client that has stored data at an untrusted server to verify that the server possesses the original data without retrieving it. The model generates probabilistic proofs of possession by sampling random sets of blocks from the serv ..."
Abstract
-
Cited by 302 (9 self)
- Add to MetaCart
We introduce a model for provable data possession (PDP) that allows a client that has stored data at an untrusted server to verify that the server possesses the original data without retrieving it. The model generates probabilistic proofs of possession by sampling random sets of blocks from the server, which drastically reduces I/O costs. The client maintains a constant amount of metadata to verify the proof. The challenge/response protocol transmits a small, constant amount of data, which minimizes network communication. Thus, the PDP model for remote data checking supports large data sets in widely-distributed storage systems. We present two provably-secure PDP schemes that are more efficient than previous solutions, even when compared with schemes that achieve weaker guarantees. In particular, the overhead at the server is low (or even constant), as opposed to linear in the size of the data. Experiments using our implementation verify the practicality of PDP and reveal that the performance of PDP is bounded by disk I/O and not by cryptographic computation.
Optical Data Storage
- Principles, Potential, and Problems. Presented at the 2002 Conference on File and Storage Technologies
, 2002
"... this article highlights di#erent approaches towards achieving this goal of higher density optical storage, either by increasing the density at the surface of an optical recording material, or by utilizing the unique capability of optics to access the volume of suitable media. 2 Approaches to increas ..."
Abstract
-
Cited by 16 (0 self)
- Add to MetaCart
this article highlights di#erent approaches towards achieving this goal of higher density optical storage, either by increasing the density at the surface of an optical recording material, or by utilizing the unique capability of optics to access the volume of suitable media. 2 Approaches to increased areal density For many years, the demands for better data storage have been met by evolutionary advances: steady increases in the areal density and other performance specifications of magnetic and optical recording devices. The areal density of hard disks has been growing continuously through more than seven orders of magnitude over the last fifty years, but during the last five years the growth accelerated to a compounded annual growth rate of slightly more than 100%! Optical storage, with a much shorter track record, has increased in storage density by a factor of 5 from the original CD standard to the recent DVD format (Table I). The reason for this discrepancy is removability. Since high performance magnetic recording does not support removability, hard drives need not work with the media of previous generations and the only standards to satisfy are those on data input/output and form factor, thus encouraging strong technical competition that has resulted in tremendous progress. Optical storage, however, is dominated by interchangeable media and backward compatibility. This compatibility facilitates the introduction of each new generation of technology in the market, but forces a time--consuming standards process for each higher density generation
MEMS-based Disk Buffer for Streaming Media Servers
- International Conference on Data Engineering
, 2003
"... The performance of streaming media servers has been limited due to the dual requirements of high throughput and low memory use. Although disk throughput has been enjoying a 40% annual increase, slower improvements in disk access times necessitate the use of large DRAM buffers to improve the overall ..."
Abstract
-
Cited by 14 (2 self)
- Add to MetaCart
The performance of streaming media servers has been limited due to the dual requirements of high throughput and low memory use. Although disk throughput has been enjoying a 40% annual increase, slower improvements in disk access times necessitate the use of large DRAM buffers to improve the overall streaming throughput. MEMS-based storage is an exciting new technology that promises to bridge the widening performance gap between DRAM and disk-drives in the memory hierarchy. This paper explores the impact of integrating these devices into the memory hierarchy on the class of streaming media applications. We evaluate the use of MEMS-based storage for buffering and caching streaming data. We also show how a bank of k MEMS devices can be managed in either configuration and that they can provide a k-fold improvement in both throughput and access latency. An extensive analytical study shows that using MEMS storage can reduce the buffering cost and improve the throughput of streaming servers significantly.
Indirection Systems for Shingled-Recording Disk Drives
"... Abstract—Shingled magnetic recording is a promising technology to increase the capacity of hard-disk drives with no significant cost impact. Its main drawback is that random-write access to the disk is restricted due to overlap in the layout of data tracks. For computing and storage systems to enjoy ..."
Abstract
-
Cited by 14 (0 self)
- Add to MetaCart
(Show Context)
Abstract—Shingled magnetic recording is a promising technology to increase the capacity of hard-disk drives with no significant cost impact. Its main drawback is that random-write access to the disk is restricted due to overlap in the layout of data tracks. For computing and storage systems to enjoy the increased capacity, it is necessary to mitigate these access restrictions, and present a storage device that serves unrestricted read/write requests with adequate performance. This paper proposes two different indirection systems to mask access restrictions and optimize performance. The first one is a diskcache based architecture that provides unrestricted access with manageable drop in performance. A second, more complex indirection system, utilizes a new storage unit called S-block. It is shown that the S-block architecture allows good sustained random-write performance, a point where the disk-cache architecture fails. The organization and algorithms of both architectures are specified in detail. Each was implemented and simulated as a discrete-event simulation, mimicking its operation on real storage devices. For the performance evaluation both synthetic workloads and traces from real workloads were used. I.
Accessing scientific data: Simpler is better
- In Proc. of the 8th International Symposium on Spatial and Temporal Databases
, 2003
"... Abstract. A variety of index structures has been proposed for supporting fast access and summarization of large multidimensional data sets. Some of these indices are fairly involved, hence few are used in practice. In this paper we examine how to reduce the I/O cost by taking full advantage of recen ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
(Show Context)
Abstract. A variety of index structures has been proposed for supporting fast access and summarization of large multidimensional data sets. Some of these indices are fairly involved, hence few are used in practice. In this paper we examine how to reduce the I/O cost by taking full advantage of recent trends in hard disk development which favor reading large chunks of consecutive disk blocks over seeking and searching. We present the Multiresolution File Scan (MFS) approach which is based on a surprisingly simple and flexible data structure which outperforms sophisticated multidimensional indices, even if they are bulk-loaded and hence optimized for query processing. Our approach also has the advantage that it can incorporate a priori knowledge about the query workload. It readily supports summarization using distributive (e.g., count, sum, max, min) and algebraic (e.g., avg) aggregate operators. 1
High Density Hard Disk Drive Trends in the USA, tech report at http://www.tomcoughlin.com /techpapers.htm
"... The United States is a major center for disk drive technology. Feeding the current extraordinary growth in data storage areal density has been the developments in component technology. This explosion in storage capacity has provided a remarkable value for our global information society. As the cost ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
The United States is a major center for disk drive technology. Feeding the current extraordinary growth in data storage areal density has been the developments in component technology. This explosion in storage capacity has provided a remarkable value for our global information society. As the cost of storage has gone down new non-computer markets have opened up for disk drives and new form factors for mobile markets have appeared. Technically it appears that disk drive areal densities can go well above 100 Gb/in 2. However the lack of profitability in the disk drive and component industries could prevent the capital investments required to achieve these higher areal densities. This has already led to a great deal of consolidation in the drive and component industries. Key developments in heads, media, channel, disk and head actuator motor technologies will be described with projections for the future. We complete this review by projecting three classes of enterprise disk drives developing by 2002 to address different market needs.
Shingled Magnetic Recording for Big Data Applications
, 2012
"... Modern Hard Disk Drives (HDDs) are fast approaching the superparamagnetic limit forcing the storage industry to look for innovative ways to transition from traditional magnetic recording to Heat-Assisted Magnetic Recording or Bit-Patterned Magnetic Recording. Shingled Magnetic Recording (SMR) is a s ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
(Show Context)
Modern Hard Disk Drives (HDDs) are fast approaching the superparamagnetic limit forcing the storage industry to look for innovative ways to transition from traditional magnetic recording to Heat-Assisted Magnetic Recording or Bit-Patterned Magnetic Recording. Shingled Magnetic Recording (SMR) is a step in this direction as it delivers high storage capacity with minimal changes to current production infrastructure. However, since it sacrifices random-write capabilities of the device, SMR cannot be used as a drop-in replacement for traditional HDDs. We identify two techniques to implement SMR. The first involves the insertion of a shim layer between the SMR device and the host, similar to the Flash Translation Layer found in Solid-State Drives (SSDs). The second technique, which we feel is the right direction for SMR, is to push enough intelligence up into the file system to effectively mask the sequential-write nature of the underlying SMR device. We present a custom-built SMR Device Emulator and ShingledFS, a FUSE-based SMR-aware file system that operates in tandem with the SMR Device Emulator. Our evaluation studies SMR for Big Data applications and we also examine the overheads introduced by the emulation. We show that Big Data workloads can be run effectively on SMR devices with an overhead as low as 2.2 % after eliminating the overheads of emulation. Finally we present insights on garbage collection mechanisms
What is the Future of Disk Drives, Death or Rebirth? 9
"... Disk drives have experienced dramatic development to meet the performance requirements, since the IBM 1301 disk drive was announced in 1961. However, the performance gap between memory and disk drives has widened to 6 orders of magnitude and continues to widen by about 50 % per year. Furthermore, en ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Disk drives have experienced dramatic development to meet the performance requirements, since the IBM 1301 disk drive was announced in 1961. However, the performance gap between memory and disk drives has widened to 6 orders of magnitude and continues to widen by about 50 % per year. Furthermore, energy efficiency has become one of the most important challenges in designing disk drive storage systems. The architectural design of disk drives has reached a turning point which should allow their performance to advance further, while still maintaining high reliability and energy efficiency. This paper explains how disk drives have evolved over five decades to meet challenging customer demands. First of all, it briefly introduces the development of disk drives, and deconstructs disk performance and power consumption. Secondly, it describes the design constraints and challenges that traditional disk drives are facing. Thirdly, it presents some innovative disk drive architectures discussed in the community. Fourthly, it introduces some new storage media types and the impacts they have on the architecture of the traditional disk drives. Finally, it discusses two of the important evolutions of the disk drives: hybrid disk and solid state disk. The paper highlights the challenges and opportunities facing these storage devices, and explores how we can expect them to affect the storage systems.