Results 1 -
3 of
3
Examining Extended and Scientific Metadata for Scalable Index Designs
, 2012
"... The sheer volume of modern data makes manual file management impractical. Search-oriented file systems, where data and metadata are indexed for fast search, are increasingly viewed as a necessity, everywhere from desktops to HPC. However, current techniques have been designed and tested for file sys ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
(Show Context)
The sheer volume of modern data makes manual file management impractical. Search-oriented file systems, where data and metadata are indexed for fast search, are increasingly viewed as a necessity, everywhere from desktops to HPC. However, current techniques have been designed and tested for file system metadata, such as POSIX metadata, and fail to account for the wide variety of metadata users would like to search. In particular, the scientific world has been vocal about a desire to search extended and content metadata. While file system metadata is well characterized by a variety of workload studies, scientific metadata is much less well understood. We characterize scientific metadata, in order to better understand the implications for index design. We demonstrate that previously suggested index structures, such as k-d trees, R-trees, and row major databases, are not well suited to scientific metadata. Finally, we provide suggestions for a system design based on our findings. 1
Examining Scientific Data for Scalable Index Designs
"... The largest modern file systems contain billions of files. Faced with these kinds of volumes, manual file naviga-tion and management is no longer feasible and users have turned to search as an alternate method of finding files. ..."
Abstract
- Add to MetaCart
(Show Context)
The largest modern file systems contain billions of files. Faced with these kinds of volumes, manual file naviga-tion and management is no longer feasible and users have turned to search as an alternate method of finding files.
Towards a Flexible, Lightweight Virtualization Alternative
"... In recent times, two virtualization approaches have become dominant: hardware-level and operating system-level virtualization. They differ by where they draw the virtualization boundary between the virtualizing and the virtualized part of the system, resulting in vastly different properties. We argu ..."
Abstract
- Add to MetaCart
(Show Context)
In recent times, two virtualization approaches have become dominant: hardware-level and operating system-level virtualization. They differ by where they draw the virtualization boundary between the virtualizing and the virtualized part of the system, resulting in vastly different properties. We argue that these two approaches are extremes in a continuum, and that boundaries in between the extremes may combine several good properties of both. We propose abstractions to make up one such new virtualization boundary, which combines hardware-level flexibility with OS-level resource sharing. We implement and evaluate a first prototype.