63 citations found. Retrieving documents...
David Kotz and Nils Nieuwejaar, "Dynamic file-access characteristics of a production parallel scientific workload," in Proceedings of Supercomputing '94, Washington, DC, 1994, pp. 640--649, IEEE Computer Society Press.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

A Human Organization Analogy for Self-* Systems - Strunk, Ganger (2003)   (Correct)

....are properly equipped to specify detailed goals. There is some hope that availability and reliability goals can be derived from various business and insurance costs (related to risk management) but the proper way to set performance targets is a different problem. Workload characterization [4, 8] can provide a first estimate of performance requirements and can even be used to guide storage system design [2, 3] but further tuning by the administrator will be necessary. Providing an easy to use interface for this refinement is a necessity. We believe that using a system of complaints from ....

David Kotz and Nils Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. ACM Symposium on Operating System Principles (Washington, DC, USA, 14--18 November 1994.


Improving Collective I/O Performance Using Threads - Dickens, Thakur   (Correct)

....represents a significant obstacle to achieving good performance. The problem is generally not with the hardware; many parallel I O subsystems offer excellent performance. Rather, the problem arises from other factors, primarily the I O patterns exhibited by many parallel scientific applications [5, 14]. In particular, each processor tends to make a large number of small I O requests, incurring the high cost of I O on each such request. One reason for this access pattern is that parallel scientific codes frequently involve large arrays distributed across the processor s local memory. After a ....

Kotz, D. and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Supercomputing '94 pages 640-649, November 1994.


Improving Collective I/O Performance Using Threads - Dickens, Thakur   (Correct)

....represents a significant obstacle to achieving good performance. The problem is generally not with the hardware; many parallel I O subsystems offer excellent performance. Rather, the problem arises from other factors, primarily the I O patterns exhibited by many parallel scientific applications [5, 14]. In particular, each processor tends to make a large number of small I O requests, incurring the high cost of I O on each such request. One reason for this access pattern is that parallel scientific codes frequently involve large arrays distributed across the processor s local memory. After a ....

Kotz, D. and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Supercomputing '94 pages 640-649, November 1994.


Glossary on Parallel Input/Output - Stockinger   (Correct)

....in order to run operations on distributed data structures in parallel. CHARISMA (CHARacterize I O in Scientific Multiprocessor Applications) CHARISMA is a project to characterize I O in scientific multiprocessor applications from a variety of production parallel computing platforms and sites [23]. It recorded individual read and write requests in live, multiprogramming workloads. It turned out that most files were accessed in complex, highly regular patterns. checkpointing Checkpointing allows processes to save their state from time to time so that they can be restarted in case of ....

David Kotz and Nils Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Proceedings of Supercomputing '94, pages 640--649, Washington, DC, November 1994. IEEE Computer Society Press.


Evaluation of Collective I/O Implementations on Parallel.. - Dickens, Thakur   (Correct)

....and represents a significant obstacle to achieving good performance. The problem is often not with the hardware; many parallel I O subsystems offer excellent performance. Rather, the problem arises from other factors, primarily the I O patterns exhibited by many parallel scientific applications [9, 18, 2, 3, 22, 24, 25, 28]. In particular, each processor tends to make a large number of small I O requests, incurring the high cost of I O on each such request. One reason for this access pattern is that parallel scientific codes frequently involve large arrays distributed across the processor s local memory. After a ....

Kotz, D. and N. Nieuwejaar. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Supercomputing '94 pages 640--649, November 1994.


On Implementing High-Performance Collective I/O - Dickens, Thakur   (Correct)

....and represents a significant obstacle to achieving good performance. The problem is often not with the hardware; many parallel I O subsystems offer excellent performance. Rather, the problem arises from other factors, primarily the I O patterns exhibited by many parallel scientific applications [9, 18, 2, 3, 22, 24, 25, 28]. In particular, each processor tends to make a large number of small I O requests, incurring the high cost of I O on each such request. One reason for this access pattern is that parallel scientific codes frequently involve large arrays distributed across the processor s local memory. After a ....

Kotz, D. and N. Nieuwejaar. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Supercomputing '94 pages 640--649, November 1994.


Stream*: Fast, Flexible, Data-Parallel I/O - Moore, Hatcher, Quinn (1995)   (1 citation)  (Correct)

....general parallel file operations. We say files relying on this generality use Independent Buffering Mode (IB) because each VP s stream is managed separately. Many debugged, production quality, data parallel applications do not require such generality; they perform very 5 regular file operations [1, 2, 5, 7, 8, 10]. When regular operations are performed, all VPs read or write the same amount of data at the same time. In such instances, a single value can act as the file pointer for all the VP streams. During a write, each VP copies data from its parallel variable into its file buffer. Under the assumption ....

D. Kotz and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Proceedings of Supercomputing '94, pages 640--649, November 1994.


Automatic Parallel I/O Performance Optimization in Panda - Chen, Winslett, Cho, Kuo (1998)   (1 citation)  (Correct)

....6, and draw conclusions and outline the future work in section 7. 2 Panda basics Panda is designed for large scale scientific applications performing collective array I O operations on distributedmemory multiprocessors, a commonly observed I O pattern in large scale scientific applications [26, 21]. In this paper, we base our discussion on Panda 2.1 with each node either dedicated to application computation or I O. The dedicated I O nodes are called Panda servers and the compute nodes are called Panda clients . During each I O request, Panda servers read or write array data from or to ....

D. Kotz and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Proceedings of Supercomputing '94, pages 640--649, Washington, DC, November 1994. IEEE Computer Society Press.


Job Characteristics of a Production Parallel Scientific.. - Feitelson, Nitzberg (1995)   (75 citations)  (Correct)

....Trace Availability The trace used in this study is available upon request from Bill Nitzberg, at nitzberg nas.nasa.gov. Acknowledgements This study was initiated using the CHARISMA traces, collected by David Kotz and Nils Nieuwejaar of Dartmouth College to study I O behavior of applications [14]. These traces were later replaced by an accounting trace collected at NASA Ames, which had the advantage that it included user and job information, and that it covered a much longer period. The generosity and help provided by Dave and Nils at the initial stages of this work are greatly ....

D. Kotz and N. Nieuwejaar, "Dynamic file-access characteristics of a production parallel scientific workload". In Supercomputing '94, pp. 640--649, Nov 1994.


High-Performance File I/O in Java: Existing Approaches.. - Bonachea, Dickens.. (2000)   (Correct)

....that Java offers as a programming language. To be useful as a language for high performance computing, however, Java must not only have good support for computation but must also be able to provide high performance file I O, as many scientific applications have significant I O requirements [6, 22, 34]. In this paper, we investigate the I O capabilities of Java for high performance computing and provide suggestions for relatively simple changes to the Java I O model that can improve performance significantly. We first examine several approaches that attempt to provide high performance I O in ....

....and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439. thakur mcs.anl.gov 1 1. 1 I O in High Performance Computing Many computationally intensive scientific applications also need to access large amounts of data, and I O is often the bottleneck in such applications [6, 22, 34]. A common I O requirement is as follows. The application has some large data structures, say multidimensional arrays, distributed among processes in some fashion. The arrays must be read from or written to a file containing the global array. The program may begin by reading in an input array and ....

Kotz, D. and N. Nieuwejaar. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Proceedings of Supercomputing '94, pages 640-649, November 1994.


A Performance Study of Two-Phase I/O - Dickens, Thakur (1998)   (4 citations)  (Correct)

....represents a significant obstacle to achieving good performance. The problem is generally not with the hardware; many parallel I O subsystems offer excellent performance. Rather, the problem arises from other factors, primarily the I O patterns exhibited by many parallel scientific applications [1, 5] In particular, each processor tends to make a large number of small I O requests, incurring the high cost of I O on each such request. The technique of collective I O has been developed to better utilize the parallel I O subsystem [2, 7, 8] In this approach, the processors exchange information ....

Kotz, D. and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Supercomputing '94 pages 640-649, November 1994.


An Evaluation of Java's I/O Capabilities for High-Performance .. - Dickens, Thakur (2000)   (Correct)

....the Java I O model in Section 6. Related work is discussed in Section 7, followed by conclusions in Section 8. 2. I OINHIGH PERFORMANCECOMPUTING Many computationally intensive scientific applications also need to access large amounts of data, and I O is often the bottleneck in such applications [3, 13, 22]. A common I O requirement is as follows. The application has some large data structures, say multidimensional arrays, distributed among processes in some fashion. The arrays must be read from or written to a file containing the global array. The program may begin by reading in an input array and ....

Kotz, D. and N. Nieuwejaar. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Proceedings of Supercomputing '94, pages 640-649, November 1994.


An Evaluation of Java's I/O Capabilities for High-Performance .. - Dickens, Thakur (2000)   (Correct)

....suggestions for improving the Java I O model in Section 6. In Section 7 we discuss related work, and we conclude in Section 8. 2 I O in High Performance Computing Many large scientific applications also need to access large amounts of data, and I O is often the bottleneck in such applications [3, 13, 21]. A common I O requirement is as follows. The application has some large data structures, say multidimensional arrays, distributed among processes in some fashion. The arrays must be read from or written to a file containing the global array. The program may begin by reading in an input array and ....

Kotz, D. and N. Nieuwejaar. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Proceedings of Supercomputing '94, pages 640-649, November 1994.


Performance Modeling for the Panda Array I/O Library - Chen, Winslett, Kuo, Cho.. (1996)   (1 citation)  (Correct)

....(BLOCK, and ( BLOCK) from left to right. The server meshes are indicated in the figure. 4 Related Work To design a good parallel i o subsystem we must understand its intended usage. Thus there are many recent studies characterizing dynamic i o patterns in scientific applications ([Kotz94a, Thakur95, Pura94, Pasquale94, Miller91]) They show that many scientific applications have regular patterns of i o behavior, such as physical periodicity in strided access to multidimensional arrays, or temporal periodicity in checkpoint and restart operations. In both cases all compute nodes participate in collective i o operations to ....

D. Kotz, and N. Nieuwejaar, Dynamic file-access characteristics of a production parallel scientific workload, Supercomputing, 1994.


Automatic Classification Of Input/Output Access Patterns - Madhyastha (1997)   (Correct)

....due to inadequate file system performance; those that do exhibit architecture centric access patterns. Several studies attempt to address this dearth of characterization data by gathering application input output access pattern information at the system and application level. The CHARISMA project [44, 60] has examined input output accesses on the iPSC 860 Concurrent File System (CFS) and the CM5 Scalable Disk Array to obtain some generalizations of access patterns in parallel input output workloads. They have observed predominantly write accesses, small request sizes, and generally sequential ....

Kotz, D., and Nieuwejaar, N. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Proceedings of Supercomputing '94 (November 1994), pp. 640--649.


Implementation and Evaluation of Collective I/O in the .. - Rajesh Bordawekar.. (1996)   (1 citation)  (Correct)

....and the user interfaces provide special I O modes that allow multiple processors to simultaneously access a file. Unfortunately, most of the parallel file system interfaces are derived from UNIX. As a result, these interfaces often fail to express access patterns generated in parallel workloads [KN94] This often leads to situations where processors are forced to make a large number of small non contiguous data requests. It has been shown that in such cases, a technique called Collective I O can be effectively used to improve the I O performance [BdC93, Kot94] The primary focus of this work ....

David Kotz and Nils Nieuwejaar. Dynamic File-Access Characteristics of a Production Parallel Scientific Workload. In Proceedings of Supercomputing '94, pages 640--649, November 1994.


Server-Directed Collective I/O in Panda - Seamons, Chen, Jones, Jozwiak.. (1995)   (29 citations)  (Correct)

....permit a proper discussion of all the work related to Panda; we will limit our discussion to papers from the HPCC community that appeared in 1993 or later and that focus on parallel implementations of collective i o. Recent studies characterizing dynamic i o patterns in scientific applications [Kotz94a, Purakayastha94, Pasquale94] show that many scientific applications have regular patterns of i o behavior, such as physical periodicity in strided access to multidimensional arrays, or temporal periodicity in checkpoint and restart operations. In both cases all application compute nodes ....

D. Kotz, and N. Nieuwejaar, Dynamic file-access characteristics of a production parallel scientific workload. In Proceedings of Supercomputing '94, pages 640-649, November 1994.


Parallel I/O Systems and Interfaces for Parallel Computers - Feitelson, Corbett, Hsu.. (1995)   (1 citation)  (Correct)

....with no hardware protection. 3 Semantics of Parallel I O Operations In conventional (Unix) systems, files are nearly never shared at the same time by more than one process (at least not for writing) 3] In parallel systems such sharing is the norm, including extensive sharing at the block level [38]. Thus there is an urgent need to define what happens when multiple processes open and access the same file. For example, if p processes write hello world to a file, what should happen Options are that p copies be written, that only a single copy be written, or that the multiple copies be ....

....the next set of p p processes takes over. The net effect is that the access is serialized and involves a lot of redundant synchronization. The problems in implementing the desired access patterns based on a scatter gather file mode has led most programmers to use the independent mode instead [38]. In this mode, each process can seek to a different offset in the file, and access the desired data irrespective of what the other processes are doing. For example, each set of p p processes can seek ahead to their part of the matrix in the block block decomposition, rather than participating in ....

[Article contains additional citation context not shown here]

D. Kotz and N. Nieuwejaar, "Dynamic file-access characteristics of a production parallel scientific workload". In Supercomputing '94, pp. 640--649, Nov 1994.


Overview Of The MPI-IO Parallel I/O Interface - Corbett, Feitelson, Fineberg, .. (1995)   (16 citations)  (Correct)

....simultaneous access by multiple processes is the rule rather than the exception. Moreover, parallel processes often access the file in an interleaved manner, where each process accesses a fragmented subset of the file, while other processes access the parts that the first process does not access [19]. UNIX file operations provide no support for such access, and in particular, do not allow access to multiple non contiguous parts of the file in a single operation. Parallel file systems and programming environments have typically solved the problems of data partitioning and collective access by ....

....being accessed by a single process. In a parallel environment, we must decide whether a file pointer is shared by multiple processes or if an individual file pointer will be maintained by each process. In addition, parallel programs do not generally exhibit locality of reference within a file [19]. Instead, they tend to move between distinct noncontiguous regions of a file. This means that the process must seek on almost every read or write operation. In addition, in multithreaded environments or when performing I O asynchronously, it is difficult to ensure that the file pointer will be in ....

David Kotz and Nils Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Proceedings of Supercomputing '94, pages 640--649, November 1994.


Usability of Parallel I/O Templates - Parsons, Unrau, al.   (Correct)

....5 which has fine grained I O spread throughout the application. The second application uses large grained I O operations. Finally, the conclusions are presented. 2 Related Work There have been several studies of parallel I O for applications using real data. Some examples of these are found in [10, 25, 29, 30, 34, 35]. These studies look at traces of actual parallel applications doing I O using specific parallel I O libraries and architectures. Characterizing well understood parallel programs under controlled conditions facilitates development of optimization techniques. Parallel I O systems such as PIOUS[27] ....

D. Kotz and N. Nieuwejaar, "Dynamic File-Access Characteristics of a Production Parallel Scientific Workload," In proceedings of Supercomputing `94, pp. 640649, Washington, DC, 1994.


Design, Implementation and Performance Evaluation of.. - Zhu, Jiang, Qin..   (Correct)

No context found.

David Kotz and Nils Nieuwejaar, "Dynamic file-access characteristics of a production parallel scientific workload," in Proceedings of Supercomputing '94, Washington, DC, 1994, pp. 640--649, IEEE Computer Society Press.


Predicate Control: Synchronization in Distributed.. - Tarafdar, Garg (2003)   (Correct)

No context found.

D. Kotz and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In Proc. Supercomputing '94, pages 640--649, 1994.


An MPI-IO Interface to HPSS - Terry Jones Richard (1996)   (10 citations)  (Correct)

No context found.

D. Kotz and N. Nieuwejaar, Dynamic file-access characteristics of a production parallel scientific workload, in Proceedings of Supercomputing 94, November 1994.


Design, Implementation and Performance Evaluation of.. - Zhu, Jiang, Qin..   (Correct)

No context found.

David Kotz and Nils Nieuwejaar, "Dynamic file-access characteristics of a production parallel scientific workload," in Proceedings of Supercomputing '94, Washington, DC, 1994, pp. 640--649, IEEE Computer Society Press.


Models of Parallel Applications with Large.. - Rosti, Serazzi.. (2002)   (7 citations)  (Correct)

No context found.

D. Kotz and N. Nieuwejaar, Dynamic File-Access Characteristics of a Production Parallel Scientific Workload, Proc. Supercomputing '94, pp. 640649, Nov. 1994.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC