(Enter summary)
Abstract: you design an efficient out-of-core iterative algorithm? These are the two questions answered
in this thesis. (Update)
Context of citations to this paper: More
...abstract is more than 3 times faster than naive implementations that rely on paging to perform I O. This abstract summarizes results from [3]. Krylov subspace methods are a class of iterative numerical methods for solving linear equations. Given an n by n matrix A and an n...
.... driven strategy is a better choice [dRSBH93] Systems using execution driven methodology include [Adi88, dRSBH93, VSV94, Jon95, Tol95, XZS96, SW96] Benchmark driven methodology uses data analysis techniques for the measured benchmarks [Jai91] to estimate the predicted...
Cited by: More
Research Accomplishments and Objectives - Sivan Toledo March (2002)
(Correct)
A Survey of Out-of-Core Algorithms in Numerical Linear Algebra - Toledo (1999)
(Correct)
Performance Nonmonotonicities: A Case Study of the UltraSPARC.. - Kushman (1998)
(Correct)
Similar documents (at the sentence level):
7.2%: Efficient Out-of-Core Algorithms for Linear Relaxation.. - Leiserson, Rao, Toledo (1993)
(Correct)
6.1%: PERFSIM: A Tool for Automatic Performance Analysis of.. - Sivan Toledo (1995)
(Correct)
Active bibliography (related documents): More All
1.5: An Efficient Out-of-Core Algorithm for Implicit Time-Stepping.. - Sivan Toledo
(Correct)
1.3: Performance Prediction with Benchmaps - Toledo
(Correct)
0.9: Locality Of Reference In Lu Decomposition With Partial Pivoting - Sivan Toledo (1997)
(Correct)
Similar documents based on text: More All
0.4: The Cilk System for Parallel Multithreaded Computing - Joerg (1996)
(Correct)
0.4: A Timing Analysis and Optimization System for Level-Clocked.. - Papaefthymiou (1993)
(Correct)
0.4: Executing Multithreaded Programs Efficiently - Blumofe (1995)
(Correct)
Related documents from co-citation: More All
2: PerfSim: A tool for automatic performance analysis of data parallel Fortran prog..
- Toledo - 1995
2: Applications of parametric searching in geometric optimization
- Agarwal, Sharir et al. - 1992
2: On critical orientations in the Kedem-Sharir motion planning algorithm for a con..
- Kedem, Sharir et al. - 1993
BibTeX entry: (Update)
Sivan A. Toledo. Quantitative Performance Modeling of Scientific Computations and Creating Locality in Numerical Algorithms. PhD thesis, Massachusetts Institute of Technology, 1995. Also available as Technical Report MIT-LCS-TR-656. http://citeseer.ist.psu.edu/toledo95quantitative.html More
@misc{ toledo95quantitative,
author = "S. Toledo",
title = "Quantitative Performance Modeling of Scientific Computations and Creating
Locality in Numerical Algorithms",
text = "Sivan A. Toledo. Quantitative Performance Modeling of Scientific Computations
and Creating Locality in Numerical Algorithms. PhD thesis, Massachusetts
Institute of Technology, 1995. Also available as Technical Report MIT-LCS-TR-656.",
year = "1995",
url = "citeseer.ist.psu.edu/toledo95quantitative.html" }
Citations (may not include all citations):
3972
Introduction to Algorithms (context) - Cormen, Leiserson et al. - 1990
835
High Performance Fortran language specification
- Fortran - 1994
394
The High Performance Fortran Handbook (context) - Koelbel, Loveman et al. - 1994
372
Matrix Iterative Analysis (context) - Varga - 1962
367
Computer Architecture: A Quantitative Approach (context) - Henessy, Patterson - 1990
318
Methods of conjugate gradients for solving linear systems (context) - Hestenes, Stiefel - 1952
297
A Multigrid Tutorial (context) - Briggs - 1987
231
Active messages: a mechanism for integrated communication an..
- von Eicken, Culler et al. - 1992
198
The Art of Computer Systems Performance Analysis: Techniques.. (context) - Jain - 1991
178
The Connection Machine CM-5 Technical Summary (context) - Corporation, MA - 1992
166
NESL: A nested data-parallel language
- Blelloch - 1993
164
The network architecture of the Connection Machine CM
- Leiserson - 1992
157
Data optimization: Allocation of arrays to reduce communicat.. (context) - Knobe, Lucas et al. - 1990
130
Implementation of a portable nested data-parallel language
- Blelloch, Chatterjee et al. - 1993
110
Nested dissection of regular finite element mesh (context) - George - 1973
104
Sparse partitions (context) - Awerbuch, Peleg - 1990
102
MATLAB Reference Guide (context) - MathWorks, Natick - 1992
91
CM Fortran Reference Manual (context) - Corporation, MA - 1992
90
The data alignment phase in compiling programs for distribut.. (context) - Li, Chen - 1991
83
A static parameter based performance prediction tool for par..
- Fahringer, Zima - 1993
79
A class of first order factorization methods (context) - Gustafsson - 1978
79
The effect of ordering on preconditioned conjugate gradient (context) - Duff, Meurant - 1989
67
A unified geometric approach to graph separators (context) - Miller, Teng et al. - 1991
62
Computational Frameworks for the Fast Fourier Transform (context) - Van Loan - 1992
61
FFTs in external or hierarchical memory
- Bailey - 1990
57
The multifrontal method for sparse matrix solution: Theory a.. (context) - Liu - 1992
54
Technical Report CMU-CS (context) - Blelloch, Chatterjee et al. - 1993
54
complexity: the red-blue pebble game (context) - Hong, Kung - 1981
51
Practical use of polynomial preconditioning for the conjugat.. (context) - Saad - 1985
49
Linear and Nonlinear Programming (context) - Murty - 1988
49
Templates for the Solution of Linear Systems: Building Block.. (context) - Barret, Berry et al. - 1993
42
Parallel performance prediction using lost cycles analysis
- Crovella, LeBlanc - 1994
40
CMSSL for CM-Fortran (context) - Corporation, MA - 1993
37
High-level optimization via automated statistical modeling (context) - Brewer - 1995
35
Fast Fourier transforms for fun and profit (context) - Gentleman, Sande - 1966
31
Separators in two and three dimensions (context) - Miller, Thurston - 1990
25
Efficient out-of-core algorithms for linear relaxation using..
- Leiserson, Rao et al. - 1993
20
On vectorizing incomplete factorization and SSOR preconditio.. (context) - Ashcraft, Grimes - 1988
19
Multilevel computations: review and recent development (context) - Brandt - 1988
17
step iterative methods for symmetric linear systems (context) - Chronopoulos, Gear - 1989
17
Parallel ocean general circulation modeling (context) - Smith, Dukowicz et al. - 1992
17
Shallow excluded minors and improved graph decomposition
- Plotkin, Rao et al. - 1994
14
Scan primitives and parallel vector models (context) - Blelloch - 1989
14
Two color Fourier analysi iterative algorithm elliptic probl.. (context) - Kuo, Two et al. - 1990
14
the storage requirement in the out-of-core multifrontal meth.. (context) - Liu - 1986
14
Linear programming (context) - Dantzig - 1991
13
A fast computer method for matrix transposing (context) - Eklundh - 1972
12
On adaptive weighted polynomial preconditioning for hermitia.. (context) - Fischer, Freund - 1994
12
Memory requirements for balanced computer architectures (context) - Kung - 1986
12
Performance assertion checking
- Perl, Weihl - 1993
10
Parameter Estimation in Engineering and Science (context) - Beck, Arnold - 1977
10
A detailed look at some popular benchmarks (context) - Weicker - 1991
10
The block preconditioned conjugate gradient method on vector.. (context) - Meurant - 1984
9
Fast Fourier transform of externally stored data (context) - Brenner - 1969
8
The Soul of a New Machine (context) - Kidder - 1981
8
Modeling data-parallel programs with the alignment distribut..
- Chatterjee, Gilbert et al. - 1994
7
Isolating the reasons for the performance of parallel machin..
- Formella, uller et al. - 1994
7
New graph decompositions and fast emulations in hypercubes a.. (context) - Kaklamanis, Krizanc et al. - 1993
7
Writing Efficient Programs (context) - Bentley - 1982
6
Solving large full sets of linear equations in a paged virtu.. (context) - Cruz, Nugent et al. - 1981
6
A method for computing the fast Fourier transform with auxil.. (context) - Singleton - 1967
6
DPEAC Reference Manual (context) - Corporation, MA - 1993
6
Building analytical models into an interactive performance p.. (context) - Atapattu, Gannon - 1989
5
Out-of-core solver for large dense nonsymmetric linear syste.. (context) - Geers, Klee - 1993
5
An Analysis of Time-Shared Computer Systems (context) - Scherr - 1962
5
The multifrontal method and paging in sparse Cholesky factor.. (context) - Liu - 1989
5
dense symmetric generalized eigenvalue problems using second.. (context) - Grimes, Simon et al. - 1988
5
Solution of simultaneous linear equations using a magnetic t.. (context) - Barron, Swinnerton-Dyerm - 1960
5
Performance Analysis of Transaction Processing Systems (context) - Highleyman - 1989
5
Matrix computations with Fortran and paging (context) - Moler - 1972
5
Prism Reference Manual (context) - Corporation, MA - 1992
5
Mechanisms and Interfaces for Software-Extended Coherent Sha..
- Chaiken - 1994
4
Quantitative System Performance (context) - Lazowska, Zahorjan et al. - 1984
4
An inner product-free conjugate gradient-like algorithm for ..
- Fischer, Freund - 1993
4
Organizing matrices and matrix operations for paged memory s.. (context) - McKeller, Coffman - 1969
4
Electronic computers: a historical survey (context) - Rosen - 1969
4
A static performance estimator to guide partitioning decisio.. (context) - Balasundaram, Fox et al. - 1991
4
version finite elements in three dimensions (context) - Mandel, solver - 1994
4
Auxiliary storage methods for solving finite element systems (context) - George, Rashwan - 1985
4
Predicting execution times of sequential scientific kernels
- MacDonald - 1994
3
Solving systems of large dense linear equations (context) - Grimes - 1988
3
A fast computer method for matrix transposing (context) - Schumann - 1973
3
VU Programmer's Handbook (context) - Corporation, MA - 1993
3
A block equation solver for large unsymmetric linear equatio.. (context) - Stabrowski - 1982
2
Advanced Linear Programming Computing Techniques (context) - Orchard-Hays - 1968
2
the pages in that issue are marked as Vol (context) - Ari, large et al. - 1979
2
Portable High-Performance Superconducting: High-Level Platfo.. (context) - Brewer - 1994
2
An iterative method for linear systems of which the coeffici.. (context) - Meijerink, van der Vorst - 1977
2
Large capacity equation solver for structural analysis (context) - Mondkar, Powell - 1974
2
An equation solver of very large capacity (context) - Cantin - 1971
2
inputouput complexity sorting and related problem (context) - Vitter, ouput et al. - 1988
2
problems on vector computers (context) - van der Vorst, ICCG - 1988
2
Input-Output Performance Evaluation: Self-Scaling Benchmarks (context) - Chen - 1992
2
Solution of large-scale sparse least squares problems using .. (context) - George, Heath et al. - 1981
2
An extension of Eklundh's matrix transposition algorithm and.. (context) - Twogood, Ekstrom - 1976
2
Mathematical Tables and other Aids to Computation (context) - Riesel, on et al. - 1956
2
Elliptic problems in linear difference equations over a netw.. (context) - Thomas - 1949
2
Numerical Solutions of Patial Differential Equations (context) - Smith - 1978
2
A not one matrix multiplication in a paging environment (context) - Fischer, Probert - 1976
2
Finite difference techniques for partial differential equati.. (context) - Noye - 1984
2
Software for sparse Gaussian elimination with limited core m.. (context) - Eisenstat, Schultz et al. - 1978
2
A block equation solver for large unsymmetric matrices arisi.. (context) - Crotty - 1982
2
Digital computers in nuclear reactor design (context) - Cuthill - 1964
2
Direct solution of large systems of linear equations (context) - Wilson, Bathe et al. - 1974
2
Berkeley Mathematics Lecture Notes (context) - Demmel, Algebra - 1993
1
Rapid solution fo intergal equation of classical potential t.. (context) - Rohklin - 1985
1
A new approach to I/O perfomance evaluation--- self-scaling .. (context) - Chen, Patterson - 1993
1
Linear programming at the National Bureau of Standards (context) - Hoffman - 1991
1
The effect of caches on the performance analysis of data par.. (context) - Chen - 1994
1
The influence of the Los Alamos and Livermore National Labor.. (context) - MacKenzie - 1991
1
latest benchmark result are available online from httpwww (context) - Eric, Dagum et al. - 1994
1
direct linear equation solver for structural analysis and it.. (context) - Bjrstad, scale et al. - 1987
1
An application of linear programming to curve fitting (context) - Kelly - 1958
1
A modified conjugate gradient solver for very large systems (context) - Barkai, Mortiarty et al. - 1985
1
Also available as Center for Supercomputing Research and Dev.. (context) - Malony, Observability et al. - 1990
1
Itertive solution of linear systems (context) - Freund, Golub et al. - 1992
1
Memory management for solution of linear systems arising in .. (context) - Wimberly - 1978
1
Also available online from httpwww (context) - Bischof, Du et al. - 1994
1
Algorithms for drift-diffution device simulation using massi.. (context) - Tomacruz, Sanghavi et al. - 1994
1
Also available as MIT Laboratory Computer Science Technical .. (context) - Perl, Checking et al. - 1992
1
Data flow and storage allocation for the PDQ-5 program on th.. (context) - Pfeifer - 2000
1
Connection Machine Run-Time System Architectural Specificati.. (context) - Swanson - 1993
1
LOQO User's Manual (context) - Venderbei - 1992
1
Queueing models for file memory operation
- Denning - 1964
1
The numerical solution of a partial differential equation on.. (context) - Ladd, Sheldon - 1952
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.lcs.mit.edu/publications/pubs/pdf/): More
Proving Correctness of a Distributed Shared Memory Implementation - Castro (1999)
(Correct)
Experience with Fine-Grain Synchronization in MIMD Machines.. - Yeung, Agarwal (1993)
(Correct)
Write Barrier Removal by Static Analysis - Zee, Rinard (2002)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC