• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 1,296
Next 10 →

SMARTS: Accelerating Microarchitecture Simulation via Rigorous Statistical Sampling

by Roland E. Wunderlich, Thomas F. Wenisch, Babak Falsafi, James C. Hoe - in Proceedings of the 30th annual international symposium on Computer architecture , 2003
"... Current software-based microarchitecture simulators are many orders of magnitude slower than the hardware they simulate. Hence, most microarchitecture design studies draw their conclusions from drastically truncated benchmark simulations that are often inaccurate and misleading. This paper presents ..."
Abstract - Cited by 258 (25 self) - Add to MetaCart
benchmarks, running with average speedups of 35 and 60 over detailed simulation of 8-way and 16-way out-of-order processors, respectively. 1.

A Model for Speedup of Parallel Programs

by Allen B. Downey , 1997
"... We propose a new model for parallel speedup that is based on two parameters, the average parallelism of a program and its variance in parallelism. We present a way to use the model to estimate these program characteristics using only observed speedup curves (as opposed to the more detailed program k ..."
Abstract - Cited by 32 (3 self) - Add to MetaCart
We propose a new model for parallel speedup that is based on two parameters, the average parallelism of a program and its variance in parallelism. We present a way to use the model to estimate these program characteristics using only observed speedup curves (as opposed to the more detailed program

Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing

by Haluk Topcuoglu, Min-you Wu, et al. - IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS , 2002
"... Efficient application scheduling is critical for achieving high performance in heterogeneous computing environments. The application scheduling problem has been shown to be NP-complete in general cases as well as in several restricted cases. Because of its key importance, this problem has been exte ..."
Abstract - Cited by 255 (0 self) - Add to MetaCart
approaches in terms of both quality and cost of schedules, which are mainly presented with schedule length ratio, speedup, frequency of best results, and average scheduling time metrics.

Speculative Precomputation: Long-range Prefetching of Delinquent Loads

by Jamison D. Collins, Yong-Fong Lee, Hong Wang Z, Dean M. Tullsen, Christopher Hughes, Yong-fong Lee Q, John P. Shen, Dan Lavery, John P. Shen Z , 2001
"... This paper explores Speculative Precomputation, a technique that uses idle thread contexts in a multithreaded architecture to improve performance of single-threaded applications. It attacks program stalls from data cache misses by pre-computing future memory accesses in available thread contexts, an ..."
Abstract - Cited by 180 (23 self) - Add to MetaCart
significantly more aggressive speculation, overcoming this limitation. Even with realistic costs for spawning threads, speedups as high as 169% are achieved, with an average speedup of 76%. 1.

Scalably scheduling processes with arbitrary speedup curves

by Jeff Edmonds, Kirk Pruhs - In ACM-SIAM Symposium on Discrete algorithms. Society for Industrial and Applied Mathematics , 2009
"... “With multi-core it’s like we are throwing this Hail Mary pass down the field and now we have to run down there as fast as we can to see if we can catch it.” — David Patterson, UC Berkeley computer science professor We give a scalable ((1+ǫ)-speed O(1)-competitive) nonclairvoyant algorithm for sched ..."
Abstract - Cited by 44 (16 self) - Add to MetaCart
for scheduling jobs with sublinear nondecreasing speed-up curves on multiple processors with the objective of average response time. 1

The Speedup-Test: a statistical methodology for program speedup analysis and computation

by Sidi Touati, Julien Worms - in "Concurrency and Computation: Practice and Experience", 2012, http://dx.doi.org/10. 1002/cpe.2939. AOSTE 27 International Conferences with Proceedings
"... In the area of high performance computing and embedded systems, numerous code optimisation methods exist to accelerate the speed of the computation (or optimise another performance criteria). They are usually experimented by doing multiple observations of the initial and the optimised execution time ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
times of a program in order to declare a speedup. Even with fixed input and execution environment, program execution times vary in general. Hence different kinds of speedups may be reported: the speedup of the average execution time, the speedup of the minimal execution time, the speedup of the median

Diversity maximization speedup for fault localization

by Liang Gong, David Lo, Lingxiao Jiang, Hongyu Zhang - In Automated Software Engineering (ASE), 2012 Proceedings of the 27th IEEE/ACM International Conference on , 2012
"... Fault localization is useful for reducing debugging effort. However, many fault localization techniques require non-trivial number of test cases with oracles, which can deter-mine whether a program behaves correctly for every test input. Test oracle creation is expensive because it can take much man ..."
Abstract - Cited by 5 (2 self) - Add to MetaCart
on Diversity Maximization Speedup (Dms). Dms orders a set of unlabeled test cases in a way that maximizes the effectiveness of a fault localization technique. Developers are only expected to label a much smaller number of test cases along this ordering to achieve good fault localization results. Our

Quantum walk speedup of backtracking algorithms

by Ashley Montanaro , 2015
"... We describe a general method to obtain quantum speedups of classical algorithms which are based on the technique of backtracking, a standard approach for solving constraint satisfaction problems (CSPs). Backtracking algorithms explore a tree whose vertices are partial solutions to a CSP in an attemp ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
walk algorithm of Belovs to search in the backtracking tree. We also discuss how, for certain distributions on the inputs, the algorithm can lead to an average-case exponential speedup. 1

Tolerating Memory Latency through Software-Controlled Pre-Execution in Simultaneous Multithreading Processors

by Chi-Keung Luk - In Proceedings of the 28th Annual International Symposium on Computer Architecture , 2001
"... Hardly predictable data addresses in many irregular applications have rendered prefetching ineffective. In many cases, the only accurate way to predict these addresses is to directly execute the code that generates them. As multithreaded architectures become increasingly popular, one attractive appr ..."
Abstract - Cited by 174 (0 self) - Add to MetaCart
of shortening programs for pre-execution, and no need of special hardware to copy register values upon thread spawns). Consequently, only minimal extensions to SMT machines are required to support our technique. Despite its simplicity, our technique offers an average speedup of 24% in a set of irregular

Thread scheduling for multiprogrammed multiprocessors

by Nimar S. Arora, Robert D. Blumofe, C. Greg Plaxton - In Proceedings of the Tenth Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), Puerto Vallarta , 1998
"... We present a user-level thread scheduler for shared-memory multiprocessors, and we analyze its performance under multiprogramming. We model multiprogramming with two scheduling levels: our scheduler runs at user-level and schedules threads onto a fixed collection of processes, while below, the opera ..."
Abstract - Cited by 208 (3 self) - Add to MetaCart
-blocking implementation of the work-stealing algorithm. For any multithreaded computation with work ¢¤ £ and critical-path length ¢¦ ¥ , and for any number § of processes, our scheduler executes the computation in expected time ¨�©�¢�£���§¤����¢�¥�§���§¤�� � , where §� � is the average number of processors allocated
Next 10 →
Results 1 - 10 of 1,296
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University