See this document in CiteSeerX!

ProfileMe: Hardware Support for Instruction-Level Profiling on Out-of-Order Processors (1997)  (Make Corrections)  (67 citations)
Jeffrey Dean, James E. Hicks, Carl A. Waldspurger, William E. Weihl, George Chrysos
International Symposium on Microarchitecture



  Home/Search   Context   Related

 
View or download:
digital.com/pub/DEC/SRC/pu...micro30.ps
digital.com/SRC/pe...ofilememicro30.ps
digital.com/SRC/pe...ofilememicro30.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  digital.com/SRC/staff/weihl...bib (more)
From:  digital.com/SRC/personal...papers
Homepages:  J.Dean  

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Profile data is valuable for identifying performance bottlenecks and guiding optimizations. Periodic sampling of a processor's performance monitoring hardware is an effective, unobtrusive way to obtain detailed profiles. Unfortunately, existing hardware simply counts events, such as cache misses and branch mispredictions, and cannot accurately attribute these events to instructions, especially on out-of-order machines. We propose an alternative approach, called ProfileMe, that samples... (Update)

Cited by:   More
TAPE: A Transactional Application Profiling Environment - Chi (2005)   (Correct)
DiST: A Simple, Reliable and Scalable Method to.. - Girbal, Mouchard.. (2003)   (Correct)
Toward a Systematic, Pragmatic and Architecture-Aware.. - Parello, Temam, al. (2004)   (Correct)

Active bibliography (related documents):   More   All
0.5:   Fine-Grained Dynamic Instrumentation Of Commodity Operating.. - Tamches (2001)   (Correct)
0.3:   Memory Forwarding: Enabling Aggressive Layout Optimizations by.. - Luk, Mowry (1999)   (Correct)
0.3:   Memory Dependence Prediction using Store Sets - George Chrysos And (1998)   (Correct)

Similar documents based on text:   More   All
0.4:   Transparent, Low-Overhead Profiling on Modern Processors - Anderson, Berc, Chrysos, .. (1998)   (Correct)
0.3:   Compiler-Directed Storage Reclamation Using Object Lifetime.. - Hicks, Jr. (1992)   (Correct)
0.1:   Rapid Profiling via Stratified Sampling - Sastry, Bodik, James (2001)   (Correct)

Related documents from co-citation:   More   All
27:   Continuous profiling: Where have all the cycles gone - Anderson, Berc et al. - 1996
14:   Accurate and practical profile-driven compilation using the profile buffer - Conte, Menezes et al. - 1996
12:   Performance Analysis Using the MIPS R10000 Performance Counters - Zagha, Larson et al. - 1996

BibTeX entry:   (Update)

J. Dean, J. E. Hicks, C. A. Waldspurger, W. E. Weihl, and G. Chrysos. ProfileMe: Hardware support for instruction-level profiling on out-of-order processors. In Proc. 30th Annual Intl. Symp. on Microarchitecture, Dec. 1997. http://citeseer.ist.psu.edu/dean97profileme.html   More

@inproceedings{ dean97profileme,
    author = "Jeffrey Dean and James E. Hicks and Carl A. Waldspurger and William E. Weihl and George Z. Chrysos",
    title = " ProfileMe : Hardware Support for Instruction-Level Profiling on Out-of-Order Processors",
    booktitle = "International Symposium on Microarchitecture",
    pages = "292-302",
    year = "1997",
    url = "citeseer.ist.psu.edu/dean97profileme.html" }
Citations (may not include all citations):
407   Trace scheduling: A technique for global microcode compactio.. (context) - Fisher - 1981  DBLP
121   Continuous profiling: Where have all the cycles gone - Anderson, Berc et al. - 1997
106   Microprocessor User's Manual (context) - Technologies, MIPS - 1995
103   Efficient path profiling - Ball, Larus - 1996  ACM   DBLP
80   Avoiding conflict misses dynamically in large direct-mapped .. - Bershad, Lee et al. - 1994  ACM   DBLP
64   Improving the accuracy of static branch prediction using bra.. (context) - Young, Smith - 1994  ACM   DBLP
59   Operating system support for improving data locality on CC-N.. (context) - Verghese, Devine et al. - 1996  DBLP
35   Dynamic page mapping policies for cache conflict resolution .. - Romer, Lee et al. - 1994  DBLP
34   Alpha 21164 MicroprocessorHardware Reference Manual (context) - Corporation - 1995
32   Informing memory operations: Providing memory performancefee.. - Horowitz, Martonosi et al. - 1996
31   Reducing TLB and memory overhead using online superpage prom.. - Romer, Ohlrich et al. - 1995  ACM   DBLP
24   Predicting data cache misses in nonnumeric applications thro.. - Luk, Mowry - 1997
19   Accurate and practical profile-driven compilation using the .. - Conte, Menezes et al. - 1996  ACM   DBLP
18   Hot cold optimization of large Windows /NT applications (context) - Cohn, Lowney - 1996
15   Using branch handling hardware to support profile-driven opt.. - Conte, Patel et al. - 1994  ACM   DBLP
15   Predicting load latencies using cache profiling (context) - Abraham, Rau - 1994
3   The Alpha 21264: A 500 MHz Out-ofOrder Execution Microproces.. (context) - Leibholz, Razdan - 1997
2   Pro Processor Developer's Manual (context) - Corporation - 1997
1   assigned to International Business Machines Corporation (context) - Westcott, White et al. - 1992



The graph only includes citing articles where the year of publication is known.


Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC