See this document in CiteSeerX!

Code Layout Optimizations for Transaction Processing Workloads (2001)  (Make Corrections)  (7 citations)
Alex Ramírez, Luiz André Barroso, Kourosh Gharachorloo, Robert Cohn, Josep Larriba-Pey, P. Geoffrey Lowney, Mateo Valero



  Home/Search   Context   Related

 
View or download:
people.ac.upc.es/aramire...isca01.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  people.ac.upc.es/aramirez...index (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Commercial applications such as databases and Web servers constitute the most important market segment for high-performance servers. Among these applications, on-line transaction processing (OLTP) workloads provide a challenging set of requirements for system designs since they often exhibit inefficient executions dominated by a large memory stall component. This behavior arises from large instruction and data footprints and high communication miss rates. A number of recent studies have... (Update)

Context of citations to this paper:   More

.... to align branches to benefit the underlying fetch architecture and branch predictor [4] Our previous work presents a detailed analysis [25] of the effects of these optimizations, concluding that the improvements on the instruction cache performance are due to an increase in...

Cited by:   More
StagedDB: Designing Database Servers for Modern Hardware - Stavros Harizopoulos.. (2005)   (Correct)
An Analysis of Dynamic Instruction Streams - Oliverio Santana Marco (2003)   (Correct)
Hardware Support for Prescient Instruction Prefetch - Aamodt, Chow, Hammarlund.. (2004)   (Correct)

Active bibliography (related documents):   More   All
0.8:   Optimization of Instruction Fetch for Decision.. - Ramírez.. (1999)   (Correct)
0.5:   Impact of Chip-Level Integration on Performance of OLTP.. - Luiz Andr Barroso (2000)   (Correct)
0.5:   Performance of Database Workloads on Shared-Memory .. - Ranganathan.. (1998)   (Correct)

Similar documents based on text:   More   All
1.1:   Piranha: A Scalable Architecture Based on.. - Barroso.. (2000)   (Correct)
0.9:   Memory System Characterization of Commercial Workloads - Barroso, Gharachorloo.. (1998)   (Correct)
0.5:   Code Reordering of Decision Support Systems for.. - Ramírez.. (1998)   (Correct)

Related documents from co-citation:   More   All
4:   Trace Cache: A Low Latency Approach to High Bandwidth Instruction Fetching - Rotenberg, Bennett et al. - 1996
3:   Spike optimizer alphant executable - Lowney, An et al. - 1997
2:   Software trace cache (context) - Ramirez, Larriba-Pey et al. - 1999

BibTeX entry:   (Update)

A. Ramirez, L. A. Barroso, K. A. Gharachorloo, R. Cohn, J. Larriba-Pey, P. G. Lowney, M. Valero. Code Layout Optimizations for Transaction Processing Workloads. Proceedings of the 28 th Intl. Symposium on Computer Architecture, pp. 155-164, June 2001 http://citeseer.ist.psu.edu/ramirez01code.html   More

@misc{ ramirez01code,
  author = "A. Ramirez and L. Barroso and K. Gharachorloo and R. Cohn and J. Larriba-Pey
    and P. Lowney and M. Valero",
  title = "Code Layout Optimizations for Transaction Processing Workloads",
  text = "A. Ramirez, L. A. Barroso, K. A. Gharachorloo, R. Cohn, J. Larriba-Pey,
    P. G. Lowney, M. Valero. Code Layout Optimizations for Transaction Processing
    Workloads. Proceedings of the 28 th Intl. Symposium on Computer Architecture,
    pp. 155-164, June 2001",
  year = "2001",
  url = "citeseer.ist.psu.edu/ramirez01code.html" }
Citations (may not include all citations):
121   Continuous profiling: Where have all the cycles gone - Anderson, Berc et al. - 1997
115   Program optimization for instruction caches (context) - McFarling - 1989
107   Achieving high instruction cache performance with an optimiz.. (context) - Hwu, Chang - 1989
107   Memory system characterization of commercial workloads - Barroso, Gharachorloo et al. - 1998
91   The impact of architectural trends on operating system perfo.. (context) - Rosenblum, Bugnion et al. - 1995
89   A practical system for intermodule code optimization at link.. - Srivastava, Wall - 1992
89   Using the SimOS machine simulator to study complex computer .. - Rosenblum, Bugnion et al. - 1997
72   Performance characterization of a quad pentium pro smp using.. - Keeton, Patterson et al. - 1998
67   Contrasting characteristics and cache performance of technic.. (context) - Maynard, Donnelly et al. - 1994
65   Studies of windows NT performance using dynamic execution tr.. - Perl, Sites - 1996
54   Piranha: A Scalable Architecture Based on Single-Chip Multip.. - Barroso, Gharachorloo et al. - 2000
53   Procedure placement using temporal ordering information - Gloy, Blackwell et al. - 1997
51   Optimizing instruction cache performance for operating syste.. - Torrellas, Xia et al. - 1995
47   Efficient procedure mapping using cache line coloring - Hashemi, Kaeli et al. - 1997
47   An analysis of database workload performance on simultaneous.. - Lo, Barroso et al. - 1998
22   Optimizing alpha executables on windows NT with spike - Cohn, Goodwin et al. - 1997
15   Improving locality by critical working sets (context) - Ferrari - 1974
14   Program restructuring for virtual memory (context) - Hartfield, Gerald - 1971
10   AlphaServer 4100 performance characterization - Cvetanovic, Donaldson - 1996
10   Impact of Chip-Level Integration on Performance of OLTP Work.. - Barroso, Gharachorloo et al. - 2000
10   Online Transaction Processing) Standard Specification (context) - Performance, Benchmark - 1990
9   Software trace cache - Ram, Larriba-Pey et al. - 1999
8   on ProgrammingLanguage Design and Implementation (context) - Pettis, Hansen et al. - 1990
7   Compile-time program restructuring in multiprogrammed virtua.. (context) - Hartley - 1988
6   Profile-directed restructuring of operating system code (context) - Schmidt, Roediger et al. - 1998
4   Optimization of instruction fetch for decision support workl.. - Ram, Larriba-Pey et al. - 1999
4   Temporal-based procedure reordering for improved instruction.. (context) - Kalamaitianos, Kaeli - 1998
4   Performance characterization of the Alpha 21164 microprocess.. (context) - Cventanovic, Bhandarkar - 1994
2   Performance of database workloads on shared-memory systems w.. (context) - Rangananthan, Gharachorloo et al. - 1998
http://www.tpc.org



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://people.ac.upc.es/aramirez/papers/index.html):   More
A Comparative Study of Redundancy in Trace Caches - Vandierendonck..   (Correct)
Branch Classification to Control Instruction Fetch .. - Knijnenburg.. (2002)   (Correct)
Fetching Instruction Streams - Alex Ramirez Oliverio (2002)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC