  Web servers under overload: How scheduling can help (2003) [35 citations — 2 self]

by B. Schroeder, M. Harchol-Balter
http://www-2.cs.cmu.edu/~harchol/Papers/systemsoverload.ps

Abstract:

Most well-managed web servers perform well most of the time. Occasionally, however, every popular web server experiences transient overload. An overloaded web server typically displays signs of its affliction within a few seconds. Work enters the web server at a greater rate than the web server can complete it, causing the number of connections at the server to build up. This implies large delays for clients accessing the server. First, this paper provides a systematic performance study of exactly what happens when a web server is run under transient overload, both from the perspective of the server and from the perspective of the client. Second, this paper proposes and evaluates a particular kernel-level solution for improving the performance of web servers under overload. The solution is based on SRPT (shortest remaining processing time) connection scheduling. We show that SRPT-based scheduling improves overload performance across a variety of client-oriented and server-oriented metrics.
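
To illustrate the general idea behind SRPT scheduling mentioned in the abstract, here is a minimal Python sketch of a single-server queue that compares first-come-first-served order with preemptive SRPT. It is a toy model under stated assumptions, not the paper's kernel-level implementation: the function name `mean_response_time`, the `(arrival_time, size)` job format, and the sample workload are all hypothetical and chosen only for illustration.

```python
import heapq

def mean_response_time(jobs, policy):
    """Simulate one server. jobs: list of (arrival_time, size).
    policy: "fifo" (run in arrival order) or "srpt" (always run the
    job with the least remaining work, preempting if needed)."""
    pending = sorted(jobs)          # jobs ordered by arrival time
    queue = []                      # heap of (key, seq, arrival, remaining)
    now, i, total, seq = 0.0, 0, 0.0, 0
    while i < len(pending) or queue:
        if not queue:               # server idle: jump to the next arrival
            now = max(now, pending[i][0])
        # Admit every job that has arrived by `now`.
        while i < len(pending) and pending[i][0] <= now:
            arrival, size = pending[i]
            key = size if policy == "srpt" else seq
            heapq.heappush(queue, (key, seq, arrival, size))
            seq += 1
            i += 1
        key, s, arrival, remaining = heapq.heappop(queue)
        # Run the chosen job until it finishes or the next job arrives.
        next_arrival = pending[i][0] if i < len(pending) else float("inf")
        run = min(remaining, next_arrival - now)
        now += run
        remaining -= run
        if remaining > 1e-12:       # not finished: requeue with updated key
            new_key = remaining if policy == "srpt" else key
            heapq.heappush(queue, (new_key, s, arrival, remaining))
        else:
            total += now - arrival  # response time = completion - arrival
    return total / len(jobs)

# Hypothetical workload: one large request followed by two small ones.
jobs = [(0, 10), (1, 1), (2, 1)]
fifo = mean_response_time(jobs, "fifo")
srpt = mean_response_time(jobs, "srpt")
print(f"FIFO mean response time: {fifo:.2f}")
print(f"SRPT mean response time: {srpt:.2f}")
```

In this small workload the two short requests no longer wait behind the long one under SRPT, so the mean response time drops, while the long request finishes somewhat later than it would in arrival order.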

Citations

610 Generating Representative Web Workloads for Network and Server Performance Evaluation – Barford, Crovella - 1998
371 Resource containers: A new facility for resource management in server systems – Banga, Druschel, et al. - 1999
303 Dummynet: A simple approach to the evaluation of network protocols – Rizzo - 1997
302 Computer Networking: A Top-Down Approach Featuring the Internet – Kurose, Ross - 2001
260 The case for geographical push-caching – Gwertzman, Seltzer - 1995
235 Eliminating receive livelock in an interrupt-driven kernel – Mogul, Ramakrishnan - 1997
220 Flash: An efficient and portable Web server – Pai, Druschel, et al. - 1999
204 Modeling TCP Reno Performance: A Simple Model and Its Empirical Validation – Padhye, Firoiu, et al. - 2000
172 Lazy receiver processing (LRP): A network subsystem architecture for server systems – Druschel, Banga - 1996
158 The Case for Persistent-Connection HTTP – Mogul - 1995
143 Workload Characterization of the 1998 World Cup Web Site – Arlitt, Jin - 1999
131 TCP Behavior of a Busy Internet Server: Analysis and Improvements – Balakrishnan, Padmanabhan, et al. - 1998
121 Computing TCP’s Retransmission Timer – Paxson, Allman - 2000
108 Analysis of SRPT scheduling: investigating unfairness – Bansal, Harchol-Balter - 2001
107 On the use and performance of content distribution networks – Krishnamurthy, Wills, et al. - 2001
107 Network behavior of a busy Web server and its clients – Mogul - 1995
100 A scalable and highly available web server – Dias, Kish, et al. - 1996
97 Flow and stretch metrics for scheduling continuous job streams – Bender, Chakrabarti, et al. - 1998
93 Connection scheduling in Web servers – Crovella, Frangioso, et al. - 1999
92 Web facts and fantasy – Manley, Seltzer - 1997
87 The measured performance of content distribution networks – Johnson, Carr, et al. - 2000
75 Adaptive Overload Control for Busy Internet Servers – Welsh, Culler - 2003
70 Session based admission control: a mechanism for improving the performance of an overloaded web server – Cherkasova, Phaal - 1998
68 Web traffic characterization: an assessment of the impact of caching documents from NCSA's Web server – Braun, Claffy - 1994
65 Server Operating Systems – Kaashoek, Engler, et al. - 1996
62 Size-based scheduling to improve web performance – Harchol-Balter, Schroeder, et al. - 2003
58 Web Content Adaptation to Improve Server Overload Behavior – Abdelzaher, Bhatti - 1999
57 A proof of the optimality of the shortest remaining processing time discipline – Schrage - 1968
55 Measuring the capacity of a Web server under realistic loads – Banga, Druschel - 1999
55 Classifying scheduling policies with respect to unfairness in an M/GI/1 – Wierman, Harchol-Balter - 2003
54 Analysis of task assignment policies in scalable distributed Web-server systems – Colajanni, Yu, et al. - 1998
53 A performance evaluation of hyper text transfer protocols – Barford, Crovella - 1999
53 Kernel mechanisms for service differentiation in overloaded Web servers – Voigt, Tewari, et al. - 2001
51 Application-level document caching in the Internet – Bestavros, Carter, et al. - 1995
48 A method for transparent admission control and request scheduling in e-commerce web sites – Elnikety, Nahum, et al. - 2004
48 The queue M/G/1 with the shortest remaining processing time discipline – Schrage, Miller - 1966
37 Analysis of LAS scheduling for job size distributions with high variance – Rai, Urvoy-Keller, et al. - 2003
35 The Effects of Wide-Area Conditions on WWW Server Performance – Nahum, Rosu, et al. - 2001
34 Session-based overload control in QoS-aware web servers – Chen, Mohapatra - 2002
33 Linux network traffic control — implementation overview. Available at http://lrcwww.epfl.ch/linux-diffserv – Almesberger - 1999
31 Priority mechanisms for OLTP and transactional web applications – McWherter, Schroeder, et al. - 2004
31 Operating System Concepts, 5th Edition – Silberschatz, Galvin - 1998
27 Syn cookies. http://cr.yp.to/syncookies.html – Bernstein - 1997
26 Overload Control Mechanisms for Web Servers – Iyer, Tewari, et al. - 2000
18 Operating System Concepts, Sixth Edition – Silberschatz, Galvin, et al. - 2002
17 TCP implementation enhancements for improving Webserver performance – Aron, Druschel - 1999
16 Fairness and efficiency in web server protocols – Friedman, Henderson - 2003
16 CNN.com: Facing a World Crisis – LeFebvre - 2002
16 Flash: An efficient and portable Web server – Pai, Druschel, et al. - 1999
15 SWIFT: Scheduling in web servers for fast response time – Rawat, Kshemkalyani - 2003