Results 1 - 10
of
12
A multifractal wavelet model with application to TCP network traffic
- IEEE TRANS. INFORM. THEORY
, 1999
"... In this paper, we develop a new multiscale modeling framework for characterizing positive-valued data with longrange-dependent correlations (1=f noise). Using the Haar wavelet transform and a special multiplicative structure on the wavelet and scaling coefficients to ensure positive results, the mo ..."
Abstract
-
Cited by 151 (30 self)
- Add to MetaCart
In this paper, we develop a new multiscale modeling framework for characterizing positive-valued data with longrange-dependent correlations (1=f noise). Using the Haar wavelet transform and a special multiplicative structure on the wavelet and scaling coefficients to ensure positive results, the model provides a rapid O(N) cascade algorithm for synthesizing N-point data sets. We study both the second-order and multifractal properties of the model, the latter after a tutorial overview of multifractal analysis. We derive a scheme for matching the model to real data observations and, to demonstrate its effectiveness, apply the model to network traffic synthesis. The flexibility and accuracy of the model and fitting procedure result in a close fit to the real data statistics (variance-time plots and moment scaling) and queuing behavior. Although for illustrative purposes we focus on applications in network traffic modeling, the multifractal wavelet model could be useful in a number of other areas involving positive data, including image processing, finance, and geophysics.
Connection-level Analysis and Modeling of Network Traffic
- in ACM SIGCOMM Internet Measurement Workshop
, 2001
"... Abstract — Most network traffic analysis and modeling studies lump all connections together into a single flow. Such aggregate traffic typically exhibits long-range-dependent (LRD) correlations and non-Gaussian marginal distributions. Importantly, in a typical aggregate traffic model, traffic bursts ..."
Abstract
-
Cited by 72 (5 self)
- Add to MetaCart
Abstract — Most network traffic analysis and modeling studies lump all connections together into a single flow. Such aggregate traffic typically exhibits long-range-dependent (LRD) correlations and non-Gaussian marginal distributions. Importantly, in a typical aggregate traffic model, traffic bursts arise from many connections being active simultaneously. In this paper, we develop a new framework for analyzing and modeling network traffic that moves beyond aggregation by incorporating connection-level information. A careful study of many traffic traces acquired in different networking situations reveals (in opposition to the aggregate modeling ideal) that traffic bursts typically arise from just a few high-volume connections that dominate all others. We term such dominating connections alpha traffic. Alpha traffic is caused by large file transmissions over high bandwidth links and is extremely bursty (non-Gaussian). Stripping the alpha traffic from an aggregate trace leaves a beta traffic residual that is Gaussian, LRD, and shares the same fractal scaling exponent as the aggregate traffic. Beta traffic is caused by both small and large file transmissions over low bandwidth links. In our alpha/beta traffic model, the heterogeneity of the network resources give rise to burstiness and heavy-tailed connection durations give rise to LRD. Queuing experiments suggest that the alpha component dictates the tail queue behavior for large queue sizes, whereas the beta component controls the tail queue behavior for small queue sizes. Keywords—network traffic modeling, animal kingdom I.
Simulation of nonGaussian Long-Range-Dependent Traffic using Wavelets
, 1999
"... In this paper, we develop a simple and powerful multiscale model for the synthesis of nonGaussian, long-range dependent (LRD) network traffic. Although wavelets effectively decorrelate LRD data, wavelet-based models have generally been restricted by a Gaussianity assumption that can be unrealistic f ..."
Abstract
-
Cited by 31 (4 self)
- Add to MetaCart
In this paper, we develop a simple and powerful multiscale model for the synthesis of nonGaussian, long-range dependent (LRD) network traffic. Although wavelets effectively decorrelate LRD data, wavelet-based models have generally been restricted by a Gaussianity assumption that can be unrealistic for traffic. Using a multiplicative superstructure on top of the Haar wavelet transform, we exploit the decorrelating properties of wavelets while simultaneously capturing the positivity and "spikiness" of nonGaussian traffic. This leads to a swift O(N) algorithm for fitting and synthesizing N-point data sets. The resulting model belongs to the class of multifractal cascades, a set of processes with rich statistical properties. We elucidate our model's ability to capture the covariance structure of real data and then fit it to real traffic traces. Queueing experiments demonstrate the accuracy of the model for matching real data. Our results indicate that the nonGaussian nature of traffic has a significant effect on queuing.
Multiscale queuing analysis of long-range-dependent network traffic
- Proc. IEEE INFOCOM
, 2000
"... Abstract—Many studies have indicated the importance of capturing scaling properties when modeling traffic loads; however, the influence of long-range dependence (LRD) and marginal statistics still remains on unsure footing. In this paper, we study these two issues by introducing a multiscale traffic ..."
Abstract
-
Cited by 22 (6 self)
- Add to MetaCart
Abstract—Many studies have indicated the importance of capturing scaling properties when modeling traffic loads; however, the influence of long-range dependence (LRD) and marginal statistics still remains on unsure footing. In this paper, we study these two issues by introducing a multiscale traffic model and a novel multiscale approach to queuing analysis. The multifractal wavelet model (MWM) is a multiplicative, wavelet-based model that captures the positivity, LRD, and “spikiness ” of non-Gaussian traffic. Using a binary tree, the model synthesizes an-point data set with only computations. Leveraging the tree structure of the model, we derive a multiscale queuing analysis that provides a simple closed form approximation to the tail queue probability, valid for any given buffer size. The analysis is applicable not only to the MWM but to tree-based models in general, including fractional Gaussian noise. Simulated queuing experiments demonstrate the accuracy of the MWM for matching real data traces and the precision of our theoretical queuing formula. Thus, the MWM is useful not only for fast synthesis of data for simulation purposes but also for applications requiring accurate queuing formulas such as call admission control. Our results clearly indicate that the marginal distribution of traffic at different time-resolutions affects queuing and that a Gaussian assumption can lead to over-optimistic predictions of tail queue probability even when taking LRD into account. I.
Variable Heavy Tailed Durations in Internet Traffic
"... This paper studies tails of the duration distribution of internet data flows, and their "heaviness". Data analysis ..."
Abstract
-
Cited by 18 (6 self)
- Add to MetaCart
This paper studies tails of the duration distribution of internet data flows, and their "heaviness". Data analysis
Long range dependence analysis of Internet traffic: Summary page for LRD project
, 2010
"... Long Range Dependent time series are endemic in the statistical analysis of Internet traffic. The Hurst Parameter provides good summary of important self-similar scaling properties. We compare a number of different Hurst parameter estimation methods and some important variations. This is done in the ..."
Abstract
-
Cited by 10 (7 self)
- Add to MetaCart
Long Range Dependent time series are endemic in the statistical analysis of Internet traffic. The Hurst Parameter provides good summary of important self-similar scaling properties. We compare a number of different Hurst parameter estimation methods and some important variations. This is done in the context of a wide range of simulated, laboratory generated and real data sets. Important differences between the methods are highlighted. Deep insights are revealed on how well the laboratory data mimic the real data. Non-stationarities, that are local in time, are seen to be central issues, and lead to both conceptual and practical recommendations. 1
Variable Heavy Tails in Internet Traffic
"... This paper studies tails of the size distribution of Internet data flows and their "heaviness". Data analysis motivates the concepts of moderate, far and extreme tails for understanding the richness of information available in the data. The data analysis also motivates a notion of "variable tail ind ..."
Abstract
-
Cited by 9 (5 self)
- Add to MetaCart
This paper studies tails of the size distribution of Internet data flows and their "heaviness". Data analysis motivates the concepts of moderate, far and extreme tails for understanding the richness of information available in the data. The data analysis also motivates a notion of "variable tail index", which leads to a generalization of existing theory for heavy-tail durations leading to long-range dependence.
Mice and Elephants Visualization of Internet Traffic
"... Internet traffic is composed of flows, sets of packets being transferred from one computer to another. Some visualizations for understanding the set of flows at a busy internet link are developed. These show graphically that the set of flows is dominated by a relatively few "elephants", and a very l ..."
Abstract
-
Cited by 8 (2 self)
- Add to MetaCart
Internet traffic is composed of flows, sets of packets being transferred from one computer to another. Some visualizations for understanding the set of flows at a busy internet link are developed. These show graphically that the set of flows is dominated by a relatively few "elephants", and a very large number of "mice". It also becomes clear that "representative sampling" from heavy tail distributions is a challenging task.
Multiscale Modeling and Queuing Analysis of Long-Range-Dependent Network Traffic
, 1999
"... We develop a simple multiscale model for the analysis and synthesis of nonGaussian, long-range-dependent (LRD) network traffic loads. The wavelet transform effectively decorrelates LRD signals and hence is well-suited to model such data. However, traditional wavelet-based models are Gaussian in n ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
We develop a simple multiscale model for the analysis and synthesis of nonGaussian, long-range-dependent (LRD) network traffic loads. The wavelet transform effectively decorrelates LRD signals and hence is well-suited to model such data. However, traditional wavelet-based models are Gaussian in nature which one may at the most hope to match second order statistics of inherently nonGaussian traffic loads. Using a multiplicative superstructure atop the Haar wavelet tree, we retain the decorrelating properties of wavelets while simultaneously capturing the positivity and "spikiness" of nonGaussian traffic. This leads to a swift O(N) algorithm for fitting and synthesizing N-point data sets. The resulting model belongs to the class of multifractal cascades, a set of processes with rich scaling properties which are better suited than LRD to capture burstiness. We elucidate our model's ability to capture the covariance structure of real data and then fit it to real traffic traces. We derive approximate analytical queuing formulas for our model, also applicable to other multiscale models, by exploiting its multiscale construction scheme. Queuing experiments demonstrate the accuracy of the model for matching real data and the precision of our theoretical queuing results, thus revealing the potential use of the model for numerous networking applications. Our results indicate that a Gaussian assumption can lead to over-optimistic predictions of tail queue probability even when taking LRD into account.
Mice and Elephants Visualization of Internet
, 2002
"... Internet tra#c is composed of flows, sets of packets being transferred from one computer to another. Some visualizations for understanding the set of flows at a busy internet link are developed. These show graphically that the set of flows is dominated by a relatively few "elephants", and a very lar ..."
Abstract
- Add to MetaCart
Internet tra#c is composed of flows, sets of packets being transferred from one computer to another. Some visualizations for understanding the set of flows at a busy internet link are developed. These show graphically that the set of flows is dominated by a relatively few "elephants", and a very large number of "mice". It also becomes clear that "representative sampling" from heavy tail distributions is a challenging task.

