A statistical model of internet traffic

We present a method to extract a time series (Number of Active Requests (NAR)) from web cache logs which serves as a transport level measurement of internet traffic. This series also reflects the performance or Quality of Service of a web cache. Using time series modelling, we interpret the properti...

Full description

Bibliographic Details
Main Author: Keogh-Brown, Marcus R.
Published: Queen Mary, University of London 2003
Subjects:
Online Access:https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.408506
id ndltd-bl.uk-oai-ethos.bl.uk-408506
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-4085062019-02-27T03:23:08ZA statistical model of internet trafficKeogh-Brown, Marcus R.2003We present a method to extract a time series (Number of Active Requests (NAR)) from web cache logs which serves as a transport level measurement of internet traffic. This series also reflects the performance or Quality of Service of a web cache. Using time series modelling, we interpret the properties of this kind of internet traffic and its effect on the performance perceived by the cache user. Our preliminary analysis of NAR concludes that this dataset is suggestive of a long-memory self-similar process but is not heavy-tailed. Having carried out more in-depth analysis, we propose a three stage modelling process of the time series: (i) a power transformation to normalise the data, (ii) a polynomial fit to approximate the general trend and (iii) a modelling of the residuals from the polynomial fit. We analyse the polynomial and show that the residual dataset may be modelled as a FARIMA(p, d, q) process. Finally, we use Canonical Variate Analysis to determine the most significant defining properties of our measurements and draw conclusions to categorise the differences in traffic properties between the various caches studied. We show that the strongest illustration of differences between the caches is shown by the short memory parameters of the FARIMA fit. We compare the differences revealed between our studied caches and draw conclusions on them. Several programs have been written in Perl and S programming languages for this analysis including totalqd.pl for NAR calculation, fullanalysis for general statistical analysis of the data and armamodel for FARIMA modelling.025.04015118MathematicsQueen Mary, University of Londonhttps://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.408506http://qmro.qmul.ac.uk/xmlui/handle/123456789/1811Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 025.04015118
Mathematics
spellingShingle 025.04015118
Mathematics
Keogh-Brown, Marcus R.
A statistical model of internet traffic
description We present a method to extract a time series (Number of Active Requests (NAR)) from web cache logs which serves as a transport level measurement of internet traffic. This series also reflects the performance or Quality of Service of a web cache. Using time series modelling, we interpret the properties of this kind of internet traffic and its effect on the performance perceived by the cache user. Our preliminary analysis of NAR concludes that this dataset is suggestive of a long-memory self-similar process but is not heavy-tailed. Having carried out more in-depth analysis, we propose a three stage modelling process of the time series: (i) a power transformation to normalise the data, (ii) a polynomial fit to approximate the general trend and (iii) a modelling of the residuals from the polynomial fit. We analyse the polynomial and show that the residual dataset may be modelled as a FARIMA(p, d, q) process. Finally, we use Canonical Variate Analysis to determine the most significant defining properties of our measurements and draw conclusions to categorise the differences in traffic properties between the various caches studied. We show that the strongest illustration of differences between the caches is shown by the short memory parameters of the FARIMA fit. We compare the differences revealed between our studied caches and draw conclusions on them. Several programs have been written in Perl and S programming languages for this analysis including totalqd.pl for NAR calculation, fullanalysis for general statistical analysis of the data and armamodel for FARIMA modelling.
author Keogh-Brown, Marcus R.
author_facet Keogh-Brown, Marcus R.
author_sort Keogh-Brown, Marcus R.
title A statistical model of internet traffic
title_short A statistical model of internet traffic
title_full A statistical model of internet traffic
title_fullStr A statistical model of internet traffic
title_full_unstemmed A statistical model of internet traffic
title_sort statistical model of internet traffic
publisher Queen Mary, University of London
publishDate 2003
url https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.408506
work_keys_str_mv AT keoghbrownmarcusr astatisticalmodelofinternettraffic
AT keoghbrownmarcusr statisticalmodelofinternettraffic
_version_ 1718983942636306432