Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees

Traffic classification is important for Internet providers and other organizations to solve some critical network management problems.The most common methods for traffic classification is Deep Packet Inspection (DPI) and port based classification. These methods are starting to become obsolete as mor...

Full description

Bibliographic Details
Main Author: Andersson, Ricky
Format: Others
Language:English
Published: Karlstads universitet, Fakulteten för hälsa, natur- och teknikvetenskap (from 2013) 2017
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kau:diva-55302
id ndltd-UPSALLA1-oai-DiVA.org-kau-55302
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kau-553022019-11-12T22:36:17ZClassification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted TreesengAndersson, RickyKarlstads universitet, Fakulteten för hälsa, natur- och teknikvetenskap (from 2013)2017Machine LearningVideo Traffic ClassificationRandom ForestGradient Boosted TreesGradient Boosted MachineComputer SciencesDatavetenskap (datalogi)Traffic classification is important for Internet providers and other organizations to solve some critical network management problems.The most common methods for traffic classification is Deep Packet Inspection (DPI) and port based classification. These methods are starting to become obsolete as more and more traffic are being encrypted and applications are starting to use dynamic ports and ports of other popular applications. An alternative method for traffic classification uses Machine Learning (ML).This ML method uses statistical features of network traffic flows, which solves the fundamental problems of DPI and port based classification for encrypted flows.The data used in this study is divided into video and non-video traffic flows and the goal of the study is to create a model which can classify video flows accurately in real-time.Previous studies found tree-based algorithms to work well in classifying network traffic. In this study random forest and gradient boosted trees are examined and compared as they are two of the best performing tree-based classification models.Random forest was found to work the best as the classification speed was significantly faster than gradient boosted trees. Over 93% correctly classified flows were achieved while keeping the random forest model small enough to keep fast classification speeds. HITS, 4707Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kau:diva-55302application/pdfinfo:eu-repo/semantics/openAccessapplication/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Machine Learning
Video Traffic Classification
Random Forest
Gradient Boosted Trees
Gradient Boosted Machine
Computer Sciences
Datavetenskap (datalogi)
spellingShingle Machine Learning
Video Traffic Classification
Random Forest
Gradient Boosted Trees
Gradient Boosted Machine
Computer Sciences
Datavetenskap (datalogi)
Andersson, Ricky
Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees
description Traffic classification is important for Internet providers and other organizations to solve some critical network management problems.The most common methods for traffic classification is Deep Packet Inspection (DPI) and port based classification. These methods are starting to become obsolete as more and more traffic are being encrypted and applications are starting to use dynamic ports and ports of other popular applications. An alternative method for traffic classification uses Machine Learning (ML).This ML method uses statistical features of network traffic flows, which solves the fundamental problems of DPI and port based classification for encrypted flows.The data used in this study is divided into video and non-video traffic flows and the goal of the study is to create a model which can classify video flows accurately in real-time.Previous studies found tree-based algorithms to work well in classifying network traffic. In this study random forest and gradient boosted trees are examined and compared as they are two of the best performing tree-based classification models.Random forest was found to work the best as the classification speed was significantly faster than gradient boosted trees. Over 93% correctly classified flows were achieved while keeping the random forest model small enough to keep fast classification speeds. === HITS, 4707
author Andersson, Ricky
author_facet Andersson, Ricky
author_sort Andersson, Ricky
title Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees
title_short Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees
title_full Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees
title_fullStr Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees
title_full_unstemmed Classification of Video Traffic : An Evaluation of Video Traffic Classification using Random Forests and Gradient Boosted Trees
title_sort classification of video traffic : an evaluation of video traffic classification using random forests and gradient boosted trees
publisher Karlstads universitet, Fakulteten för hälsa, natur- och teknikvetenskap (from 2013)
publishDate 2017
url http://urn.kb.se/resolve?urn=urn:nbn:se:kau:diva-55302
work_keys_str_mv AT anderssonricky classificationofvideotrafficanevaluationofvideotrafficclassificationusingrandomforestsandgradientboostedtrees
_version_ 1719290174485037056