Classification of HTTP traffic based on C5.0 Machine Learning Algorithm

Research output: Research - peer-reviewArticle in proceeding

Abstract

Our previous work demonstrated the possibility of distinguishing several groups of traffic with accuracy of over 99%. Today, most of the traffic is generated by web browsers, which provide different kinds of services based on the HTTP protocol: web browsing, file downloads, audio and voice streaming through third-party plugins, etc. This paper suggests and evaluates two approaches to distinguish various types of HTTP traffic based on the content: distributed among volunteers' machines and centralized running in the core of the network. We also assess the accuracy of the centralized classifier for both the HTTP traffic and mixed HTTP/non-HTTP traffic. In the latter case, we achieved the accuracy of 94%. Finally, we provide graphical characteristics of different kinds of HTTP traffic.
Close

Details

Our previous work demonstrated the possibility of distinguishing several groups of traffic with accuracy of over 99%. Today, most of the traffic is generated by web browsers, which provide different kinds of services based on the HTTP protocol: web browsing, file downloads, audio and voice streaming through third-party plugins, etc. This paper suggests and evaluates two approaches to distinguish various types of HTTP traffic based on the content: distributed among volunteers' machines and centralized running in the core of the network. We also assess the accuracy of the centralized classifier for both the HTTP traffic and mixed HTTP/non-HTTP traffic. In the latter case, we achieved the accuracy of 94%. Finally, we provide graphical characteristics of different kinds of HTTP traffic.
Original languageEnglish
Title of host publicationIEEE Symposium on Computers and Communications (ISCC), 2012
Number of pages6
Place of PublicationCappadocia
PublisherIEEE
Publication date1 Jul 2012
Pages000882 - 000887
ISBN (Print)978-1-4673-2712-1
ISBN (Electronic)978-1-4673-2711-4
DOI
StatePublished - 1 Jul 2012
Publication categoryResearch
Peer-reviewedYes
EventThe Seventeenth IEEE Symposium on Computers and Communications - Cappadocia, Turkey
Duration: 1 Jul 20124 Jul 2012

Conference

ConferenceThe Seventeenth IEEE Symposium on Computers and Communications
LandTurkey
ByCappadocia
Periode01/07/201204/07/2012
SeriesI E E E International Symposium on Computers and Communications
ISSN1530-1346

    Research areas

  • traffic classification, computer networks, HTTP traffic, browser traffic, C5.0, Machine Learning Algorithms (MLAs), performance monitoring

Download statistics

No data available
ID: 62073457