The Applications of Machine Learning Techniques in Networked Systems

Jamshidi, Soheil

dc.contributor.advisor	Rejaie, Reza
dc.contributor.author	Jamshidi, Soheil
dc.date.accessioned	2020-12-08T15:49:21Z
dc.date.available	2020-12-08T15:49:21Z
dc.date.issued	2020-12-08
dc.description.abstract	Many large networked systems, ranging from the Internet to ones deployed atop the Internet (e.g., Amazon), play critical roles in our daily lives. In these systems, individual nodes (e.g., a computer) establish a physical or virtual connection/relationship to form a networked system and exchange data. An important task in these systems is the timely and accurate detection of security or management events (e.g., a denial-of-service attack on campus). Machine learning (ML) models offer a promising data-driven method to learn the "signature" of these events from past instances and use it to detect future events. While ML models have been very successful in other domains (e.g., image processing), there are clear challenges in using them for event detection in networked systems, including (i) the limited availability of large-scale labeled datasets, (ii) the subtle and changing signature of the target event, (iii) selecting and capturing proper traffic features for (re)training, and (iv) the "black-box" nature of ML models. This dissertation presents three different applications of ML models for event detection based on exchanged messages in networked systems that tackle the above challenges. First, we develop an ML-based method to identify incentivized Amazon reviews. To this end, we present a heuristic-based signature to identify explicitly incentivized reviews (EIRs) and characterize the related reviews, products, and reviewers. We use EIRs to train an ML model for detecting implicitly incentivized reviews. Second, we examine how the casting and training strategies of unsupervised ML (and statistical) models affect their accuracy and overhead (and thus feasibility) for forecasting network data streams. In particular, we study the impact of the size, selection, and recency of the training data on accuracy and overhead.
Third, we design and evaluate anomaly detection mechanisms based on an unsupervised ML-based method that takes input data streams from network traffic, end-system, and application load. Furthermore, we leverage model interpretation to identify the most important input data streams and deploy model extraction to infer the rules that represent model behavior. Overall, these three case studies result in numerous insightful findings on a range of practical issues that arise in deploying ML models for event detection in networked systems.	en_US
dc.identifier.uri	https://hdl.handle.net/1794/25911
dc.language.iso	en_US
dc.publisher	University of Oregon
dc.rights	All Rights Reserved.
dc.subject	anomaly detection	en_US
dc.subject	machine learning	en_US
dc.subject	model interpretation	en_US
dc.subject	online review analysis	en_US
dc.subject	text classification	en_US
dc.subject	time series forecasting	en_US
dc.title	The Applications of Machine Learning Techniques in Networked Systems	en_US
dc.type	Electronic Thesis or Dissertation
thesis.degree.discipline	Department of Computer and Information Science
thesis.degree.grantor	University of Oregon
thesis.degree.level	doctoral
thesis.degree.name	Ph.D.

Files

Original bundle

Name: Jamshidi_oregon_0171A_12879.pdf
Size: 2.29 MB
Format: Adobe Portable Document Format

Collections

Theses and Dissertations
Computer Science Theses and Dissertations