AIoT and edge computing of sensor data

The Internet of Things (IoT) consists of resource-constrained devices or sensors connected to the network. These devices send large amounts of data to the servers in the cloud, to be stored and processed. Relying on the cloud has its disadvantages, namely high network usage, privacy concerns and slo...

Full description

Bibliographic Details
Main Author:	Šalej, Jakob
Other Authors:	Trebar, Mira
Format:	Master Thesis
Language:	English
Published:	2021
Subjects:	internet of things machine learning sensor data edge computing anomaly detection internet stvari strojno učenje senzorski podatki robna obdelava detekcija anomalij sami
Online Access:	https://repozitorij.uni-lj.si/IzpisGradiva.php?id=133344 https://repozitorij.uni-lj.si/Dokument.php?id=151342&dn= https://plus.si.cobiss.net/opac7/bib/87405571?lang=sl

id	ftuniljubljanair:oai:repozitorij.uni-lj.si:IzpisGradiva.php-id-133344
record_format	openpolar
institution	Open Polar
collection	Repository of the University of Ljubljana (RUL)
op_collection_id	ftuniljubljanair
language	English
topic	internet of things machine learning sensor data edge computing anomaly detection internet stvari strojno učenje senzorski podatki robna obdelava detekcija anomalij
spellingShingle	internet of things machine learning sensor data edge computing anomaly detection internet stvari strojno učenje senzorski podatki robna obdelava detekcija anomalij Šalej, Jakob AIoT and edge computing of sensor data
topic_facet	internet of things machine learning sensor data edge computing anomaly detection internet stvari strojno učenje senzorski podatki robna obdelava detekcija anomalij
description	The Internet of Things (IoT) consists of resource-constrained devices or sensors connected to the network. These devices send large amounts of data to the servers in the cloud, to be stored and processed. Relying on the cloud has its disadvantages, namely high network usage, privacy concerns and slower data processing. These downsides can be mitigated with a new paradigm - edge computing. The new category of smart AIoT (Artificial Intelligence + IoT) devices is capable of learning from and responding to new scenarios instantly by using ML methods locally, on-device. The goal of this master's thesis is to analyse the possibilities of local data processing on resource-constrained edge devices. An experimental analysis of classification and runtime performance of ML algorithms is provided. Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF) and Artificial Neural Network (ANN) are evaluated on a dataset DS2OS traffic traces in an anomaly detection domain. Additionally, Stochastic Gradient Descent (SGD) is used for incremental learning. Algorithm performance is measured on two devices: Raspberry Pi 4 model B serves as a reference edge device, while a laptop PC XPS 13 provides a performance baseline. Mode Ds and mode D implementations differ in how data is split into training and test sets. Two methods for reducing dataset size are presented: randomly sampled smaller datasets and datasets with reduced majority class. Classification results on initial training sets and reduced training sets show that SVM, DT and RF perform the best. Performance analysis shows that DT achieves fastest training and inference times on RPi 4. By using datasets with reduced majority class, RPi 4 is able to match XPS 13 runtime performance results with only a small decrease in classification accuracy. Internet stvari (IoT) sestavljajo naprave in senzorji z omejenimi viri. Imajo dostop do omrežja in pošiljajo velike količine podatkov v strežnike v oblaku, kjer se podatki shranjujejo in obdelujejo. Zanašanje na oblačne storitve ima svoje pomanjkljivosti: obremenjenost omrežja, pomanjkljiva zasebnost in počasnejša obdelava podatkov. Nova paradigma robne obdelave podatkov lahko te težave odpravi. Kategorija pametnih naprav AIoT (AI + IoT) je z lokalno uporabo metod strojnega učenja na sami napravi zmožna učenja in prilagajanja novim situacijam. Glavni cilj magistrske naloge je analiza zmožnosti lokalne obdelave podatkov na robnih napravah z omejenimi viri. Izvedena je eksperimentalna analiza klasifikacijske točnosti in časovne učinkovitosti izvajanja algoritmov strojnega učenja. Metode strojnega učenja LR (ang. Logistic Regression), SVM (ang. Support Vector Machine), DT (ang. Decision Tree), RF (ang.Random Forest) in ANN (ang. Artificial Neural Network) so primerjane na področju detekcije anomalij z uporabo podatkovne zbirke DS2OS. V analizo je vključena tudi metoda SGD (ang. Stochastic Gradient Descent) v domeni inkrementalnega učenja. Uporabljeni sta dve testni napravi: Raspberry Pi 4 (model B) in prenosni računalnik XPS 13. Implementirana sta dva načina za za določanje učnih in testnih množic, poimenovana način Ds in način D. Predstavljena sta tudi dva načina za določanje manjših učnih množic: naključno izbrane manjše učne množice in učne množice z zmanjšanim večinskim razredom. Rezultati klasifikacije na začetnih učnih množicah in zmanjšanih učnih množicah pokažejo, da se pri razvrščanju najbolje odrežejo metode SVM, DT in RF. Analiza hitrosti izvajanja pokaže, da metoda DT doseže najhitrejše čase učenja in predikcije na napravi RPi 4. Z določanjem učnih množic z zmanjšanim večinskim razredom se delovanje RPi 4 pohitri na nivo naprave XPS 13, ob tem pa pride le do manjše izgube klasifikacijske točnosti.
author2	Trebar, Mira
format	Master Thesis
author	Šalej, Jakob
author_facet	Šalej, Jakob
author_sort	Šalej, Jakob
title	AIoT and edge computing of sensor data
title_short	AIoT and edge computing of sensor data
title_full	AIoT and edge computing of sensor data
title_fullStr	AIoT and edge computing of sensor data
title_full_unstemmed	AIoT and edge computing of sensor data
title_sort	aiot and edge computing of sensor data
publishDate	2021
url	https://repozitorij.uni-lj.si/IzpisGradiva.php?id=133344 https://repozitorij.uni-lj.si/Dokument.php?id=151342&dn= https://plus.si.cobiss.net/opac7/bib/87405571?lang=sl
genre	sami
genre_facet	sami
op_relation	https://repozitorij.uni-lj.si/IzpisGradiva.php?id=133344 https://repozitorij.uni-lj.si/Dokument.php?id=151342&dn= https://plus.si.cobiss.net/opac7/bib/87405571?lang=sl
op_rights	info:eu-repo/semantics/openAccess
_version_	1766186848184434688
spelling	ftuniljubljanair:oai:repozitorij.uni-lj.si:IzpisGradiva.php-id-133344 2023-05-15T18:14:08+02:00 AIoT and edge computing of sensor data AIoT in robna obdelava senzorskih podatkov Šalej, Jakob Trebar, Mira 2021-11-23 application/pdf https://repozitorij.uni-lj.si/IzpisGradiva.php?id=133344 https://repozitorij.uni-lj.si/Dokument.php?id=151342&dn= https://plus.si.cobiss.net/opac7/bib/87405571?lang=sl eng eng https://repozitorij.uni-lj.si/IzpisGradiva.php?id=133344 https://repozitorij.uni-lj.si/Dokument.php?id=151342&dn= https://plus.si.cobiss.net/opac7/bib/87405571?lang=sl info:eu-repo/semantics/openAccess internet of things machine learning sensor data edge computing anomaly detection internet stvari strojno učenje senzorski podatki robna obdelava detekcija anomalij info:eu-repo/semantics/masterThesis info:eu-repo/semantics/publishedVersion 2021 ftuniljubljanair 2021-12-06T10:27:47Z The Internet of Things (IoT) consists of resource-constrained devices or sensors connected to the network. These devices send large amounts of data to the servers in the cloud, to be stored and processed. Relying on the cloud has its disadvantages, namely high network usage, privacy concerns and slower data processing. These downsides can be mitigated with a new paradigm - edge computing. The new category of smart AIoT (Artificial Intelligence + IoT) devices is capable of learning from and responding to new scenarios instantly by using ML methods locally, on-device. The goal of this master's thesis is to analyse the possibilities of local data processing on resource-constrained edge devices. An experimental analysis of classification and runtime performance of ML algorithms is provided. Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF) and Artificial Neural Network (ANN) are evaluated on a dataset DS2OS traffic traces in an anomaly detection domain. Additionally, Stochastic Gradient Descent (SGD) is used for incremental learning. Algorithm performance is measured on two devices: Raspberry Pi 4 model B serves as a reference edge device, while a laptop PC XPS 13 provides a performance baseline. Mode Ds and mode D implementations differ in how data is split into training and test sets. Two methods for reducing dataset size are presented: randomly sampled smaller datasets and datasets with reduced majority class. Classification results on initial training sets and reduced training sets show that SVM, DT and RF perform the best. Performance analysis shows that DT achieves fastest training and inference times on RPi 4. By using datasets with reduced majority class, RPi 4 is able to match XPS 13 runtime performance results with only a small decrease in classification accuracy. Internet stvari (IoT) sestavljajo naprave in senzorji z omejenimi viri. Imajo dostop do omrežja in pošiljajo velike količine podatkov v strežnike v oblaku, kjer se podatki shranjujejo in obdelujejo. Zanašanje na oblačne storitve ima svoje pomanjkljivosti: obremenjenost omrežja, pomanjkljiva zasebnost in počasnejša obdelava podatkov. Nova paradigma robne obdelave podatkov lahko te težave odpravi. Kategorija pametnih naprav AIoT (AI + IoT) je z lokalno uporabo metod strojnega učenja na sami napravi zmožna učenja in prilagajanja novim situacijam. Glavni cilj magistrske naloge je analiza zmožnosti lokalne obdelave podatkov na robnih napravah z omejenimi viri. Izvedena je eksperimentalna analiza klasifikacijske točnosti in časovne učinkovitosti izvajanja algoritmov strojnega učenja. Metode strojnega učenja LR (ang. Logistic Regression), SVM (ang. Support Vector Machine), DT (ang. Decision Tree), RF (ang.Random Forest) in ANN (ang. Artificial Neural Network) so primerjane na področju detekcije anomalij z uporabo podatkovne zbirke DS2OS. V analizo je vključena tudi metoda SGD (ang. Stochastic Gradient Descent) v domeni inkrementalnega učenja. Uporabljeni sta dve testni napravi: Raspberry Pi 4 (model B) in prenosni računalnik XPS 13. Implementirana sta dva načina za za določanje učnih in testnih množic, poimenovana način Ds in način D. Predstavljena sta tudi dva načina za določanje manjših učnih množic: naključno izbrane manjše učne množice in učne množice z zmanjšanim večinskim razredom. Rezultati klasifikacije na začetnih učnih množicah in zmanjšanih učnih množicah pokažejo, da se pri razvrščanju najbolje odrežejo metode SVM, DT in RF. Analiza hitrosti izvajanja pokaže, da metoda DT doseže najhitrejše čase učenja in predikcije na napravi RPi 4. Z določanjem učnih množic z zmanjšanim večinskim razredom se delovanje RPi 4 pohitri na nivo naprave XPS 13, ob tem pa pride le do manjše izgube klasifikacijske točnosti. Master Thesis sami Repository of the University of Ljubljana (RUL)

AIoT and edge computing of sensor data

Similar Items