Publications

Enabling Machine Learning on the Edge using SRAM Conserving Efficient Neural Networks Execution Approach

Bharath Sudharsan, National University of Ireland, Galway
Pankesh Patel, University of South Carolina - ColumbiaFollow
John G. Breslin, National University of Ireland, Galway
Muhammad Intizar Ali, Dublin City University

Document Type

Article

Subject Area(s)

Artificial Intelligence, Embedded System, Internet of Things

Abstract

Edge analytics refers to the application of data analytics and Machine Learning (ML) algorithms on IoT devices. The concept of edge analytics is gaining popularity due to its ability to perform AI-based analytics at the device level, enabling autonomous decision-making, without depending on the cloud. However, the majority of Internet of Things (IoT) devices are embedded systems with a low-cost microcontroller unit (MCU) or a small CPU as its brain, which often are incapable of handling complex ML algorithms.

In this paper, we propose an approach for the ecient execution of already deeply compressed, large neural networks (NNs) on tiny IoT devices. After optimizing NNs using state-of-the-art deep model compression methods, when the resultant models are executed by MCUs or small CPUs using the model execution sequence produced by our approach, higher levels of conserved SRAM can be achieved. During the evaluation for nine popular models, when comparing the default NN execution sequence with the sequence produced by our approach, we found that 1.61-38.06% less SRAM was used to produce inference results, the inference time was reduced by 0.28-4.9 ms, and energy consumption was reduced by 4-84 mJ. Despite achieving such high conserved levels of SRAM, our method 100% preserved the accuracy, F1 score, etc. (model performance).

Publication Info

Preprint version European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021, pages 1-17.

APA Citation

Sudharsan, B., Patel, P., Breslin, J. G., & Ali, M. I. (2021) Enabling machine learning on the edge using SRAM conserving efficient neural networks execution approach. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. https://2021.ecmlpkdd.org/

Download

Included in

Computer and Systems Architecture Commons, Electrical and Computer Engineering Commons, Hardware Systems Commons

COinS

Publications

Enabling Machine Learning on the Edge using SRAM Conserving Efficient Neural Networks Execution Approach

Document Type

Subject Area(s)

Abstract

Publication Info

APA Citation

Included in

Search

Browse

Submissions

Links

Publications

Enabling Machine Learning on the Edge using SRAM Conserving Efficient Neural Networks Execution Approach

Author(s)

Document Type

Subject Area(s)

Abstract

Publication Info

APA Citation

Included in

Share

Search

Browse

Submissions

Links