Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Waist Tightening of CNNs: A Case study on Tiny YOLOv3 for Distributed IoT Implementations
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).
Show others and affiliations
2023 (English)In: ACM International Conference Proceeding Series, Association for Computing Machinery (ACM), 2023, p. 241-246Conference paper, Published paper (Refereed)
Abstract [en]

Computer vision systems in sensor nodes of the Internet of Things (IoT) based on Deep Learning (DL) are demanding because the DL models are memory and computation hungry while the nodes often come with tight constraints on energy, latency, and memory. Consequently, work has been done to reduce the model size or distribute part of the work to other nodes. However, then the question arises how these approaches impact the energy consumption at the node and the inference time of the system. In this work, we perform a case study to explore the impact of partitioning a Convolutional Neural Network (CNN) such that one part is implemented on the IoT node, while the rest is implemented on an edge device. The goal is to explore how the choice of partition point, quantization method and communication technology affects the IoT system. We identify possible partitioning points between layers, where we transform the feature maps passed between layers by applying quantization and compression to reduce the data sent over the communication channel between the two partitions in Tiny YOLOv3. The results show that a reduction of transmitted data by 99.8% reduces the network accuracy by 3 percentage points. Furthermore, the evaluation of various IoT communication protocols shows that the quantization of data facilitates CNN network partitioning with significant reduction of overall latency and node energy consumption. 

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023. p. 241-246
Keywords [en]
CNN partitioning, convolutional neural networks, intelligence partitioning, Internet of Things, smart camera
National Category
Communication Systems
Identifiers
URN: urn:nbn:se:miun:diva-48419DOI: 10.1145/3576914.3587518ISI: 001054880600042Scopus ID: 2-s2.0-85159789406ISBN: 9798400700491 (print)OAI: oai:DiVA.org:miun-48419DiVA, id: diva2:1763608
Conference
2023 Cyber-Physical Systems and Internet-of-Things Week, CPS-IoT Week 2023, 9 May 2023 through 12 May 2023
Available from: 2023-06-07 Created: 2023-06-07 Last updated: 2026-01-21Bibliographically approved
In thesis
1. Partitioned Deep Neural Network Inference on Resource-Constrained IoT Devices: A System-Level Methodology
Open this publication in new window or tab >>Partitioned Deep Neural Network Inference on Resource-Constrained IoT Devices: A System-Level Methodology
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The proliferation of the Internet of Things (IoT) has driven the deployment of Deep Learning models on constrained edge devices. However, a fundamental conflict exists between the computational demands of Deep Neural Networks (DNNs) and the strict energy and processing limits of battery-operated nodes. While intelligence partitioning offers a potential solution by offloading computation to a server, practical deployment is hindered by the structural barrier of modern DNNs, which are characterized by intensive early-layer computation and intermediate data expansion, creating critical bottlenecks in distributed environments. This thesis presents a system-level methodology to bridge the gap between algorithmic demands and hardware constraints.

The research begins by identifying the governing parameters of system efficiency through a systematic analysis method and a Design Space Exploration (DSE) method. Based on these core determinants, a co-design strategy is introduced to overcome the structural barrier to partitioning. By synergistically combining model- and data-level transformations, this approach induces efficiency at potential partition points, significantly reducing node energy consumption and system latency. Finally, the thesis proposes an accuracy recovery method to effectively decouple node efficiency from application accuracy. By shifting the paradigm from loss mitigation to compensation, this reconstruction engine ensures that performance is maintained relative to the baseline accuracy even under extreme optimization actions.

In summary, this thesis establishes a system-level methodology for the efficient partitioning of DNNs. It demonstrates that by operationalizing the presented formal design workflow, it is possible to exploit the capabilities of resource-unconstrained servers to maximize node battery life and minimize system response time. This work lays the foundation for ubiquitous intelligence, enabling the deployment of advanced AI on resource-limited hardware by transforming the structural limitations of DNNs into opportunities for distributed efficiency.

Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University, 2026. p. 77
Series
Mid Sweden University doctoral thesis, ISSN 1652-893X ; 445
Keywords
Edge AI, Split Computing, DNN Partitioning, Co-optimization, Accuracy recovery, Feature map reconstruction, Feature map regeneration, Node-server partitioning, Design Space Exploration, Hardware-Aware Design, Distributed Inference, Deep Neural Networks
National Category
Computer Vision and Learning Systems Other Electrical Engineering, Electronic Engineering, Information Engineering Embedded Systems
Identifiers
urn:nbn:se:miun:diva-56427 (URN)978-91-90017-54-8 (ISBN)
Public defence
2026-02-18, L111, Holmgatan 10, Sundsvall, 09:00 (English)
Opponent
Supervisors
Projects
Research profile NIIT
Funder
Knowledge Foundation, 20180170
Available from: 2026-01-22 Created: 2026-01-21 Last updated: 2026-01-22Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Sánchez Leal, IsaacSaqib, EirajShallari, IridaKrug, SilviaO'Nils, Mattias

Search in DiVA

By author/editor
Sánchez Leal, IsaacSaqib, EirajShallari, IridaKrug, SilviaO'Nils, Mattias
By organisation
Department of Computer and Electrical Engineering (2023-)
Communication Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 735 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf