Open this publication in new window or tab >>2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]
The proliferation of the Internet of Things (IoT) has driven the deployment of Deep Learning models on constrained edge devices. However, a fundamental conflict exists between the computational demands of Deep Neural Networks (DNNs) and the strict energy and processing limits of battery-operated nodes. While intelligence partitioning offers a potential solution by offloading computation to a server, practical deployment is hindered by the structural barrier of modern DNNs, which are characterized by intensive early-layer computation and intermediate data expansion, creating critical bottlenecks in distributed environments. This thesis presents a system-level methodology to bridge the gap between algorithmic demands and hardware constraints.
The research begins by identifying the governing parameters of system efficiency through a systematic analysis method and a Design Space Exploration (DSE) method. Based on these core determinants, a co-design strategy is introduced to overcome the structural barrier to partitioning. By synergistically combining model- and data-level transformations, this approach induces efficiency at potential partition points, significantly reducing node energy consumption and system latency. Finally, the thesis proposes an accuracy recovery method to effectively decouple node efficiency from application accuracy. By shifting the paradigm from loss mitigation to compensation, this reconstruction engine ensures that performance is maintained relative to the baseline accuracy even under extreme optimization actions.
In summary, this thesis establishes a system-level methodology for the efficient partitioning of DNNs. It demonstrates that by operationalizing the presented formal design workflow, it is possible to exploit the capabilities of resource-unconstrained servers to maximize node battery life and minimize system response time. This work lays the foundation for ubiquitous intelligence, enabling the deployment of advanced AI on resource-limited hardware by transforming the structural limitations of DNNs into opportunities for distributed efficiency.
Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University, 2026. p. 77
Series
Mid Sweden University doctoral thesis, ISSN 1652-893X ; 445
Keywords
Edge AI, Split Computing, DNN Partitioning, Co-optimization, Accuracy recovery, Feature map reconstruction, Feature map regeneration, Node-server partitioning, Design Space Exploration, Hardware-Aware Design, Distributed Inference, Deep Neural Networks
National Category
Computer Vision and Learning Systems Other Electrical Engineering, Electronic Engineering, Information Engineering Embedded Systems
Identifiers
urn:nbn:se:miun:diva-56427 (URN)978-91-90017-54-8 (ISBN)
Public defence
2026-02-18, L111, Holmgatan 10, Sundsvall, 09:00 (English)
Opponent
Supervisors
Projects
Research profile NIIT
Funder
Knowledge Foundation, 20180170
2026-01-222026-01-212026-01-22Bibliographically approved