Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Adaptive Noise Resilient Keyword Spotting Using One-Shot Learning
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).ORCID iD: 0000-0002-2336-5390
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).ORCID iD: 0000-0002-8617-0435
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-).ORCID iD: 0000-0001-9572-3639
Show others and affiliations
2025 (English)In: 2025 IEEE 11th World Forum on Internet of Things (WF-IoT), IEEE conference proceedings, 2025, p. 1-6Conference paper, Published paper (Refereed)
Abstract [en]

Keyword spotting (KWS) is a key component of smart devices, enabling efficient and intuitive audio interaction. However, standard KWS systems deployed on embedded devices often suffer performance degradation under real-world operating conditions. Resilient KWS systems address this issue by enabling dynamic adaptation, with applications such as adding or replacing keywords, adjusting to specific users, and improving noise robustness. However, deploying resilient, standalone KWS systems with low latency on resource-constrained devices remains challenging due to limited memory and computational resources. This study proposes a low computational approach for continuous noise adaptation of pretrained neural networks used for KWS classification, requiring only 1-shot learning and one epoch. The proposed method was assessed using two pretrained models and three real-world noise sources at signal-to-noise ratios (SNRs) ranging from 24 to -3 dB. The adapted models consistently outperformed the pretrained models across all scenarios, especially at SNR≤18 dB, achieving accuracy improvements of 4.9% to 46.0%. These results highlight the efficacy of the proposed methodology while being lightweight enough for deployment on resource-constrained devices.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2025. p. 1-6
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:miun:diva-56251DOI: 10.1109/WF-IoT64238.2025.11270573ISBN: 979-8-3315-1522-5 (electronic)OAI: oai:DiVA.org:miun-56251DiVA, id: diva2:2020678
Conference
2025 IEEE 11th World Forum on Internet of Things (WF-IoT)
Available from: 2025-12-11 Created: 2025-12-11 Last updated: 2025-12-11Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Martinez Rau, LucianoNguyen Phuong Vu, QuynhZhang, YuxuanOelmann, BengtBader, Sebastian

Search in DiVA

By author/editor
Martinez Rau, LucianoNguyen Phuong Vu, QuynhZhang, YuxuanOelmann, BengtBader, Sebastian
By organisation
Department of Computer and Electrical Engineering (2023-)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 17 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf