Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Measurement Quality in Acoustic Sensing with Microphones: From Indoor Localization to Heart Sound Classification
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-). (AISound)ORCID iD: 0000-0002-8253-7535
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

This thesis investigates how measurement design shapes acoustic source localization and classification, with a focus on the interplay between array geometry, device characteristics, and modern signal processing and deep learning. The work is motivated by a persistent gap between theoretically well-understood methods and the practical realities of indoor positioning and biomedical auscultation, where sensor variability, reverberation, and limited control over operating conditions often dominate performance. The overarching aim is to understand how measurement quality in microphone-based sensing constrains and enables what can be inferred from sound under real-world noise, by treating microphones, arrays, and recording protocols as design variables rather than static background assumptions.

Six studies (P1–P6 refer to the list of papers) are presented. The first line of work concerns acoustic fingerprinting. P1 examines how far a single microphone can exploit ambient noise for indoor “silent” object localization, highlighting both the appeal of zero-emission fingerprints and their sensitivity to day-to-day room changes. P6 revisits fingerprinting with active excitation, using exponential sine sweeps and a four-microphone array feeding a convolutional neural network. The comparison between P1 and P6 shows how moving from uncontrolled ambient sound to controlled probing and array-based features improves robustness. Together, they characterize a practical design space for silent object localization, from simple cross-correlation baselines to array-aided deep learning.

The second line of work addresses direction-of-arrival (DoA) estimation with microphone arrays. P2 compares several planar layouts and microphone directivities in a controlled room, using a representative high-resolution DoA estimator to isolate how geometry and sensor pattern affect accuracy and robustness in realistic indoor conditions. P3 focuses on a six-channel uniform circular array and a coherent wideband pipeline, showing that circular-harmonic focusing can retain MUSIC-level resolution while keeping computational demands compatible with embedded implementations. These studies map how established methods behave when constrained by physically small arrays and practical sensor choices, clarifying when geometry or processing is the main bottleneck.

A third line of work turns to biomedical acoustic classification. P4 evaluates a four-channel electronic stethoscope prototype that combines delay-and-sum beamforming and matched filtering for heart-sound segmentation before classification. Working with a limited and clinically constrained dataset, the study illustrates how a realistic multi-channel auscultation setup can increase segment quality and support distinguish normal and abnormal sound for murmur detection. Finally, the thesis examines measurement quality more generally. P5 introduces a measurement quality pipeline that uses existing recordings to extrapolate the benefit of future system upgrades. By fixing a pretrained CNN and synthetically degrading current data to different SNR levels, the study emulates the performance of improved setups, providing a basis for deciding whether it is worthwhile to invest in new measurements and a full round of model retraining and tuning. These results underline that model architecture and measurement quality jointly determine performance, and that metrological upgrades can sometimes deliver rich information without retraining.

Overall, the thesis contributes a set of measurement-driven case studies that make explicit how arrays, excitation signals, and device responses constrain what localization and classification algorithms can realistically achieve. The outcomes include practical recipes for acoustic fingerprinting, design reference points for compact array configurations in indoor DoA tasks, an experimentally grounded path toward reproducible multi-channel auscultation, and empirical guidelines for anticipating how SNR and device variability affect pretrained models. Rather than resolving all trade-offs, the work argues for treating measurement design and algorithm choice as coupled problems.

Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University , 2026. , p. 67
Series
Mid Sweden University doctoral thesis, ISSN 1652-893X ; 446
National Category
Engineering and Technology Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:miun:diva-56463ISBN: 978-91-90017-57-9 (print)OAI: oai:DiVA.org:miun-56463DiVA, id: diva2:2032254
Public defence
2026-02-25, M108, Holmgatan 10, Sundsvall, 09:00 (English)
Opponent
Supervisors
Note

Vid tidpunkten för disputationen var följande delarbeten opublicerade: delarbete 6 accepterat.

At the time of the doctoral defence the following papers were unpublished: paper 6 accepted.

Available from: 2026-01-28 Created: 2026-01-26 Last updated: 2026-01-28Bibliographically approved
List of papers
1. Indoor Silent Object Localization using Ambient Acoustic Noise Fingerprinting
Open this publication in new window or tab >>Indoor Silent Object Localization using Ambient Acoustic Noise Fingerprinting
Show others...
2020 (English)In: 2020 IEEE International Instrumentation and Measurement Technology Conference (I2MTC), IEEE, 2020Conference paper, Published paper (Refereed)
Abstract [en]

Indoor localization has been a popular research subject in recent years. Usually, object localization using sound involves devices on the objects, acquiring data from stationary sound sources, or by localizing the objects with external sensors when the object generates sounds. Indoor localization systems using microphones have traditionally also used systems with several microphones, setting the limitations on cost efficiency and required space for the systems. In this paper, the goal is to investigate whether it is possible for a stationary system to localize a silent object in a room, with only one microphone and ambient noise as information carrier. A subtraction method has been combined with a fingerprint technique, to define and distinguish the noise absorption characteristic of the silent object in the frequency domain for different object positions. The absorption characteristics of several positions of the object is taken as comparison references, serving as fingerprints of known positions for an object. With the experiment result, the tentative idea has been verified as feasible, and noise signal based lateral localization of silent objects can be achieved.

Place, publisher, year, edition, pages
IEEE, 2020
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:miun:diva-39460 (URN)10.1109/I2MTC43012.2020.9129086 (DOI)2-s2.0-85088298769 (Scopus ID)978-1-7281-4460-3 (ISBN)
Conference
2020 IEEE International Instrumentation and Measurement Technology Conference (I2MTC)
Available from: 2020-07-06 Created: 2020-07-06 Last updated: 2026-01-26Bibliographically approved
2. Performance Comparison of Omni and Cardioid Directional Microphones for Indoor Angle of Arrival Sound Source Localization
Open this publication in new window or tab >>Performance Comparison of Omni and Cardioid Directional Microphones for Indoor Angle of Arrival Sound Source Localization
Show others...
2022 (English)In: Conference Record - IEEE Instrumentation and Measurement Technology Conference, IEEE, 2022Conference paper, Published paper (Refereed)
Abstract [en]

The sound source localization technology brings the possibility of mapping the sound source positions. In this paper, angle-of-arrival (AOA) has been chosen as the method for achieving sound source localization in an indoor enclosed environment. The dynamic environment and reverberations bring a challenge for AOA-based systems for such applications. By the acknowledgement of microphone directionality, the cardioid-directional microphone systems have been chosen for the localization performance comparison with omni-directional microphone systems, in order to investigate which microphone is superior in AOA indoor sound source localization. To reduce the hardware complexity, the number of microphones used during the experiment has been limited to 4. A localization improvement has been proposed with a weighting factor. The comparison has been done for both types of microphones with 3 different array manifolds under the same system setup. The comparison shows that the cardioid-directional microphone system has an overall higher accuracy. 

Place, publisher, year, edition, pages
IEEE, 2022
Keywords
angle of arrival, array manifold, cardioid microphone, sound source localization
National Category
Computer Sciences Signal Processing
Identifiers
urn:nbn:se:miun:diva-45756 (URN)10.1109/I2MTC48687.2022.9806559 (DOI)000844585400090 ()2-s2.0-85134427845 (Scopus ID)9781665483605 (ISBN)
Conference
2022 IEEE International Instrumentation and Measurement Technology Conference, I2MTC 2022, 16 May 2022 through 19 May 2022
Available from: 2022-08-03 Created: 2022-08-03 Last updated: 2026-01-26Bibliographically approved
3. A Coherent Wideband Acoustic Source Localization Using a Uniform Circular Array
Open this publication in new window or tab >>A Coherent Wideband Acoustic Source Localization Using a Uniform Circular Array
Show others...
2023 (English)In: Sensors, E-ISSN 1424-8220, Vol. 23, no 11, article id 5061Article in journal (Refereed) Published
Abstract [en]

In modern applications such as robotics, autonomous vehicles, and speaker localization, the computational power for sound source localization applications can be limited when other functionalities get more complex. In such application fields, there is a need to maintain high localization accuracy for several sound sources while reducing computational complexity. The array manifold interpolation (AMI) method applied with the Multiple Signal Classification (MUSIC) algorithm enables sound source localization of multiple sources with high accuracy. However, the computational complexity has so far been relatively high. This paper presents a modified AMI for uniform circular array (UCA) that offers reduced computational complexity compared to the original AMI. The complexity reduction is based on the proposed UCA-specific focusing matrix which eliminates the calculation of the Bessel function. The simulation comparison is done with the existing methods of iMUSIC, the Weighted Squared Test of Orthogonality of Projected Subspaces (WS-TOPS), and the original AMI. The experiment result under different scenarios shows that the proposed algorithm outperforms the original AMI method in terms of estimation accuracy and up to a 30% reduction in computation time. An advantage offered by this proposed method is the ability to implement wideband array processing on low-end microprocessors.

Place, publisher, year, edition, pages
MDPI, 2023
Keywords
array manifold interpolation, direction of arrival estimation, wideband sources
National Category
Signal Processing
Identifiers
urn:nbn:se:miun:diva-48473 (URN)10.3390/s23115061 (DOI)001005309700001 ()37299788 (PubMedID)2-s2.0-85161608613 (Scopus ID)
Available from: 2023-06-12 Created: 2023-06-12 Last updated: 2026-01-26Bibliographically approved
4. Improving Cardiac Auscultation Signal Quality by using 4-Channel Stethoscope Array
Open this publication in new window or tab >>Improving Cardiac Auscultation Signal Quality by using 4-Channel Stethoscope Array
Show others...
2024 (English)In: Conference Record - IEEE Instrumentation and Measurement Technology Conference, IEEE conference proceedings, 2024Conference paper, Published paper (Refereed)
Abstract [en]

In cardiac auscultation, the ability to clearly hear any existing murmur sounds in heart sounds is crucial for proper diagnosis. This work aims to improve heart sound by the use of a stethoscope array and beamforming technique. The stethoscope array comprises four piezo elements for measurement, placed on the edges of a 40mm by 40mm rectangle. The directionality of the piezo elements reduces the effect of ambient noise in the measurement. The signal amelioration is achieved by isolating the systole and diastole sounds, and independently applying the delay-and-sum beamforming. This thereby makes any existing murmur sounds in the systole and/or diastole more audible and clearer to aid diagnosis. Finally, the designed stethoscope array and signal processing shows a gain of up to 33% for measured healthy heart samples, and up to 63% increase in murmur sound gain for measured sample with medically confirmed murmur. 

Place, publisher, year, edition, pages
IEEE conference proceedings, 2024
Keywords
acoustic beam steering, auscultation, auscultation signal processing, stethoscope, stethoscope array
National Category
Medical Laboratory Technologies
Identifiers
urn:nbn:se:miun:diva-52055 (URN)10.1109/I2MTC60896.2024.10560871 (DOI)001261521400174 ()2-s2.0-85197746698 (Scopus ID)9798350380903 (ISBN)
Conference
Conference Record - IEEE Instrumentation and Measurement Technology Conference
Available from: 2024-08-07 Created: 2024-08-07 Last updated: 2026-01-26Bibliographically approved
5. Accuracy Impact of Increased Measurement Quality when using Pretrained Networks for Classification
Open this publication in new window or tab >>Accuracy Impact of Increased Measurement Quality when using Pretrained Networks for Classification
Show others...
2024 (English)In: Conference Record - IEEE Instrumentation and Measurement Technology Conference, IEEE conference proceedings, 2024Conference paper, Published paper (Refereed)
Abstract [en]

The field of Metrology has seen great use of Machine Learning and Deep Learning models, improving existing Metrology and enabling measurements and estimations that were previously not possible. In the challenging task of gathering training data in various areas of Metrology, a question arises; is it necessary to gather a completely new dataset every time a quality upgrade is done to a measurement system, method or model, or can the formerly trained model be used for new data with higher Signal-to-Noise Ratio (SNR)? This paper investigates how trained neural networks react to new data coming into the testing, with a higher SNR than the training data. In the experiments, Convolutional Neural Networks (CNN), in 1D and 2D, are used on heart sound data, as a test case. The initial results show that the classification accuracy for the new data, with a higher SNR, coming into the 1D CNN is almost as high as if the network had been trained on the higher SNR data. For a 2D CNN working with spectrograms instead of time series data, the change in accuracy is not nearly as high, as the 2D CNN model seems more robust to noise differences. 

Place, publisher, year, edition, pages
IEEE conference proceedings, 2024
Keywords
Classification, Data Quality, Deep Learning, Machine Learning, Neural Networks, Pretrained, SNR
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:miun:diva-52050 (URN)10.1109/I2MTC60896.2024.10561016 (DOI)001261521400247 ()2-s2.0-85197767566 (Scopus ID)9798350380903 (ISBN)
Conference
Conference Record - IEEE Instrumentation and Measurement Technology Conference
Available from: 2024-08-07 Created: 2024-08-07 Last updated: 2026-01-26Bibliographically approved

Open Access in DiVA

fulltext(3920 kB)36 downloads
File information
File name FULLTEXT01.pdfFile size 3920 kBChecksum SHA-512
aab6fa538fbe0aa49869918be5361ddc640c96d6055afd2b33ab75f1f26ec13cdd7ad4c3b0d5b8c1120f005174cf1d31c25a521eb10647e08663e5d9c2db282a
Type fulltextMimetype application/pdf

Authority records

Jiang, Meng

Search in DiVA

By author/editor
Jiang, Meng
By organisation
Department of Computer and Electrical Engineering (2023-)
Engineering and TechnologyElectrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 4805 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf