Mid Sweden University

Joint effects of depth-aiding augmentations and viewing positions on the quality of experience in augmented telepresence
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Systems and Technology. (Realistic 3D) ORCID iD: 0000-0002-4967-3033
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Systems and Technology. Division ICT-Acreo, RISE Research Institutes of Sweden. ORCID iD: 0000-0001-5060-9402
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Systems and Technology. (Realistic 3D)
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Systems and Technology. Mid Sweden University, Faculty of Science, Technology and Media, Department of Design.
2020 (English). In: Quality and User Experience, ISSN 2366-0139, E-ISSN 2366-0147, Vol. 5, p. 1-17. Article in journal (Refereed), Published
Abstract [en]

Virtual and augmented reality are increasingly prevalent in industrial applications, such as remote control of industrial machinery, due to recent advances in head-mounted display technologies and low-latency communications via 5G. However, the influence of augmentations and camera placement-based viewing positions on operator performance in telepresence systems remains unknown. In this paper, we investigate the joint effects of depth-aiding augmentations and viewing positions on the quality of experience for operators in augmented telepresence systems. A study was conducted with 27 non-expert participants using a real-time augmented telepresence system to perform a remote-controlled navigation and positioning task, with varied depth-aiding augmentations and viewing positions. The resulting quality of experience was analyzed via Likert opinion scales, task performance measurements, and simulator sickness evaluation. Results suggest that reducing the reliance on stereoscopic depth perception via camera placement significantly benefits operator performance and quality of experience. Conversely, the depth-aiding augmentations can partly mitigate the negative effects of inferior viewing positions. However, the viewing-position-based monoscopic and stereoscopic depth cues tend to dominate over cues based on augmentations. There is also a discrepancy between the participants’ subjective opinions on augmentation helpfulness and the observed effects of augmentation on positioning task performance.

Place, publisher, year, edition, pages
2020. Vol. 5, p. 1-17
Keywords [en]
Quality of Experience, Augmented Reality, Telepresence, Head Mounted Displays, Depth Perception
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:miun:diva-38413
DOI: 10.1007/s41233-020-0031-7
OAI: oai:DiVA.org:miun-38413
DiVA, id: diva2:1392857
Funder
European Regional Development Fund (ERDF), 20201888
Knowledge Foundation, 20160194
Available from: 2020-02-13 Created: 2020-02-13 Last updated: 2025-09-25 Bibliographically approved
In thesis
1. Augmented Telepresence based on Multi-Camera Systems: Capture, Transmission, Rendering, and User Experience
2021 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

 Observation and understanding of the world through digital sensors is an ever-increasing part of modern life. Systems of multiple sensors acting together have far-reaching applications in automation, entertainment, surveillance, remote machine control, and robotic self-navigation. Recent developments in digital camera, range sensor and immersive display technologies enable the combination of augmented reality and telepresence into Augmented Telepresence, which promises to enable more effective and immersive forms of interaction with remote environments.

The purpose of this work is to gain a more comprehensive understanding of how multi-sensor systems lead to Augmented Telepresence, and how Augmented Telepresence can be utilized for industry-related applications. On the one hand, the conducted research is focused on the technological aspects of multi-camera capture, rendering, and end-to-end systems that enable Augmented Telepresence. On the other hand, the research also considers the user experience aspects of Augmented Telepresence, to obtain a more comprehensive perspective on the application and design of Augmented Telepresence solutions.

This work addresses multi-sensor system design for Augmented Telepresence regarding four specific aspects ranging from sensor setup for effective capture to the rendering of outputs for Augmented Telepresence. More specifically, the following problems are investigated: 1) whether multi-camera calibration methods can reliably estimate the true camera parameters; 2) what the consequences are of synchronization errors in a multi-camera system; 3) how to design a scalable multi-camera system for low-latency, real-time applications; and 4) how to enable Augmented Telepresence from multi-sensor systems for mining, without prior data capture or conditioning. 

The first problem was addressed by conducting a comparative assessment of widely available multi-camera calibration methods. A special dataset was recorded, enforcing known constraints on camera ground-truth parameters to use as a reference for calibration estimates. The second problem was addressed by introducing a depth uncertainty model that links the pinhole camera model and synchronization error to the geometric error in the 3D projections of recorded data. The third problem was addressed empirically, by constructing a multi-camera system based on off-the-shelf hardware and a modular software framework. The fourth problem was addressed by proposing a processing pipeline of an augmented remote operation system for augmented and novel view rendering.

The calibration assessment revealed that target-based and certain target-less calibration methods are relatively similar in their estimations of the true camera parameters, with one specific exception. For high-accuracy scenarios, even commonly used target-based calibration approaches are not sufficiently accurate with respect to the ground truth. The proposed depth uncertainty model was used to show that converged multi-camera arrays are less sensitive to synchronization errors. The mean depth uncertainty of a camera system correlates with the rendered result in depth-based reprojection as long as the camera calibration matrices are accurate. The presented multi-camera system demonstrates a flexible, decentralized framework where data processing is possible in the camera, in the cloud, and on the data consumer's side. The multi-camera system is able to act as a capture testbed and as a component in end-to-end communication systems, because of the general-purpose computing and network connectivity support coupled with a segmented software framework. This system forms the foundation for the augmented remote operation system, which demonstrates the feasibility of real-time view generation by employing on-the-fly lidar de-noising and sparse depth upscaling for novel and augmented view synthesis.

In addition to the aforementioned technical investigations, this work also addresses the user experience impacts of Augmented Telepresence. The following two questions were investigated: 1) What is the impact of camera-based viewing position in Augmented Telepresence? 2) What is the impact of depth-aiding augmentations in Augmented Telepresence? Both are addressed through a quality of experience study with non-expert participants, using a custom Augmented Telepresence test system for a task-based experiment. The experiment design combines in-view augmentation, camera view selection, and stereoscopic augmented scene presentation via a head-mounted display to investigate both the independent factors and their joint interaction.

The results indicate that between the two factors, view position has a stronger influence on user experience. Task performance and quality of experience were significantly decreased by viewing positions that force users to rely on stereoscopic depth perception. However, position-assisting view augmentations can mitigate the negative effect of sub-optimal viewing positions; the extent of such mitigation is subject to the augmentation design and appearance.

In aggregate, the works presented in this dissertation cover a broad view of Augmented Telepresence. The individual solutions contribute general insights into Augmented Telepresence system design, complement gaps in the current discourse of specific areas, and provide tools for solving challenges found in enabling the capture, processing, and rendering in real-time-oriented end-to-end systems.

Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University, 2021. p. 70
Series
Mid Sweden University doctoral thesis, ISSN 1652-893X ; 345
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:miun:diva-41860 (URN)
978-91-89341-06-7 (ISBN)
Public defence
2021-05-17, C312, Mittuniversitetet Holmgatan 10, Sundsvall, 14:00 (English)
Available from: 2021-04-15 Created: 2021-04-15 Last updated: 2025-09-25 Bibliographically approved

Open Access in DiVA

fulltext (1759 kB), 994 downloads
File information
File name: FULLTEXT01.pdf
File size: 1759 kB
Checksum (SHA-512): d969389abbc13a068599ee7ff049b39c6f2326d5dd7bb2cdcb207604355ae6be93e6d8a1c1aa2fec09f724e810e32878b0963f4f2acdc121923de8cfab4096bf
Type: fulltext
Mimetype: application/pdf

Authority records

Dima, Elijs; Brunnström, Kjell; Sjöström, Mårten; Andersson, Mattias; Edlund, Joakim
