Mid Sweden University

miun.sePublications
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rotational Invariant Object Recognition for Robotic Vision
Mid Sweden University, Faculty of Science, Technology and Media, Department of Electronics Design.
Mid Sweden University, Faculty of Science, Technology and Media, Department of Electronics Design.
Mid Sweden University, Faculty of Science, Technology and Media, Department of Electronics Design.
2019 (English)In: ICACR 2019 Proceedings of the 2019 3rd International Conference on Automation, Control and Robots, ACM Digital Library, 2019, p. 1-6Conference paper, Published paper (Refereed)
Abstract [en]

Depth cameras have enhanced the environment perception for robotic applications significantly. They allow to measure true distances and thus enable a 3D measurement of the robot surroundings. In order to enable robust robot vision, the objects recognition has to handle rotated data because object can be viewed from different dynamic perspectives when the robot is moving. Therefore, the 3D descriptors used of object recognition for robotic applications have to be rotation invariant and implementable on the embedded system, with limited memory and computing resources. With the popularization of the depth cameras, the Histogram of Gradients (HOG) descriptor has been extended to recognize also 3D volumetric objects (3DVHOG). Unfortunately, both version are not rotation invariant. There are different methods to achieve rotation invariance for 3DVHOG, but they increase significantly the computational cost of the overall data processing. Hence, they are unfeasible to be implemented in a low cost processor for real-time operation. In this paper, we propose an object pose normalization method to achieve 3DVHOG rotation invariance while reducing the number of processing operations as much as possible. Our method is based on Principal Component Analysis (PCA) normalization. We tested our method using the Princeton Modelnet10 dataset.

Place, publisher, year, edition, pages
ACM Digital Library, 2019. p. 1-6
Keywords [en]
3D Object Recognition, Histogram of Gradients, Princeton Modelnet10, Principal Component Analysis, Pose Normalization, Image Processing, Depth Camera
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:miun:diva-37973DOI: 10.1145/3365265.3365273ISI: 000651280300001Scopus ID: 2-s2.0-85117541596ISBN: 978-1-4503-7288-6 (electronic)OAI: oai:DiVA.org:miun-37973DiVA, id: diva2:1377512
Conference
2019 3rd International Conference on Automation, Control and Robots, Prague, Czech Republic, 11-13 October, 2019
Available from: 2019-12-12 Created: 2019-12-12 Last updated: 2022-06-01Bibliographically approved
In thesis
1. Semi-Autonomous Navigation of Powered Wheelchairs: 2D/3D Sensing and Positioning Methods
Open this publication in new window or tab >>Semi-Autonomous Navigation of Powered Wheelchairs: 2D/3D Sensing and Positioning Methods
2021 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Autonomous driving and assistance systems have become a reality for the automotive industry to improve driving safety in the car. Hence, the cars use a variety of sensors, cameras and image processing techniques to measure their surroundings and control their direction, braking and speed for obstacle avoidance or autonomously driving applications.Like the automotive industry, powered wheelchairs also require safety systems to ensure their operation, especially when the user has controlling limitations, but also to develop new applications to improve its usability. One of the applications is focused on developing a new contactless control of a powered wheelchair using the position of a caregiver beside it as a control reference. Contactless control can prevent control errors, but it can also provide better and more equal communication between the wheelchair user and the caregiver

This thesis evaluates the camera requirements for a contactless powered wheelchair control and the 2D/3D image processing techniques for caregiver recognition and position measurement beside the powered wheelchair. The research evaluates the strength and limitations of different depth camera technologies for caregiver feet detection above the ground plane to select the proper camera for the application. Then, a hand-crafted 3D object descriptor is evaluated for caregiver feet recognition and compared with respect to a state-of-the-art deep learning object detector. Results for both methods are good, however, the hand-crafted descriptor suffers from segmentation errors and consequently, their accuracy is lower. After the depth camera and image processing techniques evaluation, results show that it is possible to use only an RGB camera to recognize and measure his or her relative position.

Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University, 2021. p. 64
Series
Mid Sweden University doctoral thesis, ISSN 1652-893X
Keywords
3D object recognition, YOLO, YOLO-Tiny, 3DHOG, Histogram-of-Oriented-Gradients, ModelNet40, Feature descriptor, Intel RealSense, Depth camera, Wheelchair
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:miun:diva-43829 (URN)978-91-89341-32-6 (ISBN)
Public defence
2021-12-09, O102, Mittuniversitetet, Sundsvall, 16:06 (English)
Opponent
Supervisors
Note

Vid tidpunkten för disputationen var följande delarbeten opublicerade: delarbete 5 inskickat.

At the time of the doctoral defence the following papers were unpublished: paper 5 submitted.

Available from: 2021-11-24 Created: 2021-11-23 Last updated: 2025-03-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Vilar, CristianKrug, SilviaThörnberg, Benny

Search in DiVA

By author/editor
Vilar, CristianKrug, SilviaThörnberg, Benny
By organisation
Department of Electronics Design
Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 188 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf