Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Immersive Video Coding: Should Geometry Information be Transmitted as Depth Maps?
Show others and affiliations
2021 (English)In: IEEE transactions on circuits and systems for video technology (Print), ISSN 1051-8215, E-ISSN 1558-2205Article in journal (Refereed) Published
Abstract [en]

Immersive video often refers to multiple views with texture and scene geometry information, from which different viewports can be synthesized on the client side. To design efficient immersive video coding solutions, it is desirable to minimize bitrate, pixel rate and complexity. We investigate whether the classical approach of sending the geometry of a scene as depth maps is appropriate to serve this purpose. Previous work shows that bypassing depth transmission entirely and estimating depth at the client side improves the synthesis performance while saving bitrate and pixel rate. In order to understand if the encoder side depth maps contain information that is beneficial to be transmitted, we first explore a hybrid approach which enables partial depth map transmission using a block-based RD-based decision in the depth coding process. This approach reveals that partial depth map transmission may improve the rendering performance but does not present a good compromise in terms of compression efficiency. This led us to address the remaining drawbacks of decoder side depth estimation: complexity and depth map inaccuracy. We propose a novel system that takes advantage of high quality depth maps at the server side by encoding them into lightweight features that support the depth estimator at the client side. These features allow reducing the amount of data that has to be handled during decoder side depth estimation by 88%, which significantly speeds up the cost computation and the energy minimization of the depth estimator. Furthermore, -46.0% and -37.9% average synthesis BD-Rate gains are achieved compared to the classical approach with depth maps estimated at the encoder. IEEE

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc. , 2021.
Keywords [en]
Bit rate, Complexity theory, decoder side depth estimation, Decoding, Encoding, Estimation, Feature-Driven Depth Estimation, Immersive Video, MPEG, Transform coding, Visualization, Geometry, Pixels, Signal encoding, Textures, Transmissions, Classical approach, Compression efficiency, Depth Estimation, Energy minimization, Geometry information, Hybrid approach, Multiple views, Rendering performance, Video signal processing
Identifiers
URN: urn:nbn:se:miun:diva-43449DOI: 10.1109/TCSVT.2021.3100006Scopus ID: 2-s2.0-85112600542OAI: oai:DiVA.org:miun-43449DiVA, id: diva2:1603737
Available from: 2021-10-18 Created: 2021-10-18 Last updated: 2021-10-18Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Guillemot, C.
In the same journal
IEEE transactions on circuits and systems for video technology (Print)

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 26 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf