Subjective Evaluation of an Edge-based Depth Image Compression Scheme
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information and Communication Systems. (Realistic3D)
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information and Communication Systems. (Realistic3D) ORCID iD: 0000-0003-3751-6089
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information and Communication Systems. (Realistic3D)
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information and Communication Systems. (Realistic3D)
2013 (English). In: Proceedings of SPIE - The International Society for Optical Engineering: Stereoscopic Displays and Applications XXIV, SPIE - International Society for Optical Engineering, 2013, Art. no. 86480D. Conference paper, Published paper (Refereed)
Abstract [en]

Multi-view three-dimensional television requires many views, which may be synthesized from two-dimensional images with accompanying pixel-wise depth information. This depth image, which typically consists of smooth areas and sharp transitions at object borders, must be consistent with the acquired scene in order for synthesized views to be of good quality. We have previously proposed a depth image coding scheme that preserves significant edges and encodes the smooth areas between them. An objective evaluation based on the structural similarity (SSIM) index of synthesized views demonstrated an advantage of the proposed scheme over the High Efficiency Video Coding (HEVC) intra mode in certain cases. However, there were some discrepancies between the outcomes of the objective evaluation and our visual inspection, which motivated this subjective study. The test was conducted according to Recommendation ITU-R BT.500-13 using the stimulus-comparison method. The results of the subjective test showed that the proposed scheme performs slightly better than HEVC, with statistical significance, at the majority of the tested bit rates for the given contents.
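As a rough illustration of how votes from a stimulus-comparison test might be checked for statistical significance, the sketch below computes a mean comparison score with a normal-approximation 95% confidence interval. It assumes a seven-grade comparison scale from -3 (much worse) to +3 (much better); the abstract does not state which scale or analysis the authors used, and the function names and vote values are hypothetical.

```python
import math
from statistics import mean, stdev


def comparison_summary(scores, z=1.96):
    """Return (mean, ci_low, ci_high) for a list of comparison votes.

    Assumption: votes lie on a -3..+3 comparison scale and the interval
    is the normal approximation mean +/- z * s / sqrt(N).
    """
    n = len(scores)
    if n < 2:
        raise ValueError("need at least two votes")
    m = mean(scores)
    half_width = z * stdev(scores) / math.sqrt(n)
    return m, m - half_width, m + half_width


def is_significant(scores, z=1.96):
    """True when the confidence interval excludes zero (no preference)."""
    _, low, high = comparison_summary(scores, z)
    return low > 0 or high < 0


# Hypothetical votes: positive means the proposed scheme looked better.
votes = [1, 1, 2, 0, 1, 1, 2, 1]
m, low, high = comparison_summary(votes)
print(f"mean={m:.3f}, CI=({low:.3f}, {high:.3f}), significant={is_significant(votes)}")
```

A positive mean whose interval excludes zero would indicate a significant preference for one scheme at that bit rate; an interval that straddles zero would not.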

Place, publisher, year, edition, pages
SPIE - International Society for Optical Engineering, 2013. Art. no. 86480D
Keywords [en]
Depth image compression, view synthesis, subjective test
National subject category
Signal Processing
Identifiers
URN: urn:nbn:se:miun:diva-18539
DOI: 10.1117/12.2003053
ISI: 000322737100011
Scopus ID: 2-s2.0-84878743350
Local ID: STC
ISBN: 978-081949421-4 (print)
OAI: oai:DiVA.org:miun-18539
DiVA, id: diva2:608481
Conference
24th IS&T/SPIE Stereoscopic Displays and Applications Conference, SD&A 2013; Burlingame, CA; United States; 4 February 2013 through 6 February 2013; Code 97281
Available from: 2013-02-27 Created: 2013-02-27 Last updated: 2017-08-22
Part of thesis
1. Coding of three-dimensional video content: Depth image coding by diffusion
2013 (English). Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Three-dimensional (3D) movies in theaters have become a massive commercial success in recent years, and it is likely that, with the advancement of display technologies and the production of 3D content, TV broadcasting in 3D will play an important role in home entertainment in the not too distant future. 3D video content contains at least two views from different perspectives, for the left and the right eye of the viewer. The amount of coded information is doubled if these views are encoded separately. Moreover, for multi-view displays (i.e. displays that present different perspectives of a 3D scene to the viewer at the same time through different angles), either the video streams of all the required views must be transmitted to the receiver, or the display must synthesize the missing views from a subset of the views. The latter approach has been widely proposed to reduce the amount of data being transmitted. The virtual views can be synthesized by the Depth Image Based Rendering (DIBR) approach from textures and associated depth images. However, the amount of information for the textures plus the depths still presents a significant challenge for the network transmission capacity. Efficient compression will therefore increase the availability of content and provide better video quality under the same network capacity constraints.

In this thesis, the compression of depth images is addressed. These depth images can be assumed to be piecewise smooth. Starting from the properties of depth images, a novel depth image model based on edges and sparse samples is presented, which may also be utilized for depth image post-processing. Based on this model, a depth image coding scheme that explicitly encodes the locations of depth edges is proposed; the coding scheme has a scalable structure. Furthermore, a compression scheme for block-based 3D-HEVC is also devised, in which diffusion is used for intra prediction. In addition to the proposed schemes, the thesis illustrates several evaluation methodologies, in particular subjective testing with the stimulus-comparison method. This method is suitable for evaluating the quality of two impaired images, as the objective metrics are inaccurate with respect to synthesized views.
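To make the idea of a model built from edges and sparse samples more concrete, the following minimal sketch reconstructs one row of a piecewise-smooth depth image by simple diffusion: known samples stay fixed, unknown pixels are repeatedly replaced by the average of their neighbours, and no averaging takes place across an edge. This is a generic illustration and not the thesis's actual model or codec; the function name, the one-dimensional setting, and the sample values are all assumptions, and every segment between edges is assumed to contain at least one known sample.

```python
def diffuse_row(known, edges, n, iters=500):
    """Fill a depth row of length n from sparse samples by diffusion.

    known: dict mapping pixel index -> known depth value (kept fixed).
    edges: set of indices i meaning an edge lies between pixel i and i + 1.
    """
    seed = sum(known.values()) / len(known)  # neutral starting value
    row = [known.get(i, seed) for i in range(n)]
    for _ in range(iters):
        new_row = row[:]
        for i in range(n):
            if i in known:
                continue  # known samples are never changed
            neighbours = []
            if i > 0 and (i - 1) not in edges:
                neighbours.append(row[i - 1])
            if i < n - 1 and i not in edges:
                neighbours.append(row[i + 1])
            if neighbours:
                new_row[i] = sum(neighbours) / len(neighbours)
        row = new_row
    return row


# Hypothetical row: one sample per side and an edge between pixels 3 and 4.
print(diffuse_row({0: 10.0, 7: 50.0}, {3}, 8))
# Without the edge, the same samples diffuse into a smooth ramp instead.
print(diffuse_row({0: 10.0, 7: 50.0}, set(), 8))
```

With the edge in place the two sides settle near their own sample values and the transition stays sharp, which is the property that matters for view synthesis at object borders.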

The MPEG test sequences were used for the evaluation. The results showed that virtual views synthesized from depth images post-processed with the proposed model are better than those synthesized from the original depth images. More importantly, the proposed coding schemes using such a model produced better synthesized views than the state-of-the-art schemes. As a result, the outcome of the thesis can lead to a better quality of 3DTV experience.

Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University, 2013. p. 36
Series
Mid Sweden University licentiate thesis, ISSN 1652-8948
National subject category
Engineering and Technology; Signal Processing
Identifiers
URN: urn:nbn:se:miun:diva-19087
Local ID: STC
ISBN: 978-91-87103-76-6
Archive number: STC
OAI: STC
Presentation
(English)
Available from: 2013-06-11 Created: 2013-06-06 Last updated: 2016-10-20 Bibliographically approved

Open Access in DiVA

Li_Subjective_evaluation (482 kB), 682 downloads
File information
File name: FULLTEXT01.pdf
File size: 482 kB
Checksum SHA-512:
826b06cac918f5f4fb4225976a0c2997868ad4512b878fb446950f8b2e6ef558e7493a76f594bdb13ca554d2222c694cec1fcbb253d18f8b84ad942186da6638
Type: fulltext
Mimetype: application/pdf


Authors

Li, Yun; Sjöström, Mårten; Jennehag, Ulf; Olsson, Roger; Tourancheau, Sylvain
