Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A method for document image enhancement to improve template-based classification
Donghua University, Shanghai, China.
Donghua University, Shanghai, China.
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Systems and Technology.
2020 (English)In: ACM International Conference Proceeding Series, ACM Digital Library, 2020, p. 87-91Conference paper, Published paper (Refereed)
Abstract [en]

Document classification is one of the significant procedure in paper document recognition. This article proposed a method for document image enhancement to improve the performance of classification in the convolutional neural network. An enhanced document image was generated by extracting the table frame, text region, and shape of the raw document. The template-based classification experiment on 414 customs documents and more than one thousand generated images showed the enhanced image could help CNN model achieve higher accuracies compared to the original images. It could also diminish the interference of noise and unrelated features in document classification optimizing the robustness of networks. The proposed method also demonstrated the channels of the image could provide more information except for color in deep neural networks. As the similarity in the whole image classification tasks, the conclusion might provide ideas for the training of the neural networks in other fields such as street view recognition, medical image recognition, etc. 

Place, publisher, year, edition, pages
ACM Digital Library, 2020. p. 87-91
Keywords [en]
convolutional neural network, document recognition, image classification, Image enhancement
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:miun:diva-39947DOI: 10.1145/3409501.3409531Scopus ID: 2-s2.0-85090881891ISBN: 9781450375603 (print)OAI: oai:DiVA.org:miun-39947DiVA, id: diva2:1470249
Conference
4th High Performance Computing and Cluster Technologies Conference, HPCCT 2020 and the 3rd International Conference on Big Data and Artificial Intelligence, BDAI 2020, 3 July 2020 through 6 July 2020
Available from: 2020-09-24 Created: 2020-09-24 Last updated: 2020-09-24Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Zhang, Tingting

Search in DiVA

By author/editor
Zhang, Tingting
By organisation
Department of Information Systems and Technology
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 69 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf