Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Robust image descriptor for machine learning based data reduction in serial crystallography
Mid Sweden University, Faculty of Science, Technology and Media, Department of Computer and Electrical Engineering (2023-). Deutsches Elektronen-Synchrotron (DESY).
2024 (English)In: Journal of applied crystallography, ISSN 0021-8898, E-ISSN 1600-5767, Vol. 57, no Pt 2, p. 413-430Article in journal (Refereed) Published
Abstract [en]

Serial crystallography experiments at synchrotron and X-ray free-electron laser (XFEL) sources are producing crystallographic data sets of ever-increasing volume. While these experiments have large data sets and high-frame-rate detectors (around 3520 frames per second), only a small percentage of the data are useful for downstream analysis. Thus, an efficient and real-time data classification pipeline is essential to differentiate reliably between useful and non-useful images, typically known as ‘hit’ and ‘miss’, respectively, and keep only hit images on disk for further analysis such as peak finding and indexing. While feature-point extraction is a key component of modern approaches to image classification, existing approaches require computationally expensive patch preprocessing to handle perspective distortion. This paper proposes a pipeline to categorize the data, consisting of a real-time feature extraction algorithm called modified and parallelized FAST (MP-FAST), an image descriptor and a machine learning classifier. For parallelizing the primary operations of the proposed pipeline, central processing units, graphics processing units and field-programmable gate arrays are implemented and their performances compared. Finally, MP-FAST-based image classification is evaluated using a multi-layer perceptron on various data sets, including both synthetic and experimental data. This approach demonstrates superior performance compared with other feature extractors and classifiers. 

Place, publisher, year, edition, pages
International Union of Crystallography (IUCr) , 2024. Vol. 57, no Pt 2, p. 413-430
Keywords [en]
data reduction, feature extraction, machine learning, serial crystallography
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:miun:diva-51210DOI: 10.1107/S160057672400147XScopus ID: 2-s2.0-85189932647OAI: oai:DiVA.org:miun-51210DiVA, id: diva2:1854017
Available from: 2024-04-24 Created: 2024-04-24 Last updated: 2024-04-24

Open Access in DiVA

fulltext(4695 kB)47 downloads
File information
File name FULLTEXT01.pdfFile size 4695 kBChecksum SHA-512
68881d0e3a322e75d544f53eca5d2331a3df1a7b6457c071d219aa707f1ba2d52a6f884edec5258e14a5912c1e3561ad0f94b1405852b3ec758e4c4ed21819de
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Graafsma, Heinz

Search in DiVA

By author/editor
Graafsma, Heinz
By organisation
Department of Computer and Electrical Engineering (2023-)
In the same journal
Journal of applied crystallography
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 47 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 135 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf