Mid Sweden University

Clustering: Hierarchical, k-Means, DBSCAN
Mid Sweden University, Faculty of Human Sciences, Department of Economics, Geography, Law and Tourism. ORCID iD: 0000-0003-3964-2716
University of Applied Sciences Ravensburg-Weingarten.
2022 (English). In: Applied Data Science in Tourism: Interdisciplinary Approaches, Methodologies, and Applications / [ed] Roman Egger, Cham: Springer Nature, 2022, p. 129-149. Chapter in book (Refereed)
Abstract [en]

This chapter will discuss the unsupervised machine learning technique known as clustering and its main approaches and use cases. After presenting typical application areas for the tourism industry, the mathematical principle of clustering will be explained. Various techniques for representing differences between cases or clusters will be introduced, and major methods used to form clusters based on these differences will be presented (i.e., single linkage, complete linkage, average linkage, and centroid). Subsequently, the three most widely applied clustering approaches will be described. First, major concepts of hierarchical clustering, like divisive and agglomerative techniques, will be highlighted. Second, the partitioning technique k-means will be introduced, and, third, DBSCAN (Density-Based Spatial Clustering of Applications with Noise) will be discussed. By using real tourism data and the data science platform RapidMiner, the practical demonstration will then explain step-by-step how clustering approaches can be executed. After employing typical processes for data transformation and normalization, RapidMiner processes for k-means, hierarchical clustering, and DBSCAN will be shown, and the clustering results will be discussed. Lastly, a tourism case applying k-means and DBSCAN to identify points of interest based on uploaded photo data extracted from the platform Flickr will conclude the chapter.
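
As a rough illustration of the four cluster-distance methods named in the abstract (single linkage, complete linkage, average linkage, and centroid), the following minimal Python sketch computes each one for two toy clusters. It is not taken from the chapter, which uses RapidMiner; the Euclidean metric, the function names, and the sample points are all assumptions chosen for illustration.

```python
from itertools import product
from math import dist  # Euclidean distance, available since Python 3.8


def single_linkage(a, b):
    """Distance between the two closest points of clusters a and b."""
    return min(dist(p, q) for p, q in product(a, b))


def complete_linkage(a, b):
    """Distance between the two farthest points of clusters a and b."""
    return max(dist(p, q) for p, q in product(a, b))


def average_linkage(a, b):
    """Mean of all pairwise distances between points of a and points of b."""
    return sum(dist(p, q) for p, q in product(a, b)) / (len(a) * len(b))


def centroid(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def centroid_linkage(a, b):
    """Distance between the centroids of clusters a and b."""
    return dist(centroid(a), centroid(b))


# Hypothetical toy clusters in 2D (illustrative values, not tourism data).
cluster_a = [(0.0, 0.0), (0.0, 2.0)]
cluster_b = [(3.0, 0.0), (3.0, 4.0)]

for name, fn in [
    ("single", single_linkage),
    ("complete", complete_linkage),
    ("average", average_linkage),
    ("centroid", centroid_linkage),
]:
    print(f"{name:>8}: {fn(cluster_a, cluster_b):.4f}")
```

An agglomerative procedure would repeatedly merge the two clusters with the smallest such distance, so the choice of method changes which clusters are merged first.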

Place, publisher, year, edition, pages
Cham: Springer Nature, 2022. p. 129-149
Series
Tourism on the Verge, ISSN 2366-2611, E-ISSN 2366-262X
Keywords [en]
Clustering techniques, Hierarchical, k-Means, DBSCAN, RapidMiner, tourism case study
National Category
Business Administration
Identifiers
URN: urn:nbn:se:miun:diva-44730
DOI: 10.1007/978-3-030-88389-8_8
ISBN: 978-3-030-88388-1 (print)
ISBN: 978-3-030-88389-8 (electronic)
OAI: oai:DiVA.org:miun-44730
DiVA, id: diva2:1648288
Available from: 2022-03-30 Created: 2022-03-30 Last updated: 2022-04-01. Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Fuchs, Matthias
