Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Improve Data Quality By Using Dependencies And Regular Expressions
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Systems and Technology.
2018 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functional dependencies and regular expressions to detect and correct data. Based on the former studies of data cleaning methods, this study considers the more complex conditions of database and combines the efficient algorithms to deal with the data. The study shows that by using these methods, the database’s quality can be improved and considering the complexity of time and space, there still has a lot of things to do to make the data cleaning process more efficiency.

Place, publisher, year, edition, pages
2018. , p. 50
Keywords [en]
data cleaning, data quality, condition functional dependency, regular expression
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:miun:diva-35620Local ID: DT-V18-A2-004OAI: oai:DiVA.org:miun-35620DiVA, id: diva2:1288074
Subject / course
Computer Engineering DT1
Supervisors
Examiners
Available from: 2019-02-12 Created: 2019-02-12 Last updated: 2019-02-12Bibliographically approved

Open Access in DiVA

fulltext(1247 kB)177 downloads
File information
File name FULLTEXT01.pdfFile size 1247 kBChecksum SHA-512
1b09d282e953395bd49741af65b80250eb2893d8c83338c65a7fd44bf6e47e6931f3c829761fa696d7d792404713d2c6cc06eb645288006416a1ad38e734e0dd
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Feng, Yuan
By organisation
Department of Information Systems and Technology
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 177 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 307 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf