End user interface for collecting and evaluating company data: Real-time data collection through web-scraping
Dalarna University, School of Information and Engineering.
Dalarna University, School of Information and Engineering.
2021 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis.
Abstract [en]

The demand for open and reliable data in the era of Big Data is constantly increasing: research is becoming more diverse, and trustworthy, high-quality data considerably increases the quality of the findings. However, it is very hard to obtain reliable data for free and with little effort. Given the immense progress in tools for data scraping, data cleansing, and data storing on the one hand, and the many platforms whose data can be scraped on the other, it is crucial to make use of them to build data sets of real, trustworthy data easily, for free, and in a user-friendly way. Using several available tools, an application with a graphical user interface (GUI) was developed. The application can: collect financial data for any given list of companies; update an existing data set; build a data set from the whole data warehouse (DW) based on several filters; make the data sets available to anyone who uses the application; and build simple visualizations of the existing data. To avoid the 'garbage in, garbage out' problem, the data quality is analysed continuously and adjusted so that the data is ready for use in a research project. The work provides a viable solution for collecting data and making it borderless while respecting the standards of data sharing. The application can collect data from 2 sources, with more than 250 features per company. It is being updated with more functionality and more data sources.

Place, publisher, year, edition, pages
2021.
National Category
Social Sciences Interdisciplinary
Identifiers
URN: urn:nbn:se:du-37740
OAI: oai:DiVA.org:du-37740
DiVA, id: diva2:1580335
Subject / course
Microdata Analysis
Available from: 2021-07-14 Created: 2021-07-14

Open Access in DiVA

fulltext (1520 kB), 326 downloads
File information
File name: FULLTEXT01.pdf
File size: 1520 kB
Checksum (SHA-512): 927ef6093d023e0a649b5aa221602e1e9f5ad4859ef785ac2fe6beb83a89d42e5bf54e369454ab01e9741ddd7fc5d6232ec5d7e1047b1435e200387bacfe155b
Type: fulltext
Mimetype: application/pdf
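
The SHA-512 checksum listed in the file information can be used to check that a downloaded copy of the full text is intact. The following is a minimal sketch, not part of the record itself: the function names and the local path in the usage note are assumptions, and only Python's standard hashlib module is used.

```python
import hashlib

# SHA-512 checksum as listed in the file information of this record.
EXPECTED_SHA512 = (
    "927ef6093d023e0a649b5aa221602e1e9f5ad4859ef785ac2fe6beb83a89d42e"
    "5bf54e369454ab01e9741ddd7fc5d6232ec5d7e1047b1435e200387bacfe155b"
)


def sha512_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-512 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, expected=EXPECTED_SHA512):
    """Hypothetical helper: True if the file's digest equals the listed one."""
    return sha512_of_file(path) == expected.lower()
```

For example, assuming the file was saved locally as FULLTEXT01.pdf, calling matches_listed_checksum("FULLTEXT01.pdf") returns True only if the download is byte-for-byte identical to the file the checksum was computed from.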
