An Empirical Investigation of The Effect of Proxy Response and The Merits of Its Remedial Measures
Dalarna University, School of Information and Engineering.
2023 (English)
Independent thesis, Advanced level (degree of Master (One Year)), 10 credits / 15 HE credits
Student thesis
Alternative title
En empirisk undersökning av effekten av proxysvar och fördelarna med dess korrigerande åtgärder (Swedish)
Abstract [en]

In the event of missing data, substituting data from proxy sources, when available, is usually considered a useful way to avoid the problem of missingness. Nonetheless, research has shown that this approach often induces “response bias”, which is known to vary considerably from study to study depending on what is being evaluated. As an extension of the study by Lapin et al. (2021), this study evaluates the effect of binary proxy responses under varying degrees of bias and the merits of their use in comparison with a few commonly used methods for handling missing data. Specific questions about the comparison of proxy information with self-responses, proxy bias, and decision making in the absence of self-responses under the missing at random (MAR) mechanism were examined. Three levels of bias (10%, 30%, and 50%) obtainable in a binary proxy response were investigated (proxy substitution) alongside a few commonly used remedial measures (complete case analysis, multiple imputation, and inverse probability treatment weighting). A Monte Carlo simulation experiment was conducted with a logistic regression model with three explanatory variables (of binary, discrete, and continuous data types). The experiment was conducted under different MAR mechanisms with varying sample sizes (100, 500, 1000, 5000, and 10000). The methods were compared using the mean square error (MSE) criterion and the relative MSE. The findings show that the performance of each method depends strongly on the sample size, the proportion of missing data under the MAR mechanism, the data type, and error-in-variable. However, in the absence of proxy responses, this study recommends inverse probability treatment weighting (IPTW), provided the sample size is large.
The findings of the simulation study were used in validating the results of an existing study conducted with data obtained from the Swedish National Board of Health and Welfare Survey (2017), which consisted of about 43% proxy responses.
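
The kind of experiment described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's code, and several details are assumptions made only for illustration: the binary outcome is taken to be the variable that goes missing, missingness depends only on the observed continuous covariate (one possible MAR mechanism), a proxy reports the wrong value with probability `bias`, and the coefficient values, covariate distributions, and function names (`fit_logistic`, `simulate_once`, `run_experiment`) are hypothetical. Only complete case analysis and proxy substitution are compared; multiple imputation and IPTW are left out to keep the sketch short.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def fit_logistic(X, y, n_iter=25):
    """Maximum-likelihood logistic regression via Newton-Raphson.

    X must already contain an intercept column.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = sigmoid(X @ beta)
        w = p * (1.0 - p)
        grad = X.T @ (y - p)
        # Tiny ridge term only guards against a numerically singular Hessian.
        hess = X.T @ (X * w[:, None]) + 1e-8 * np.eye(X.shape[1])
        beta = beta + np.linalg.solve(hess, grad)
    return beta


def simulate_once(rng, n, beta_true, bias):
    """One replication: returns (complete-case, proxy-substitution) estimates."""
    x1 = rng.binomial(1, 0.5, n)      # binary covariate (assumed distribution)
    x2 = rng.poisson(2.0, n)          # discrete covariate (assumed distribution)
    x3 = rng.normal(0.0, 1.0, n)      # continuous covariate (assumed distribution)
    X = np.column_stack([np.ones(n), x1, x2, x3])
    y = rng.binomial(1, sigmoid(X @ beta_true))

    # Assumed MAR mechanism: missingness of y depends only on the observed x3.
    missing = rng.random(n) < sigmoid(-1.0 + x3)

    # Assumed proxy model: the proxy flips the true response with prob. `bias`.
    flip = rng.random(n) < bias
    y_proxy = np.where(flip, 1 - y, y)
    y_sub = np.where(missing, y_proxy, y)

    beta_cc = fit_logistic(X[~missing], y[~missing])   # complete case analysis
    beta_ps = fit_logistic(X, y_sub)                   # proxy substitution
    return beta_cc, beta_ps


def run_experiment(n=2000, reps=100, bias=0.5, seed=0):
    """Monte Carlo MSE of each coefficient for both methods."""
    rng = np.random.default_rng(seed)
    beta_true = np.array([-0.5, 0.8, 0.2, -0.6])       # hypothetical true values
    err_cc = np.empty((reps, beta_true.size))
    err_ps = np.empty((reps, beta_true.size))
    for r in range(reps):
        b_cc, b_ps = simulate_once(rng, n, beta_true, bias)
        err_cc[r] = b_cc - beta_true
        err_ps[r] = b_ps - beta_true
    return {
        "complete_case": (err_cc ** 2).mean(axis=0),
        "proxy_substitution": (err_ps ** 2).mean(axis=0),
    }


if __name__ == "__main__":
    for bias in (0.1, 0.3, 0.5):
        mse = run_experiment(n=1000, reps=50, bias=bias)
        # Relative MSE here is taken against complete case analysis (an assumption).
        rel = mse["proxy_substitution"] / mse["complete_case"]
        print(bias, np.round(rel, 2))
```

Under these assumptions, looping over the sample sizes and bias levels named in the abstract would give one MSE table per method; the thesis itself may define the missingness mechanisms, the proxy error model, and the relative MSE differently.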

Place, publisher, year, edition, pages
2023.
Keywords [en]
missing data, proxy response, missingness mechanism, error-in-variable, logistic regression, Monte-Carlo simulation, missing at random, complete case analysis, proxy substitution, multiple imputation, inverse probability treatment weighting
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:du-45457
OAI: oai:DiVA.org:du-45457
DiVA, id: diva2:1736758
Subject / course
Microdata Analysis
Available from: 2023-02-14 Created: 2023-02-14 Last updated: 2025-10-09

Open Access in DiVA

fulltext (628 kB), 376 downloads
File information
File name: FULLTEXT01.pdf
File size: 628 kB
Checksum (SHA-512): bd5b4bea897de592c5250c19e1fdceaea619dc69314b27ae8ad20efbedd76aa414fc0245dc2cdcb46b621a69e919f5d4ed30f4f62a6625c937b2ebc9670d13be
Type: fulltext
Mimetype: application/pdf
