Measuring the pairwise association between biallelic loci, known as linkage disequilibrium (LD), is important for understanding genomic architecture. A plethora of association measures for two-by-two tables have been proposed in the literature. Besides the problem of choosing an appropriate measure, there is the problem of estimating it, which has been neglected in the literature. It needs to be emphasized that defining a measure and choosing an estimator for it are conceptually separate tasks. In this paper, we compare the performance of various estimators for the three popular LD measures D', r and Y in a simulation study for small to moderate sample sizes (N ≤ 500). The usual frequency plug-in estimators can lead to unreliable or undefined estimates. Estimators based on the computationally expensive volume measures have recently been proposed as a remedy for this well-known problem. We confirm that volume estimators have a lower expected mean square error than the naive plug-in estimators. However, they are outperformed by estimators that plug easy-to-calculate non-informative Bayesian probability estimates into the theoretical formulae for the measures. Fully Bayesian estimators with non-informative Dirichlet priors have comparable accuracy but are computationally more expensive. We recommend the use of non-informative Bayesian plug-in estimators based on Jeffreys' prior, in particular when dealing with SNP array data, where small table entries and table margins are likely to occur.
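The contrast between the naive plug-in estimator and a Jeffreys-type plug-in estimator can be illustrated with a minimal sketch. It rests on assumptions that are not taken from the paper: the textbook definitions of D', r and Yule's Y, a 2x2 haplotype count table ordered as (n11, n12, n21, n22), and the reading of "Jeffreys' prior plug-in" as the posterior mean under a Dirichlet(1/2, 1/2, 1/2, 1/2) prior, which adds 0.5 to every cell. The function names `ld_measures` and `plug_in_ld` are hypothetical, and the estimators actually studied in the paper may differ in detail.

```python
import math


def ld_measures(p11, p12, p21, p22):
    """Return (D', r, Y) for 2x2 haplotype probabilities (textbook definitions)."""
    p_a = p11 + p12          # frequency of the first allele at locus 1
    p_b = p11 + p21          # frequency of the first allele at locus 2
    d = p11 - p_a * p_b      # raw disequilibrium coefficient D

    # D' rescales D by its maximum absolute value given the margins.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max

    # r is the correlation coefficient of the two allele indicators.
    r = d / math.sqrt(p_a * (1 - p_a) * p_b * (1 - p_b))

    # Yule's Y, written with square roots of the cell products.
    concordant = math.sqrt(p11 * p22)
    discordant = math.sqrt(p12 * p21)
    y = (concordant - discordant) / (concordant + discordant)
    return d_prime, r, y


def plug_in_ld(counts, pseudocount=0.0):
    """Plug estimated cell probabilities into the LD formulae.

    pseudocount=0.0 gives the naive frequency plug-in estimator.
    pseudocount=0.5 is the assumed Jeffreys-type estimate: the posterior
    mean of the cell probabilities under a Dirichlet(1/2, ..., 1/2) prior.
    """
    total = sum(counts) + 4 * pseudocount
    probs = [(n + pseudocount) / total for n in counts]
    return ld_measures(*probs)


# Usage: a table with an empty margin, as can occur with rare alleles.
table = (20, 0, 0, 0)
try:
    print(plug_in_ld(table))                  # naive estimator
except ZeroDivisionError:
    print("naive plug-in estimate is undefined")
print(plug_in_ld(table, pseudocount=0.5))     # Jeffreys-type plug-in
```

In this sketch the naive estimator fails on a table with an empty margin because the denominators of all three measures vanish, whereas the pseudocount keeps every cell probability strictly positive, so all three measures stay defined.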
Projects: Genetical Statistics and Systems Biology
Publication type: Journal article
Journal: The International Journal of Biostatistics
Citation: The International Journal of Biostatistics 6(1): Article 1
Date Published: 2010
Created: 14th Sep 2020 at 13:13
Last updated: 7th Dec 2021 at 17:58