plinkQC is a R/CRAN package for genotype quality control in genetic association studies. It makes PLINK basic statistics (e.g.missing genotyping rates per individual, allele frequencies per genetic marker) and relationship functions easily accessible from within R and allows for automatic evaluation of the results.
Full documentation is available at http://meyer-lab-cshl.github.io/plinkQC/.
plinkQC generates a per-individual and per-marker quality control report. A step-by-step guide on how to run these analyses can be found here.
Individuals and markers that fail the quality control can subsequently be removed with plinkQC to generate a new, clean dataset.
plinkQC facilitates an ancestry check for study individuals based on comparison to reference datasets. The processing of the reference datasets is documented in detail here.
Removal of individuals based on relationship status via plinkQC is optimised to retain as many individuals as possible in the study.
The current github version of plinkQC is: 0.3.3 and can be installed via
library(devtools)
install_github("meyer-lab-cshl/plinkQC")
The current CRAN version of plinkQC is: 0.3.2 and can be installed via
install.packages("plinkQC")
A log of version changes can be found here.
Meyer HV (2018) plinkQC: Genotype quality control in genetic association studies. doi:10.5281/zenodo.3373798