-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Quality control of Base and Target Data #4
Comments
This describe the necessary quality control measures to perform polygenic risk scores. I have followed this BioRx guide. The issues to consider are as follows:
|
Quality control issues 1: Quality control -Our target data (UKBB) was created in PLINK, which, as per the BioRx guide, is an appropriate and standard procedure for quality control. |
Quality control issue 2: File Transfer -I have done this using md5sum and the file was not corrupted during transfer. |
Quality control issue 3: Genome build -To do |
Quality control issue 4: Effect allele -Make sure that the effect allele in the GWAS SS is clear. Done: We have arranged the GWAS SS to the same columns as what is required for LDPred. Below is a list of the required format of GWAS SS for step 1 of LDpred (left side) and what format the MEGASTROKE GWAS SS were in: Required format for LDpred - MEGASTROKE *The original location of this above table is here. |
Quality control issue 5: Ambiguous SNPs -We removed all SNPs that didn’t have identical rsIDs (see /oak/stanford/groups/euan/projects/ukbb/code/anna_code/risk_scores/Merging_bim_GWAS_SS.py). |
Quality control issue 6: Duplicate SNPs -I have checked and there are no duplicate SNPs |
Quality control issue 7: Sex chromosomes -I have checked our PLINK files and we have not included chromosome 23. The relevant PLINK files are located here: /oak/stanford/groups/euan/projects/ukbb/code/anna_code/risk_scores/step1_inputs |
Quality control issue 8: Sample-overlap: I have done this and there is no overlap. |
Quality control issue 9: |
Quality control issue 10: TO DO |
No description provided.
The text was updated successfully, but these errors were encountered: