Hey,
first, thank you for developing this pipeline!
I tried running GenoPred on the UK Biobank data and installed it via Conda following the instructions. To quickly test the pipeline, I restricted both the target genotypes and the GWAS summary statistics to chromosome 22 only.
I loaded the reference data for the methods ldpred2, quickprs, and sbayesrc using the Snakemake rules (the ldpred2 reference was downloaded from Google Drive). The pipeline ran successfully until the indiv_report_i step, where it failed because the variance of the score files was too low (all zeros).
I’m wondering:
- Is this failure expected when only a single chromosome is used, and would using multiple chromosomes or genome-wide data be required?
- Or is there another mistake I might have made before reaching this step?
Best,
Marina