跳至正文
首页 » Predicting GEBV/GPRS

Predicting GEBV/GPRS

HIBLUP has the function of predicting genomic estimated breeding value (GEBV) or genomic polygenic risk score (GPRS) using the pre-computed SNP effects and individual level genotype data, which is the same with the function “--score” in PLINK software, but several times faster for computing.

The command to do prediction is as follows:

./hiblup --pred
         --bfile demo        #the binary genotype data
         --score demo.snpeff #the pre-computed SNP effects
         --threads 10
         --out demo

Note that the genotype should be in PLINK binary file, and at least 5 columns should be included in SNP effects file (see detailed format here). By default, HIBLUP codes the genotype in additive genetic effect (i.e., 0 1 2 for AA Aa aa), user can add a flag “--dom” to code the genotype in dominant effect (i.e., 0 1 0 for AA Aa aa) if the provided SNPs effects are all in dominant. But if both “--add” and “--dom” are specified in commands, the SNP effect file must have 6 columns, of which the 5th and 6th are the additive and dominant SNP effects, respectively. Thus, if there are additive and dominant effects in file, please remember to add flag --add --dom” in the commands.

A file named “demo.bv” will be generated in the work directory, overview of this file:

id	trait1	trait2
Ind2	-0.305403	2.6644
Ind5	0.00897198	-1.36166
Ind11	0.392148	-0.653216
Ind17	0.00232218	-0.213599
Ind22	-0.359507	-2.12692
Ind45	-0.232806	0.269005

As shown above, the first column is the individual id, and the rest of columns are the predicted GEBV or GPRS, the number of columns depends on the number of effects provided in SNP effects file.