Skip to content

We’ve analyzed over 1,000 more phenotypes using the UK Biobank’s latest data

Two weeks ago, we were the first research group to post our analysis results of the newly released UK Biobank exome sequence data on 50,000 individuals. We have now added to our list of phenotypes analyzed, including the blood biomarker and brain, heart, and abdominal imaging data that the UK Biobank recently made available. Our total number of phenotypes analyzed is now 2,742, including 29 blood biomarkers and 908 imaging phenotypes.

Among the new results, the blood biomarkers are definitely the stars of the show, driving more than 40 new statistically significant associations, including expected associations between loss of function variants in PCSK9 and LDL levels. We additionally find that rare variants in STAB1, a transmembrane receptor that is thought to play a role in angiogenesis, are associated with MRI measurements in several brain structures. We found this most strongly with an MRI measurement reflecting blood flow in the putamen (Median T2star).

We have also made several improvements to our interactive search tool where you can explore the results yourself. In addition to adding the new phenotypes, clicking on a gene will now show you the individual rare variants driving its association with a trait. This lets you dive right into any association of interest to understand the data underlying the association. You can also download the full result dataset and a summary of the top hits here.

Blood biomarkers

ALPLAlkaline phosphatase; Phosphate 2.30E-1680.33%coding; LoF
GOT1Aspartate aminotransferase7.90E-680.12%coding; LoF
CST3Cystatin C1.70E-480.03%LoF; coding
ABCA1Apolipoprotein A; HDL cholesterol7.00E-38 0.60%coding; LoF
GPLD1Alkaline phosphatase2.70E-33 0.42%coding; LoF
SHBGSHBG1.50E-32 0.12%LoF; coding
GPTAlanine aminotransferase6.30E-27 0.16%coding
APOBLDL direct; Cholesterol; Apolipoprotein B; Triglycerides1.10E-26 0.12%LoF
PCSK9 LDL direct; Apolipoprotein B; Cholesterol1.10E-25 0.27%coding; LoF
SLC2A9Urate7.20E-19 0.18%coding
ALBAlbumin2.80E-18 0.02%LoF; coding
ASGR1Alkaline phosphatase2.40E-16 0.16%coding; LoF
ABCB11Alkaline phosphatase2.00E-13 0.23%coding
LCATHDL cholesterol; Apolipoprotein A5.90E-13 0.12%coding
GCKGlycated haemoglobin; Glucose6.90E-13 0.07%coding
APOA5Triglycerides9.00E-13 0.06%LoF
ANGPTL3Apolipoprotein A; Cholesterol; Triglycerides1.40E-12 0.22%coding; LoF
CRPC-reactive protein3.20E-10 0.09%coding
LIPCApolipoprotein A; HDL cholesterol4.20E-10 0.35%coding
CETPHDL cholesterol7.50E-10 0.05%LoF; coding
UGT1A1Total bilirubin; Direct bilirubin6.40E-090.08%coding
SCARB1HDL cholesterol8.10E-09 0.20%coding
UGT1A6Total bilirubin1.80E-08 0.34%coding
SLCO1B3Total bilirubin2.70E-08 0.62%coding
VTNCystatin C9.50E-080.02%LoF
TBX10Phosphate1.20E-07 0.05%LoF
NR1H3HDL cholesterol4.30E-07 0.24%coding


The methods we used were the same as those previously reported, but we now also make available some extra association information. Our main analysis of case-control traits requires at least 10 carriers expected in the minimum group based on the overall carrier frequency. However, associations that did not meet this criterion, but had at least 10 carriers in the smallest group, are also of interest. These associations are now highlighted in the downloadable summary file of our top associations. However, these associations should be interpreted with caution, as they are more prone to false positives than the more stringent table shown above. Also note that >50% of these associations were explained by a single variant, which was true of <10% of the associations that met our stringent criteria and is not the type of association that gene-based collapsing models are designed to identify. With these caveats, we show below the associations with p<1×10-9.

Low expected case carriers

This research has been conducted using the UK Biobank Resource under Application Number 40436.