{"id":8288,"date":"2020-01-20T02:16:05","date_gmt":"2020-01-20T01:16:05","guid":{"rendered":"https:\/\/emilkirkegaard.dk\/en\/?p=8288"},"modified":"2020-02-21T02:45:36","modified_gmt":"2020-02-21T01:45:36","slug":"predicting-iq-from-genetics-how-far-have-we-come-january-2020","status":"publish","type":"post","link":"https:\/\/emilkirkegaard.dk\/en\/2020\/01\/predicting-iq-from-genetics-how-far-have-we-come-january-2020\/","title":{"rendered":"Predicting IQ from genetics: how far have we come? [January 2020]"},"content":{"rendered":"<figure id=\"attachment_8290\" aria-describedby=\"caption-attachment-8290\" style=\"width: 300px\" class=\"wp-caption alignright\"><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/allegrini-2019.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-8290 size-medium\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/allegrini-2019-300x227.png\" alt=\"\" width=\"300\" height=\"227\" \/><\/a><figcaption id=\"caption-attachment-8290\" class=\"wp-caption-text\">Results from Allegrini et al 2019<\/figcaption><\/figure>\n<p>Counting significant hits was always a dumb way to measure progress in genomic prediction of a trait. Breeders using animals and plants never bothered with this approach and they used ridge regression for best predictive power (<a href=\"http:\/\/www2.math.uu.se\/~thulin\/mm\/breiman.pdf\">a &#8220;two cultures&#8221; problem no doubt<\/a>). Researchers in human genetics are starting to catch up, implementing clever <a href=\"https:\/\/en.wikipedia.org\/wiki\/Elastic_net_regularization\">Enet<\/a> approach for array datafiles (<a href=\"https:\/\/www.biorxiv.org\/node\/709974.full\">Qian et al 2019<\/a>, called <strong>snpnet<\/strong>, based on <strong>glmnet<\/strong>). We are still waiting for this method to be widely used. It is possible to do summary statistics based Enet too (<a href=\"https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/gepi.22050\">Mak et al 2017<\/a>, called <strong>lassosum<\/strong>), but again, not many have done it yet.<\/p>\n<p>That being said, we still see a lot of progress owing to larger datasets and some improvements in using the output from &#8216;single-variant-at-a-time&#8217; regression that they use in regular GWASs. A brief summary. I focus on the <a href=\"https:\/\/www.teds.ac.uk\/researchers\/scientific-publications\">TEDS sample<\/a> (a bug UK twin sample with good DNA and cognitive testing) because this is the largest dataset not used to train GWASs with that has great cognitive testing. It&#8217;s someone could use the new subset of UK Biobank with improved cognitive testing to replicate the below (<a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0160289619300789\">Cox et al 2019<\/a>, n=29k).<\/p>\n<ul>\n<li>\n<div class=\"gs_citr\" tabindex=\"0\">Davies, G., Marioni, R. E., Liewald, D. C., Hill, W. D., Hagenaars, S. P., Harris, S. E., &#8230; &amp; Cullen, B. (2016). <a href=\"https:\/\/www.nature.com\/articles\/mp201645\">Genome-wide association study of cognitive functions and educational attainment in UK Biobank<\/a> (N= 112 151). <i>Molecular psychiatry<\/i>, <i>21<\/i>(6), 758.<\/div>\n<ul>\n<li>Polygenic score analyses indicate that up to <strong>5% of the variance in cognitive test scores<\/strong> can be predicted in an independent cohort.<\/li>\n<\/ul>\n<\/li>\n<li>\n<div class=\"gs_citr\" tabindex=\"0\">Selzam, S., Krapohl, E., von Stumm, S., O&#8217;Reilly, P. F., Rimfeld, K., Kovas, Y., &#8230; &amp; Plomin, R. (2017). <a href=\"https:\/\/www.nature.com\/articles\/mp2016107\">Predicting educational achievement from DNA<\/a>. <i>Molecular psychiatry<\/i>, <i>22<\/i>(2), 267.<\/div>\n<ul>\n<li>We found that <i>EduYears<\/i> GPS explained greater amounts of variance in <strong>educational achievement over time, up to 9% at age 16<\/strong>, accounting for 15% of the heritable variance. This is the strongest GPS prediction to date for quantitative behavioral traits.<\/li>\n<li>Not quite intelligence, but closer to intelligence (<em>g<\/em>) than to educational attainment.<\/li>\n<\/ul>\n<\/li>\n<li>\n<div class=\"gs_citr\" tabindex=\"0\">Krapohl, E., Patel, H., Newhouse, S., Curtis, C. J., von Stumm, S., Dale, P. S., &#8230; &amp; Plomin, R. (2018). <a href=\"https:\/\/www.nature.com\/articles\/mp2017163\">Multi-polygenic score approach to trait prediction<\/a>. <i>Molecular psychiatry<\/i>, <i>23<\/i>(5), 1368.<\/div>\n<ul>\n<li>The MPS approach predicted <strong>10.9% variance in educational achievement, 4.8% in general cognitive ability<\/strong> and 5.4% in BMI in an independent test set, predicting 1.1%, 1.1%, and 1.6% more variance than the best single-score predictions.<\/li>\n<\/ul>\n<\/li>\n<li>\n<div class=\"gs_citr\" tabindex=\"0\">Allegrini, A. G., Selzam, S., Rimfeld, K., von Stumm, S., Pingault, J. B., &amp; Plomin, R. (2019). <a href=\"https:\/\/www.nature.com\/articles\/s41380-019-0394-4\">Genomic prediction of cognitive traits in childhood and adolescence<\/a>. <i>Molecular psychiatry<\/i>, <i>24<\/i>(6), 819.<\/div>\n<ul>\n<li>In a representative UK sample of 7,026 children at ages 12 and 16, we show that we can now predict up to<strong> 11% of the variance in intelligence and 16% in educational achievement<\/strong>.<\/li>\n<li>As above, <em>educational achievement<\/em><strong>.<\/strong><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>As it so happens, there is a paper for each year, letting one see a kind of 4 year progress.<\/p>\n<p>Important caveat of the above! These predictions are not done on sibling pairs. When they are (<a href=\"https:\/\/www.cell.com\/ajhg\/fulltext\/S0002-9297(19)30231-9\">Selzam et al 2019<\/a>), the validity is ~50% reduced. This indicates some kind of training problem with the GWASs which either train on family related variance, detect population structure and use that, or something more complicated.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Counting significant hits was always a dumb way to measure progress in genomic prediction of a trait. Breeders using animals and plants never bothered with this approach and they used ridge regression for best predictive power (a &#8220;two cultures&#8221; problem no doubt). Researchers in human genetics are starting to catch up, implementing clever Enet approach [&hellip;]<\/p>\n","protected":false},"author":17,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2463,2591],"tags":[2305,2210,2465],"class_list":["post-8288","post","type-post","status-publish","format-standard","hentry","category-genomics","category-intelligence-iq-cognitive-ability","tag-genomic-prediction","tag-gwas","tag-polygenic-score","entry"],"_links":{"self":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts\/8288","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/users\/17"}],"replies":[{"embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/comments?post=8288"}],"version-history":[{"count":4,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts\/8288\/revisions"}],"predecessor-version":[{"id":8293,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts\/8288\/revisions\/8293"}],"wp:attachment":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/media?parent=8288"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/categories?post=8288"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/tags?post=8288"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}