{"id":15090,"date":"2026-03-27T11:53:02","date_gmt":"2026-03-27T10:53:02","guid":{"rendered":"https:\/\/emilkirkegaard.dk\/en\/?p=15090"},"modified":"2026-03-27T12:35:05","modified_gmt":"2026-03-27T11:35:05","slug":"dysgenics-for-whom","status":"publish","type":"post","link":"https:\/\/emilkirkegaard.dk\/en\/2026\/03\/dysgenics-for-whom\/","title":{"rendered":"Dysgenics for whom?"},"content":{"rendered":"<p>Some time ago, we published studies showing that the negative correlation between intelligence and fertility varies by other factors. Specifically, we found that it varies by religiousness and conservatism, in <a href=\"https:\/\/emilkirkegaard.dk\/en\/2024\/04\/the-eugenic-effect-of-religiousness-confirmed\/\">that leftists and unreligious people show stronger dysgenic effects<\/a>. These results were based on reasonably large samples. The largest was the cumulative GSS dataset with decades of representative Americans, resulting in tens of thousands of subjects, and still a large number of we subset to people with completed fertility (age 50+). Second, a replication in the Vietnam Experience Study, which has 4500 American former soldiers measured at age 38. This is somewhat problematic because men aren&#8217;t done reproducing at that age, also not in 1985 which is when the data is from.<\/p>\n<p>So I decided I wanted to dig into these findings and conduct a large-scale comprehensive replication. I surveyed Americans on Prolific and only targeted those with completed fertility in ages 50-69. I wanted to try out the Multifactor General Knowledge Test on another sample. It has an odd response format, where it shows you 10 options and says &#8220;Pick the 5 things that are X and don&#8217;t pick any things that aren&#8217;t X&#8221;. You get 1 point for each correct thing you picked and 1 point for each incorrect thing you didn&#8217;t pick. There are 32 questions, and thus one can get 320 points at most. We shall see that this format presents severe psychometric issues that I didn&#8217;t previously see, and <a href=\"https:\/\/emilkirkegaard.dk\/en\/2025\/10\/new-paper-psychometric-analysis-of-the-multifactor-general-knowledge-test\/\">Seb Jensen didn&#8217;t note them in our paper on this test<\/a>. In addition, I put together a few questions to measure overall religiousness, a standard US-style left-right scale, and a checklist for mental illness diagnoses. Thus, putting all of this together, we should be in a decent position to find those pesky effects. They are hard to find because the intelligence fertility correlation is weak to begin with, about 0.10, and we are looking for relatively week interactions with that, say, that changes it to 0.00 or 0.20. This require a lot of statistical precision.<\/p>\n<p>This post is basically the preview of the academic write-up of this article, since I just finished the analyses after waiting some months for the data collection to finish.<\/p>\n<p>This was the most annoying part. Naturally, you would think that since selecting the right things (say, American poets) out of a list of 10 options requires knowledge of such matters, and so does not picking the wrong options, these correct decisions should show positive correlations. But they don&#8217;t actually. They show negative correlations, in apparent violation of the positive manifold. Why is that? It&#8217;s because of guessing. Some people are more inclined to guess than others, and those people get\u00a0<em>both<\/em> more correct selections and more incorrect ones, thus creating negative correlations between them. We can see that by looking first at the pass rates by item:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-scaled.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15091\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-scaled.png\" alt=\"\" width=\"2560\" height=\"1664\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-scaled.png 2560w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-300x195.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-1024x666.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-768x499.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-1536x998.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_pass_rate_by_type-2048x1331.png 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/a><\/p>\n<p>The correct selections have higher pass rates, so people are better at picking the right options than they are at not picking the wrong ones. Their loadings distributions if we analyze with a single factor for all items (2PL model):<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-scaled.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15093\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-scaled.png\" alt=\"\" width=\"2560\" height=\"1664\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-scaled.png 2560w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-300x195.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-1024x666.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-768x499.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-1536x998.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_loading_dist_by_type-2048x1331.png 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/a><\/p>\n<p>You see the problem. And this is not because they are not scored correctly, they truly have reverse of expected loadings because of this format. We can also get the same pattern at the aggregate level by summing the positive and negative scores by item (thus 32*2 mini-sums):<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15094\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap.png\" alt=\"\" width=\"2100\" height=\"1800\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap.png 2100w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap-300x257.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap-1024x878.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap-768x658.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap-1536x1317.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/gk_question_heatmap-2048x1755.png 2048w\" sizes=\"auto, (max-width: 2100px) 100vw, 2100px\" \/><\/a><\/p>\n<p>Too small to read, they are in pairs, so 1st variable is poets for the positives (select them) and 2nd variable is poets negative. This pair actually has a positive correlation, but in general, the pairs don&#8217;t have positive correlations (blue color). For good measure, I went back to the old dataset and made the same plot to verify:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15110\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap.png\" alt=\"\" width=\"2100\" height=\"1800\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap.png 2100w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap-300x257.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap-1024x878.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap-768x658.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap-1536x1317.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mfgkt_question_heatmap-2048x1755.png 2048w\" sizes=\"auto, (max-width: 2100px) 100vw, 2100px\" \/><\/a><\/p>\n<p>It looks materially the same, so this pattern is not caused by Prolific users, but is also seen in these online test takers from <a href=\"https:\/\/openpsychometrics.org\/tests\/MGKT2\/\">openpsychometrics.org<\/a>.<\/p>\n<p>There&#8217;s a number of ways we could then score these data. Fortunately, we don&#8217;t have to guess from theory which one is best because we can simply check the correlations of the various scores with our other variables. The one that results in the strongest correlations is the best method empirically. We can use religiousness as our criterion given that it has a moderate negative correlation that is very well established. The sum score obtains a correlation of -0.32, higher than the other methods.<\/p>\n<p>The left-right and religiousness scales didn&#8217;t present any notable problems, just standard single factor good solutions. For the mental health items, since they are binary, we can plot their latent correlations (tetrachoric):<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_tetrachoric_heatmap.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15095\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_tetrachoric_heatmap.png\" alt=\"\" width=\"1500\" height=\"1200\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_tetrachoric_heatmap.png 1500w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_tetrachoric_heatmap-300x240.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_tetrachoric_heatmap-1024x819.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_tetrachoric_heatmap-768x614.png 768w\" sizes=\"auto, (max-width: 1500px) 100vw, 1500px\" \/><\/a><\/p>\n<p>Like cognitive data, the mental health checklist shows an mostly perfect positive manifold. The question is then whether a fancy IRT score (a weighted sum) can beat just taking a simple sum. We can compare the results of the scores:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-scaled.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15097\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-scaled.png\" alt=\"\" width=\"2560\" height=\"1664\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-scaled.png 2560w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-300x195.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-1024x666.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-768x499.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-1536x998.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/mh_sum_vs_irt-2048x1331.png 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/a><\/p>\n<p>They correlate at 0.96 but the linear model is inappropriate. Possibly this matters for some analyses.<\/p>\n<p>With this stuff done, we can now plot the correlation matrix across our variables:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-scaled.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15098\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-scaled.png\" alt=\"\" width=\"2560\" height=\"1664\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-scaled.png 2560w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-300x195.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-1024x666.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-768x499.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-1536x998.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/correlations_heatmap-2048x1331.png 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/a><\/p>\n<p>Most things look normal, e.g., worse mental health correlates with female and leftism, religiousness correlates with fertility and negatively with leftism. The most surprising is the ~0 correlation between mental health and IQ, <a href=\"https:\/\/emilkirkegaard.dk\/en\/2022\/02\/mental-illness-and-intelligence-the-relationship-is-negative\/\">normally it&#8217;s negative at about -0.20<\/a>. It may be because of this particular measure (general knowledge means a lot of time spent reading instead of socializing) or because of the self-selection of sampling older people filling out surveys on the internet for 8 USD\/hour.<\/p>\n<p>Fertility, this was just a question about how many biological children people have. We replicated the usual mysterious male-female difference:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-scaled.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15099\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-scaled.png\" alt=\"\" width=\"2560\" height=\"1664\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-scaled.png 2560w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-300x195.png 300w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-1024x666.png 1024w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-768x499.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-1536x998.png 1536w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/fertility_dist_by_sex-2048x1331.png 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><\/a><\/p>\n<p>It may be due to self-selection bias with the sampling. Notice also that the data do not fit the Poisson distribution exactly, there are too many 0&#8217;s. This has some implications for modeling. For predicting fertility we have some options:<\/p>\n<ul>\n<li>Ignore that it&#8217;s non-negative count data, and use chad OLS<\/li>\n<li>Use the simplest count model, Poisson<\/li>\n<li>Use 2-component models that deal with the zero-inflation, I tried hurdle and zero-inflated Poisson (ZIP). These models basically just fit 2 models, one to predict whether the fertility is 1+ and another to fit the value (1-X).<\/li>\n<\/ul>\n<p>The cost of the fancier models is that they have lower statistical power, but they can in theory detect opposite effects on having children vs. number of children. Anyway, we can try the main effects and interaction effects. Recall that the main effect is just the correlation\/beta of GK on fertility, and any differences in dysgenics will show up as an GK * X interaction. So we test for those for our 3 candidates:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-scaled.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15100\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-scaled.png\" alt=\"\" width=\"1007\" height=\"2560\" srcset=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-scaled.png 1007w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-118x300.png 118w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-403x1024.png 403w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-768x1953.png 768w, https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/table_main_and_full_interact-604x1536.png 604w\" sizes=\"auto, (max-width: 1007px) 100vw, 1007px\" \/><\/a><\/p>\n<p>This model requires a lot of power and even with 3600+ people, we get an unsatisfactory p = 2% for the leftism interaction. For this reason, we need to boost power, and the most obvious way is to fit reduced models that only test for one interaction at a time. Doing that we get these results:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/interact_effect_sizes.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15102\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/interact_effect_sizes.png\" alt=\"\" width=\"1650\" height=\"1050\" \/><\/a><\/p>\n<p>Looking at these, we see there are some findings but the confidence intervals are too wide even at this sample size. For the simplest models, we confirm the prior finding that leftism shows the dysgenic interaction, that is, beyond the main effects of leftism and GK, being high in both is particularly bad for fertility. In other words, the dysgenics effect is stronger for leftists than rightists. We also see the opposite trends for religiousness, though only the Poisson is beyond chance. The well known female effect is also non-significant, though again trending in the right direction. The fancier models confirm the leftism effect on the number of children, and also find the female dysgenic effect but only for having 0 vs. 1+ children. This effect is quite tiny, the correlations by sex are 0.19 for women and 0.16 for men. I didn&#8217;t expect to find any interactions for mental health, but I included it because it was cheap to include the diagnoses list.<\/p>\n<p>Based on these half-satisfactory results, we can meta-analyze the present results with the prior ones to see if they differ more than we expect by chance:<\/p>\n<p><a href=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/forest_plots.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15105\" src=\"https:\/\/emilkirkegaard.dk\/en\/wp-content\/uploads\/forest_plots.png\" alt=\"\" width=\"1200\" height=\"1000\" \/><\/a><\/p>\n<p>The conversions aren&#8217;t exactly valid because the prior studies used different measures, but in every case the current results align with prior ones, just that the uncertainty is too large. This is somewhat frustrating after I spent like 10k USD to gather these data, but I guess I should have done a proper power analysis beforehand. Anyway, all the effects were in the correct direction. We will have to look around for much larger datasets to properly test this idea, say, 10k+.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Some time ago, we published studies showing that the negative correlation between intelligence and fertility varies by other factors. Specifically, we found that it varies by religiousness and conservatism, in that leftists and unreligious people show stronger dysgenic effects. These results were based on reasonably large samples. The largest was the cumulative GSS dataset with [&hellip;]<\/p>\n","protected":false},"author":17,"featured_media":15104,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3636,1690,2591,2435],"tags":[2339,2001,2707,3238,3204,2625],"class_list":["post-15090","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-demographics","category-genetics","category-intelligence-iq-cognitive-ability","category-psychiatry","tag-dysgenics","tag-fertility","tag-interactions","tag-political-ideology","tag-prolific","tag-religiousness","entry","has-media"],"_links":{"self":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts\/15090","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/users\/17"}],"replies":[{"embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/comments?post=15090"}],"version-history":[{"count":5,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts\/15090\/revisions"}],"predecessor-version":[{"id":15111,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/posts\/15090\/revisions\/15111"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/media\/15104"}],"wp:attachment":[{"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/media?parent=15090"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/categories?post=15090"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/emilkirkegaard.dk\/en\/wp-json\/wp\/v2\/tags?post=15090"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}