Open data and behavioral genetics: room for improvement!

Open data is a fundamental part of getting science to work well. Primary reasons for this:

  • Redundancy is data archiving. Most data are lost because no backups exist!
  • Easy access to 3rd parties. For new analyses or error checking previous work. Scientists are human and often refuse access to data for hostile outsiders, preventing them from error checking their own work.
Unfortunately, there are only a few behavioral datasets in existence owing to not generally collecting datasets for multiple family members at a time. Some of the public or partially public ones are:
  • NLSYs (National Longitudinal Surveys of Youth) as described in
  • NCPP (National Collaborative Perinatal Project) The data files are really annoying to work with (fixed width format), but some people have released 3rd party versions that are easier
  • TEDS (Twins Early Development Study) is closed
    • But, part of it used to be partially public at this but removed now, I have put a copy here
    • Update: it is now moved to here and still available
  • PT (Project Talent), but so far not released I think
  • More? Contact me