A Comprehensive List of ASD Data Resources

           Associated academic publication: Al-jawahiri, R. & Milne, E. (2017). Resources available for autism research in the big data era: a systematic review. PeerJ, 5, e2880-e2880.

Resource Metadata Data Type Category Data Type Number of Participants with ASD/ Description
National Database for Autism Research (NDAR) meta Phenotypic, neuroimaging, genetic, omics Phenotypic, neuroimaging, genetic, omics data Over 80,203 participants (however this number includes the control participants of the ASD studies).
Simons Foundation Autism Research Initiative (SFARI) meta Phenotypic, neuroimaging, genetic Phenotypic data, biospecimens, genetic data, neuroimaging data, participant recruitment (to recruit SSC families for additional studies) Over 3,000 participants (SSC), over 200 participants (Simons VIP), 50,0001 participants (SPARK).
Autism Genetic Resource Exchange (AGRE) meta Phenotypic, genetic, biospecimens Phenotypic data; genetic data, biospecimens Over 1,700 families with over 3,300 ASD participants.
Interactive Autism Network (IAN) meta ASD participant recruitment services Phenotypic data, ASD participant recruitment services Over 17,000 participants.
Autism Spectrum Database-UK (ASD-UK) meta ASD participant recruitment services Phenotypic data, ASD participant recruitment services Over 3,000 families.
Autism BrainNet meta BioBank Postmortem brain and related biospecimens Over 25 donations (since 2014)1.
Autism Brain Imaging Data Exchange (ABIDE) meta Neuroimaging Resting state functional magnetic resonance imaging (R-fMRI), structural MRI, phenotypic data 539 participants (ABIDE I), 487 participants (ABIDE II).
Australian EEG Database (AED)2 meta Neuroimaging EEG data 50 participants3.
BrainMap4 meta Human brain statistical maps fMRI, PET, and structural coordinate-based results (x,y,z) in Talairach or MNI space 70 results / articles relevant to ASD functional data (using BrainMapWeb).
NeuroVault meta Human brain statistical maps Unthresholded statistical maps, parcellations, and atlases produced by MRI and PET studies Five studies: 277, 60, 50, 13, 218 participants in each study.
USC Multimodal Connectivity Database meta Brain connectivity matrices Brain connectivity matrices of fMRI and DTI 42 (fMRI) participants, 51 (DTI) participants.
Dryad meta General data repository lncRNA, MRI, metabolite, MEG Four studies: two, 34, 12, and 13 participants respectively.
FigShare4 meta General data repository Phenotypic, statistical, genetic data -
NIMH Repository and Genomics Resource (NIMH-RGR) meta Biospecimens, genetics Biospecimens (DNA samples and cell lines, Induced Pluripotent Stem Cell (iPSC) and Source Cells), GWAS, genomic sequences Biospecimens: 4,793 families and 19,359 individuals of which 17,189 have DNA cell lines. Genome-Wide Association Studies (GWAS) Data: 4 studies (1,232 cases, 739 families, 943 families, 935 families). Sequence data (exome): 2,119 cases.
Avon Longitudinal Study of Parents and Children (ALSPAC) meta Phenotypic, clinical, biospecimens, genetic Phenotypic, clinical, biospecimens, genetic (including GWAS, SNPs, VNTRs, in addition to sequence data from UK10K project available via EGA), ALPAC data linked with data (e.g., routine health and social records) from external sources, bespoke dataf 96 participants (as identified via follow up questionnaires completed by carers for when the proband was nine years old).
Coriell BioRepositories (including Autism Research Resource) meta BioBank Cell cultures, DNA samples, and induced pluripotent stem cells 158 ASD cases.
NIH NeuroBioBank (NBB) meta BioBank Postmortem brain and related biospecimens 64 ASD cases. 22 ASD suspected.
Medical Research Council London Neurodegenerative Diseases Brain Bank meta BioBank Postmortem brain and spinal cord tissue 4 ASD cases.

1 The data is not yet available: It is intended to be available in a future date according to the SFARI website.

2 There is no website or portal for the AED resource; however, the data is available via email requests to aed@newcastle.edu.au

3 The approximate number of ASD participants was found via email correspondence with aed@newcastle.edu.au

4 Accurate information regarding the approximate number of participants with ASD is not available due to the nature of the search functionality of FigShare's website. The search engine returns a large number of results that do not necessarily contain data relevant or useful for ASD primary or secondary analysis (e.g. figures, posters, or certain supplementary data from published articles).

The resources listed in this table contain data either from individuals with ASD or data relevant to ASD research that is collected from non-affected individuals (e.g. from individuals with certain genetic profiles or syndromes related to ASD research).

Genetics and Omics Data Resources
Resource Metadata Data Type Category Data Type Number of Participants with ASD/ Description
MSSNG meta Genetic/ Genomic Phenotypic, genomic (whole genome sequencing of blood DNA) 10,000 participants. However, data from only 3000 probands is currently available.
Simons Foundation Autism Research Initiative Gene (SFARI Gene) meta Gene Catalogue Animal Model, Protein Interaction (PIN), Gene Scoring, CNV An up-to-date, manually annotated reference set of ASD-linked genes.
Autism Chromosome Rearrangement Database (ACRD) meta Gene Catalogue Genomic structural variation data - CNVs A curated catalogue of structural variation related to ASD extracted from publicly available literature and unpublished data.
Autism Knowledgebase (AutismKB) meta Gene Catalogue A collection of genes and variations associated with ASD with annotations -
National Center for Biotechnology Information (NCBI) meta Genetics, omics A collection of multiple resources - Omics and sequencing data -
European Molecular Biology Laboratory (EMBL-EBI) meta Genetics, omics A collection of multiple resources - Omics and sequencing data -
Universal Protein Resource (UniProt) meta Protein sequences Protein sequences and their annotations Can be found among EMBL-EBI resources. 91 (reviewed) and 346 (unreviewed) protein records associated with ASD.
The European Genome-phenome Archive (EGA) meta Omics - Functional genomics Interaction of genotype and phenotype (including data from UK10K project) Can be found among EMBL-EBI resources.
Biological General Repository for Interaction Datasets (BioGRID) meta Omics Genetic and protein interaction data Resource that archives and disseminates genetic and protein interaction data.
Global Proteome Machine Database (GPM DB) meta Omics - Proteomics Proteomics data from tandem mass spectrometry Open-source system for analyzing, storing, and validating proteomics information derived from tandem mass spectrometry.
PeptideAtlas meta Omics - Proteomics Peptide sequences, mapping - proteome information/data A collection of peptides identified in a large set of tandem mass spectrometry proteomics experiments.
DNA DataBank of Japan (DDBJ) meta DNA and RNA sequences DNA and RNA sequences Annotated collection of all publicly available nucleotide sequences and their translated amino acid sequences.
The Chromosome 7 Annotation Project meta DNA sequences DNA sequence and annotation of the entire human chromosome 7 84 cases.
miRBase: the microRNA database meta miRNA sequences miRNA sequences and annotation -
Sullivan Lab Evidence Project (SLEP) meta Genetics, omics A collection of genes and variations associated with ASD with annotations Findings from genome wide linkage (GWL), genome wide association (GWA), and microarray (MA) studies for ASD.

The resources listed in this table contain data either from individuals with ASD or data relevant to ASD research that is collected from non-affected individuals (e.g., from individuals with certain genetic profiles or syndromes related to ASD research).