Objectives: Based on the experience receiving genomics data, coupled with the need to provide access to genomics data from multiple different repositories, the NDAR team implemented a predefined set of parameters that would guarantee the consistency of raw experimental data, while simplifying the data definition for submission and aggregation across federated repositories such as the Autism Genetics Research Exchange, the NIH database for genotypes and phenotypes (dbGaP), the NIMH Genetics Repository, and the Simons Foundation Research Initiative, among others.
Methods: After thorough analyses of functional genomics data acquisition and storage criteria, such as MIAME, MAGE, MINSEQE, etc., and review of the needs of the ASD research community, the NDAR team developed an interactive tool that defines the relationship between samples and data files clearly and as simply as possible.
Results: The NDAR Genomics Tool standardizes the naming of data processing and analysis protocols, requires entering sufficient details and enforces unambiguous interpretation of the entered information. Scheduled for launch in December of 2010, in time for the January NDAR submission cycle, the tool will be used by ASD investigators to define their genomics data allowing data aggregation through NDAR across projects and data repositories.
Conclusions: The NDAR team will present at the IMFAR 2011 conference the conclusions from utilizing the NDAR Genomics Tool for submission into NDAR. Furthermore, we will update IMFAR attendees on the progress of refining and utilizing this tool in the process of establishing data federation with the Autism Genetic Research Exchange and dbGaP.