The first step of Genomic Standards Consortium (GSC) has been to define the Information about a Genome Sequence (MIGS) and Minimum Information about a Metagenome Sequence (MIGS) specifications. Use of MIGS/MIMS will provide a mechanism for capturing a consensus-driven minimum set of metadata describing aspects of genomes and metagenomes such as geographic location and habitat type from which the sample was taken as well as the details of the sequencing method used.
It is the aim of the GSC to provide support for the richer capture of contextual data describing genomes and metagenomes by developing the Genomic Contextual Data Markup Language (GCDML). The support of maximum reporting of such projects, though, will require a much richer set of descriptors. Such descriptors must cover both the origin and processing of a sample, from the time of sampling up to sequencing, and the subsequent analysis. GCDML seeks to specifically support maximal reporting of contextual data and the desire of groups in the GSC to include more descriptors beyond the minimal MIGS/MIMS.