Principles of metadata organization at the ENCODE data coordination center
Citations Over TimeTop 10% of 2016 papers
Abstract
The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.
Related Papers
- → Standards - Metadata standards for educational resources(2003)61 cited
- → Automatic extraction of table metadata from digital documents(2006)37 cited
- → Experiences in Deploying Metadata Analysis Tools for Institutional Repositories(2009)15 cited
- → Metadata tools for institutional repositories(2008)
- → The role of metadata standards in EOSDIS data search and retrieval(2003)