HapMap Project logo
International HapMap Project
 

Home | About the Project | Data | Publications | Tutorial

 

XML and XML Schema in the HapMap project

Introduction

Data are exchanged between the DCC, genotyping groups and analysis groups in a format based on the eXtensible Markup Language. More specifically, the data format is strictly defined in the XML Schema language, itself based on XML. The benefits of using XML and related technologies, especially XML Schema, for a project like the HapMap project are numerous, some which are listed here:

General information on XML technologies:

Which schema definition to use?

Every XML document that goes between the DCC and other HapMap collaborators must A) refer to a specific URL on the Internet where the schema is located, and B) contain XML content that validates against that schema. See this directory for sample XML-documents. You can browse the HapMap schema by opening the schemaLocation URL at the top of each of the sample XMLs (http://www.hapmap.org/xml-schema/2003-11-04/hapmap.xsd as of May'04), in a web browser. There are documentation snippets attached to each non-trivial element defined in the schema.

As the HapMap schema will change over time (despite our best intentions to get it right the first time!), there will unavoidably be several versions of the schema referenced in XML documents from various points in time. All versions will be available through our webserver for validating against if necessary (see list of all versions to date here), but there will always be ONE primary schema that should be used when the groups need to prepare data for submission to the DCC at any given time.

Schema Layout

The HapMap schema consists of several small files that each describe a single kind of large-ish object (a SNP, a protocol), several related smaller ones or common datatypes or element groups. These components are all tied together with the master file, hapmap.xsd, that includes all the others. See the schema files themselves in this dir for descriptions of the various elements and datatypes(as XML <annotation> elements).

Last updated : index.html,v 1.2 2004/05/06 15:00:11 mummi Exp


Home | About the Project | Data | Publications | Tutorial
Please send questions and comments on website to help@hapmap.org