
More details about PRIDE XML format & runDescriptor.properties

PRIDE (PRoteomics IDEntifications database) is a public data repository for proteomics data.
PRIDE XML is the file format used for submission to the publicly available PRIDE database.
The PRIDE XML schema documentation is available here.

To provide metadata about your experiment for the PRIDE XML file, you can either use PRIDE Converter (recommended) or fill in the runDescriptor.properties file in hEIDI.

[FEATURE IN PROGRESS] Details about runDescriptor.properties

:!: Warning :!:

In the runDescriptor.properties file (in <HEIDI_PROJECT_DIR>\heidi.project\), the user provides information, as properties, to describe the experiment (title, description, contact info), the sample(s), the instrument, the protocol steps, etc.

The user must use very specific keys for these properties, for example contact.name=Hippolyte Calys to give the name of the person to contact.
When running the PRIDE XML Export, these properties are then parsed and replaced by the appropriate XML tags (simple XML tags or Controlled Vocabulary params) to create the resulting PRIDE XML file.
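
The parsing step described above can be illustrated with a minimal Python sketch. It is not hEIDI's actual implementation: the function names parse_properties and build_contact are hypothetical, and the only property key taken from this page is contact.name. The sketch assumes that contact.name maps to the <name> tag inside <contact>, as suggested by the default PRIDE XML file shown on this page.

```python
import xml.etree.ElementTree as ET


def parse_properties(text):
    """Parse simple key=value lines; blank lines and '#' comments are skipped."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines without '='
            props[key.strip()] = value.strip()
    return props


def build_contact(props):
    """Build a <contact> element (hypothetical mapping, for illustration only).

    A missing contact.name falls back to a '!!!'-delimited default,
    mirroring the default values described on this page.
    """
    contact = ET.Element("contact")
    name = ET.SubElement(contact, "name")
    name.text = props.get("contact.name", "!!! USER NAME !!!")
    return contact


example = "contact.name=Hippolyte Calys\n"
contact_xml = ET.tostring(build_contact(parse_properties(example)), encoding="unicode")
print(contact_xml)  # <contact><name>Hippolyte Calys</name></contact>
```

With an empty properties file, the same sketch produces the '!!! USER NAME !!!' default instead of a real name.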

:!: Current limitations of using the runDescriptor.properties file :!:

First, see what a CV Param is and how to use it.
Then browse the ontologies (Ontology Lookup Service) to find the appropriate CV Params.
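
As a rough illustration of what a CV Param looks like once written to XML, the Python sketch below builds a cvParam element with the four attributes that appear in the default PRIDE XML file on this page (accession, cvLabel, name, value). The helper make_cv_param is hypothetical, and the value "ESI" is an illustrative assumption: the real accession, label and name must be looked up in the relevant ontology.

```python
import xml.etree.ElementTree as ET


def make_cv_param(cv_label, accession, name, value=""):
    """Return a <cvParam> element carrying the four attributes used in PRIDE XML."""
    return ET.Element(
        "cvParam",
        {"accession": accession, "cvLabel": cv_label, "name": name, "value": value},
    )


# Attribute values copied from the default file on this page;
# only the value "ESI" is an assumed, illustrative replacement.
source = ET.Element("source")
source.append(make_cv_param("psi", "1", "type", "ESI"))
print(ET.tostring(source, encoding="unicode"))
```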

Notes
- Missing properties get a default value so that the required XML tags can still be created, but the user has to change these values afterwards. Default values are surrounded by three exclamation points ('!!!') to highlight them. Not all properties are required (see list below).
- The CVs given below are examples; they are not exhaustive and may change with the PRIDE format.
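
Since default values are marked with '!!!', an exported file can be checked for values that still need editing. The Python sketch below is one possible way to do this, not a hEIDI feature: the function find_placeholders is hypothetical and simply scans element text and attribute values for '!!! ... !!!' markers.

```python
import re
import xml.etree.ElementTree as ET

# Matches any run of text delimited by three exclamation points on each side.
PLACEHOLDER = re.compile(r"!!!.*?!!!")


def find_placeholders(xml_text):
    """List (location, value) pairs that still contain a '!!!' default value."""
    root = ET.fromstring(xml_text)
    hits = []
    for elem in root.iter():
        if elem.text and PLACEHOLDER.search(elem.text):
            hits.append((elem.tag, elem.text.strip()))
        for attr, value in elem.attrib.items():
            if PLACEHOLDER.search(value):
                hits.append((f"{elem.tag}@{attr}", value))
    return hits


sample = """<admin>
  <sampleName>!!! SAMPLE NAME !!!</sampleName>
  <contact><name>Hippolyte Calys</name></contact>
</admin>"""

for location, value in find_placeholders(sample):
    print(location, "->", value)
```

Such a check only catches values wrapped in '!!!'. A default such as the ProtocolName text in the default file, which is not wrapped this way, would have to be checked by hand.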

Default PRIDE XML file (i.e. obtained if runDescriptor.properties is empty)

<ExperimentCollection version="2.1">
  <Experiment>
    <Title>New Node</Title>
    <ShortLabel>test</ShortLabel>
    <Protocol>
      <ProtocolName>Protocol Name : To replace !</ProtocolName>
    </Protocol>
    <mzData xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="1.05" xsi:noNamespaceSchemaLocation="http://psidev.sourceforge.net/ms/xml/mzdata/mzdata.xsd" accessionNumber="Context_2">
      <description>
        <admin>
          <sampleName>!!! SAMPLE NAME !!!</sampleName>
          <contact>
            <name>!!! USER NAME !!!</name>
            <institution>!!! USER INSTITUTION !!!</institution>
            <contactInfo>!!! USER INFOS / TEL ... !!!</contactInfo>
          </contact>
        </admin>
        <instrument>
          <instrumentName>!!! INSTRUMENT MODEL  !!!</instrumentName>
          <source>
            <cvParam accession="1" cvLabel="psi" name="type" value="!!! INSTRUMENT SOURCE!!!"/>
          </source>
          <analyzerList count="1">
            <analyzer>
            </analyzer>
          </analyzerList>
          <detector>
          </detector>
        </instrument>
        <dataProcessing>
          <software>
            <name>!!! PROCESSING SOFTWARE NAME !!!</name>
            <version>!!! PROCESSING SOFTWARE NAME !!!</version>
          </software>
        </dataProcessing>
      </description>
      <spectrumList count="1623">
      </spectrumList>
    </mzData>
    <GelFreeIdentification>
      <Accession>KPYM_HUMAN</Accession>
      <Database>Sp_Trembl</Database>
    </GelFreeIdentification>
    ...
  </Experiment>
</ExperimentCollection>

Controlled Vocabulary (CV)

A PRIDE XML file uses several CVs, drawn from several ontologies.
More details about CVs are available on the PRIDE website, in the help for using CV.

Here are some examples of Ontologies/CV:

Click here to have access to all ontologies.