User Tools

Site Tools


userguide:pride_converter

PRIDE Converter

PRIDE Converter is available at http://code.google.com/p/pride-converter/

  • Dowload the PRIDE_Converter_vX.Y.Z.zip file, then unzip.
  • To execute PRIDE Converter, double-click on PRIDEConverter-X.Y.Z.jar

PRIDE Converter converts mass spectrometry data from most common data formats (MASCOT .dat file for example) into PRIDE XML.
(PRIDE XML is the file format used for submission to the publicly available PRIDE database).
It presents a convenient, wizard-like graphical user interface, and includes efficient access to the Ontology Lookup Service (OLS).

Generate a PRIDE XML file from a Mascot DAT File

:!:Warning:!:

  • The purpose here is to create a PRIDE XML file from which we will only extract the header part (information about contact, experiment, sample, protocol, etc.). So, steps concerning the data itself (spectra selection, peptide and protein identification, etc.) are not described here.

:!:Tips:!:

  • PRIDE Converter help is accessible from each panel of the wizard
  • If you don't know which CV Term to use, you can always Browse Ontology
  • To move up/down rows in tables (Protocol steps, etc.) right-click on a row and select Move Up or Move Down

Data Source Selection

  1. Select Mascot DAT Files
  2. Next >

File Selection - Step 1 of 8

  1. Click on the button Select a File or a Folder and select a Mascot .dat file (the choice of this Mascot file is not very important as we will only keep the header of the generated PRIDE XML file)
  2. Next >

Spectra Selection - Step 2 of 8

  1. Keep default settings
  2. Next >

Experiment Properties - Step 3 of 8

Experiment Properties

  • Title: title used to describe the experiment
  • Description: experiment description (optional)
  • Label: short label used to describe the experiment
  • Project: project name (allow several experiments to be grouped)

Contact Information

  • Click on Add Contact
  • Click on Please Select a ContactCreate a New Contact… (you can create as much as contacts as needed, at least one)
    • Name: first+last name of people that submitted the data and/or did the experiment.
    • E-mail:
    • Institution: full adress of the laboratory
    • Ok

References (optional)

If the experiment is part of a published manuscript it is highly recommended to include a reference to the paper.

Sample Properties - Step 4 of 8

  • Click on Please Select a Sample SetCreate a New Sample Set… (a sample set consists of one or multiple samples)
    • Name: sample set name
    • Ok
  • Select the Single sample or Multiple Samples tab (a Single Sample is most likely needed)
  • Click on Add Sample CV Term to add a sample
    • Select the appropriate Ontology: for ex. NEWT (species), BTO (tissue), CTO (cell type), GO and DO (disease state)
    • In the 'Term Name Search' enter for ex. Chlamydomonas, select the appropriate row in the Search Results table, then click on Use Selected Term.
  • Next >

Protocol Properties - Step 5 of 8

Protocol

  • Click on Please Select a ProtocolCreate a New Protocol…
    • Name: protocol name
    • Ok
  • Click on Add Protocol Step (add as much protocol steps as needed)
    • Click on Ontology Lookup Service
      • Select the appropriate ontology, for ex. PRIDE Controlled Vocabulary [PRIDE]
      • Enter for ex. enzyme digestion as the Term Name
      • Select the appropriate row in the Search Results table.
      • Click on Use Selected Term
    • The CV Term you have selected appears now as a row in the Protocol Step table. Depending on the CV Term, it may be possible to specify a value. You can enter, for ex., Trypsin for the value of the Enzyme digestion CV Term (just double-click on the Value cell of this CV Term)
    • Ok

Identification Type

  • Select the appropriate technology 2D Gel or Other
  • Next >

Instrument Properties - Step 6 of 8

Instrument

  • Click on Please Select an InstrumentCreate a New Instrument…
    • Name: instrument name (for ex. LTQ-Orbitrap)
    • Ok
  • Source: click on the OLS button to find the appropriate CV term (for ex. in the Mass Spectroscopy CV (PSI-MS) [PSI] ontology, search the Electrospray Ionization Term Name)
  • Detector: click on the OLS button to find the appropriate CV term (for ex. in the PRIDE Controlled Vocabulary [PRIDE] ontology, search the Microchannel Plate Detector Term Name)
  • Click on Add Analyzer (at least one analyser)
    • Click on Ontology Lookup Service to find the appropriate CV term (for ex. in the Mass Spectroscopy CV (PSI-MS) [PSI] ontology, search the LTQ Orbitrap Term Name)
    • Click on Use Selected Item

Processing

  • Software Name: ex. Mascot Distiller
  • Software Version: ex. 2.2
  • Click on Add Processing Method (at least one processing method)
    • Click on Ontology Lookup Service to find the appropriate CV term (for ex. in the Mass Spectroscopy CV (PSI-MS) [PSI] ontology, search the Peak Processing Term Name)
    • Click on Use Selected Item
  • If needed, provide information the Value cell (for ex. CentroidMassSpectrum for the Peak Processing CV Term)
  • Next >

User Parameters - Step 7 of 8

If there is anything else you would like to say about the experiment this is where you can do just that (a user parameter has a name and a value, for ex. ImportantComponent for the name, Cryptonite for the value)

Output Properties - Step 8 of 8

  • Select an output folder
  • Click on Convert!
  • Dialogs about
    • modifications may appear. Click Ok for all.
    • protein isoforms may appear. Select Always Select The First Isoform, then Ok

A file mascotDatFile_MyLabel_to_PRIDE_YYYYMMDD_123456.xml.gz is created in your ouput folder (where myLabel is the label you specified in Step 3).
Uncompress the file to obtain the mascotDatFile_MyLabel_to_PRIDE_YYYYMMDD_123456.xml file.

userguide/pride_converter.txt · Last modified: 2011/03/04 18:17 by 132.168.74.230