Retention Time Alignment

Major steps followed by the Retention Time Alignment algorithm

  1. Collect species from the reference User context, predict their retention times using an external utility, and store the predictions in the reference species properties.
  2. For each identification gathered under the reference User context:
    1. Collect the species to align from the identification.
    2. Build pair groups from the collected reference species and the species to align. A pair group contains two groups of species that share the same sequence and calculated mass (group1 holds species from the reference User context; group2 holds the species to align from the identification).
    3. For each pair group, compute one (or several) representative value(s) for group1 and for group2.
    4. Compute a linear regression between the representative values.
    5. Store the linear regression coefficients in the identification properties.
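
The per-identification steps above (pair groups, representative values, linear regression) can be sketched in Python as follows. This is only an illustration under several assumptions that are not taken from the original: species are plain dicts with hypothetical keys `sequence`, `mass` and `rt` (for reference species, `rt` holds the predicted RT), masses are matched after rounding to 4 decimals, the median stands in as a placeholder for the representative value, and the regression is fitted with the RT of the species to align as x and the reference RT as y. The function names and the returned property names are hypothetical as well.

```python
from collections import defaultdict
from statistics import median


def build_pair_groups(reference_species, species_to_align, mass_decimals=4):
    """Pair species sharing the same sequence and calculated mass.

    Returns {key: (group1, group2)}, where group1 comes from the reference
    context and group2 from the identification. Keys found on only one
    side are dropped. Rounding the mass is an assumption of this sketch.
    """
    group1 = defaultdict(list)
    group2 = defaultdict(list)
    for sp in reference_species:
        group1[(sp["sequence"], round(sp["mass"], mass_decimals))].append(sp)
    for sp in species_to_align:
        group2[(sp["sequence"], round(sp["mass"], mass_decimals))].append(sp)
    return {key: (group1[key], group2[key]) for key in group1 if key in group2}


def representative_rt(group):
    """Placeholder representative value: the median RT of the group."""
    return median(sp["rt"] for sp in group)


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2:
        raise ValueError("at least two pair groups are needed")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def align_identification(reference_species, species_to_align):
    """Return the coefficients that would be stored on the identification."""
    pairs = build_pair_groups(reference_species, species_to_align)
    xs = [representative_rt(g2) for _, g2 in pairs.values()]
    ys = [representative_rt(g1) for g1, _ in pairs.values()]
    slope, intercept = fit_line(xs, ys)
    # Hypothetical property names, chosen for this sketch only.
    return {"rt_alignment_slope": slope, "rt_alignment_intercept": intercept}
```

With these coefficients, an RT observed in the identification would map to the reference scale as `slope * rt + intercept`. If the tool fits the regression in the opposite direction, the roles of `xs` and `ys` simply swap.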

In more detail...

  1. Species retention times of the reference User context are predicted with the NETPrediction v2.2.3378 utility using the Kangas method. The NETPrediction utility uses only the species sequences to predict a Normalized Elution Time (NET) value.
    1. First, a list of 'reference' species is built
      • The reference species list doesn't contain any species with PTMs
      • If several species share the same sequence, the species with the best score is retained
    2. Then, the corresponding sequences are exported for use by the NETPrediction utility
  2. The predicted NET values are converted to retention times using user-defined parameters.
  3. The user can decide to exclude predicted values that lie too far from the others:
    • The average absolute deviation between RT and predicted RT is computed, and every predicted RT whose deviation differs from this average by more than a user-defined threshold is excluded.
  4. The predicted RT values are stored as properties in the reference species.
  5. For each identification existing under the reference User context:
    1. All the final child species to align are collected
    2. Pair groups are created from the reference species and the species to align. A pair group contains two groups of species that share the same sequence and calculated mass (group1 holds species from the reference User context; group2 holds the species to align from the identification).
    3. For each group, one or several representative RT values are calculated according to the statistical method chosen by the user.
      • Each 'Group1' always contains exactly one species, because the species/protein grouping has been executed on the reference context.
      • 'Group2' may contain one or more species. The following statistical methods are available:
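
Steps 1 to 3 of this detailed list (building the reference list, converting NET to RT, and excluding outlying predictions) can be sketched in Python as follows. This is only an illustration under assumptions that are not stated in the original: species are dicts with hypothetical keys `sequence`, `ptms`, `score`, `rt` and `predicted_rt`; a higher score is treated as better; the NET to RT conversion is taken to be linear with a user-supplied slope and intercept; and the exclusion rule is read as "drop a prediction when its absolute deviation is further than the threshold from the average absolute deviation".

```python
def build_reference_list(species):
    """Keep unmodified species only, one per sequence (best score wins)."""
    best = {}
    for sp in species:
        if sp["ptms"]:  # species carrying PTMs are left out
            continue
        current = best.get(sp["sequence"])
        # Assumption: a higher score means a better species.
        if current is None or sp["score"] > current["score"]:
            best[sp["sequence"]] = sp
    return list(best.values())


def net_to_rt(net, slope, intercept):
    """Convert a NET value to an RT (linear form assumed for this sketch)."""
    return slope * net + intercept


def exclude_outliers(species, threshold):
    """Drop species whose |RT - predicted RT| is far from the average deviation."""
    deviations = [abs(sp["rt"] - sp["predicted_rt"]) for sp in species]
    if not deviations:
        return []
    mean_deviation = sum(deviations) / len(deviations)
    return [
        sp
        for sp, dev in zip(species, deviations)
        if abs(dev - mean_deviation) <= threshold
    ]
```

The actual parameters of the conversion and the exact exclusion criterion are defined by the tool's settings, so the two helper formulas here should be adapted to match them.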

userguide/retentiontimealignment.1280747042.txt.gz · Last modified: 2010/08/02 13:04 by 132.168.74.230