ExPASy Home page |
Site Map | Search ExPASy | Contact us |
UniRule is a format describing conditional annotation templates (rules) used by UniProtKB/Swiss-Prot automated annotation projects. It defines annotation which will be generated for (selected) predicted features.
Structure
Each UniRule entry consists of the following parts.
The Header section contains technical information about the UniRules.
Each UniRule starts with one or more AC (ACcession number) lines.
The format of the accession number is for intance `PRU' followed by 5
digits for ProRules and `MF_' followed by 5 digits for HAMAP rules.
Application:
|
Format:
AC primary accession number;[ secondary accession numbers;] |
Examples:
AC PRU00001; AC PRU00002; PRU99998; PRU99999; AC MF_00022; |
The DC (Data Class) line specifies which type of data the rule annotates.
Application:
|
Format:
DC data_class;[ auto] |
Examples:
DC Protein; auto DC Domain; DC Site; |
Domains are generally not allowed to overlap. However in rare cases it is possible that a domain is located within another domain; e.g. EH domain and EF-hand. In that situation, the annotation of the smaller domain must be triggered in the rule for the longer domain.
The TR (Trigger) line describes which (selected) sequence analysis features trigger
the application of the curent UniRule.
Each UniRule may contain one or more TR lines.
Each feature name should appear only once in the TR line in the whole
UniRule database, as one type of feature is expected to trigger only one rule.
Application:
|
TR dbname; identifier1; identifier2; nbhits; level=level |
Examples:
TR PROSITE; PS50075; ACP_DOMAIN; 0-1; level=0 TR HAMAP; MF_00401; -; 1; level=0 TR General; Transmembrane; -; 1; level=0 |
TR Metamotif; mmsearch-options; metamotif |
Examples:
TR Metamotif; -; Signal>=GPI_anchor |
The metamotif field should correspond to the name of a detectable metamotif
feature. The metamotif feature name is at the same time the metamotif description.
It describes the arrangements of sub-features (by their name) in a sequence (@xref{mmsearch}).
The 'available' metamotifs are listed in $ANABELLE/data/metamotifs.dat
This file is used by mmsearch, and is created at release (anabelle_update.sh) by
extracting TR metamotif lines from CVSed UniRules.
The field mmsearch-options used to controls the behavior of the metamotif search automaton (during 'picohamap' times?). It is now obsolete, just use '-'.
This separator line conveys no meaning. It is used to separate blocks of lines.
The Names line indicates the name(s) of the motif described by the rule.
The first name is the one used in UniProtKB/Swiss-Prot, synonyms are listed below, one
name per line. If there is no specific name for a motif, the term
`Undefined' should be inserted as a placeholder.
Application:
|
Format:
Names: Undefined Names: Synonym1 Synonym2 ... |
Example:
Names: Nacht domain NACHT-NTPase domain Nucleoside triphosphatase domain |
The function line indicates the `generic' function of a domain. Usually it
should not be longer than one line, but in case it is, the lines following the
first one will be indented by a space.
Application:
|
Format:
Function: Undefined/Unknown Function: text |
Examples:
Function: Protein-binding Function: DNA-binding Function: Inhibits fibrinogen interaction with platelet receptors in snake venoms |
** SALRD unusually long in the C-terminus, flagged as atypical. |
The Annotation section includes all UniProtKB/Swiss-Prot annotation lines that can be applied to a rule match. Additionally there can be lines which indicate condition statements (`case', `else case', `else', `c?', `c!') and a line which indicates the motif or alignments, according to which the feature positions are calculated (See section 3.8.1 FT From line). The line order is the same as in UniProtKB/Swiss-Prot.
The ID line contains the mnemonic code for a protein name used in the entry
name of a UniProtKB/Swiss-Prot entry.
Application:
|
Format:
ID protein_name_code |
Example:
ID ACKA |
The DE line corresponds to the description line of a UniProtKB/Swiss-Prot entry, or it
contains only the part that is common to the considered group of protein entries; it is
then preceeded by a plus (`+') and it does not end with a period.
Application:
|
Formats:
DE Description. DE + partial_description |
Example:
DE Putative 3-methyladenine DNA glycosylase (EC 3.2.2.-). DE + (EC 2.7.3.-) |
DE Hemerythrin-like protein <ORFName>. |
The GN line contains the common gene name (and optionally synonyms) of a
protein family, when one exists.
Application:
|
Format:
GN Name=name;[ Synonyms=synonym[, synonym]...;] |
Examples:
GN Name=acpD; GN Name=groS; Synonyms=groES; |
The CC line(s) contains all applicable comment lines of a UniProtKB/Swiss-Prot entry.
Application:
|
Format:
CC -!- topic: text. |
Example:
CC -!- SIMILARITY: Belongs to the ABC transporter family. |
The CC line may contain :
CC -!- SIMILARITY: Contains # ARM repeat.
CC -!- PTM: The reversible ADP-ribosylation of #{Arg-101} inactivates ...
|
The latter becomes for example:
CC -!- SIMILARITY: Contains 5 ARM repeats. CC -!- PTM: The reversible ADP-ribosylation of Arg-112 inactivates ... |
The DR line main usage is to trigger 'child' rules in order to avoid
duplication of rule content. It has not much to do with 'real' DR UniProtKB/Swiss-Prot
annotation lines (in Anabelle, DR annotation is only performed for PROSITE
motifs).
The format of the DR line is similar to that of a TR line
(See section 2.3 TR line).
Application:
|
Format:
DR feature name; identifier1; identifier2; nbhits; trigger=[yes|strict|no] |
Examples:
DR PROSITE; PS00419; PHOTOSYSTEM_I_PSAAB; 1; trigger=no DR PROSITE; PS00010; ASX_HYDROXYL; 0-1; trigger=no DR General; Signal; -; 0-1; trigger=strict DR General; Transmembrane; -; 10-11; trigger=yes |
The KW line contains all applicable keywords for a UniProtKB/Swiss-Prot entry, one per line.
Application:
|
Format:
KW keyword ... |
Example:
KW Transferase KW Kinase |
The GO line contains all applicable Gene Ontology terms, one per line.
Application:
|
Note: no annotation is transfered from those lines (so are just
kind of internal references)! UniProtKB DR GO lines are added by a separated
dedicated pipeline.
Format:
GO accession-number; aspect:term ... |
Example:
GO GO:0019104; F:DNA N-glycosylase activity GO GO:0006281; P:DNA repair |
The FT (Feature table) line contains applicable features for a UniProtKB/Swiss-Prot entry. The
feature positions are calculated by the automated annotation program based
on the rule and the motif match positions.
Application:
|
Format:
FT From: template-id (template-accnumber) FT key from to desc. [FT [Optional;] [Group: n;] [Condition: pattern]] |
Examples:
FT From: CARB_ECOLI (P00968) FT DOMAIN Nter 403 Carboxyphosphate synthetic domain. |
FT From: unique identifier for a motif FT From: entry_name (accession_number) FT From: metamotif FT From: any |
Examples:
FT From: PS50234 FT From: ACP_ECOLI (P02901) FT From: PS50021=7,91=PS50021 FT From: any |
The FT feature line defines the actual FT lines that are to be propagated in member entries. Format:
FT key from to desc. |
Examples:
FT CHAIN to+1 Cter <name>. FT LIPID 1 1 GPI-anchor amidated <residue_name>. FT DOMAIN from to Laminin G-like #. FT TOPO_DOM Nter 6 Periplasmic (Potential). FT DOMAIN ? 8+1 EGF-like #. |
Placeholders
FT REGION 101 124 Necessary for interaction with @gn(ABC-1). |
FT LIPID 1 1 GPI-anchor amidated <residue_name>. |
FT LIPID 300 300 GPI-anchor amidated aspartate. |
FT ACT_SITE 6 6 [For protease activity] By similarity. |
FT ACT_SITE 506 506 For protease activity (By similarity). |
FT ACT_SITE 506 506 By similarity. |
FT DOMAIN from to Foobar #. FT METAL 87 87 Iron #1 (By similarity). FT METAL 118 118 Iron #1 (By similarity). FT METAL 118 118 Iron #2 (By similarity). FT METAL 180 180 Iron #2 (By similarity). |
FT DOMAIN 1 200 Foobar. FT METAL 87 87 Iron 1 (By similarity). FT METAL 118 118 Iron 1 (By similarity). FT METAL 118 118 Iron 2 (By similarity). FT METAL 180 180 Iron 2 (By similarity). |
FT DOMAIN 1 200 Foobar 1. FT DOMAIN 201 400 Foobar 2. FT METAL 87 87 Iron 1 (By similarity). FT METAL 118 118 Iron 1 (By similarity). FT METAL 118 118 Iron 2 (By similarity). FT METAL 180 180 Iron 2 (By similarity). FT METAL 287 287 Iron 3 (By similarity). FT METAL 318 318 Iron 3 (By similarity). FT METAL 318 318 Iron 4 (By similarity). FT METAL 380 380 Iron 4 (By similarity). |
The FT constraints line (also known as the FT Condition line)
gives constraints on the FT line immediately above it.
Format:
FT [Optional;] [Group: n;] [Condition: pattern] |
The Feature line is most often constrained by a pattern on the sequence (in cases where more complex rules are needed, the case statement (See section 6.1 Case statement) should be used.
Example:
FT From: ACP_ECOLI (P02901) FT BINDING 37 37 Phosphopantetheine (By similarity). FT Condition: S |
The `pattern' is specified in PROSITE format, with the addition that the character `*' may be used to specify an unconstrained range, e.g. `C-x*-C'. The region of the sequence corresponding to the feature must exactly match this pattern.
For the consistency of annotation, multiple FT lines that should be applied either all together or not at all should be grouped within an `FTGroup', to constrain the common presence of all sites. This group can the be referenced by case statements, for instance in the relevant KW and CC lines that depend on the presence of the feature.
Example:
case <FTGroup:1> KW GTP-binding end case XX case <OC:Bacteria> FT From: IF2_ECOLI (P02995) FT DOMAIN 392 540 G-domain. FT Group: 1 FT NP_BIND 398 405 GTP (By similarity). FT Group: 1; Condition: G-H-V-D-H-G-K-T FT NP_BIND 444 448 GTP (By similarity). FT Group: 1; Condition: D-T-P-G-H FT NP_BIND 498 501 GTP (By similarity). FT Group: 1; Condition: N-K-[LIVCM]-D end case |
Note: One FT line can be part of several FTGroups. If at least one of those groups is complete, the FT line passes its FTGroup constraint (implicit OR).
Example:
FT DISULFID 25 31 By similarity. FT Group: 1; Group: 2; Condition: C-x*-C |
The `Optional' label can be used to indicate that the absence of a feature should not be considered a trigger for warnings in annotation programs. It can only be present if a `Condition' pattern is supplied.
Example:
FT BINDING 37 37 Phosphopantetheine (By similarity). FT Optional; Condition: S |
Line identifier1: info in line1 following lines are indented Line identifier2: info in line1 following lines are indented ... |
Specify a warning that should be generated when the rule is used for
automatic annotation. Most often used within a case statement to indicate
the occurrence of an inconsistency that cannot be solved by the rule,
or that some annotation should be completed manually by a curator.
The SAM module transfers the text of Warn lines into the `**HW' section
of a UniProtKB/Swiss-Prot entry.
Application:
|
Format:
Warn: text |
Example:
case <OC:Proteobacteria> Warn: Check manually domain bounds end case |
Range by which the bounds of a domain may be chopped in order to annotate
successive domains in a exactly consecutive manner. This line can only be used
by programs if the complete size of the domain can be annotated; generally it
will not be possible to use it with a pattern that covers only part of the
domain.
Application:
|
Format:
Chop: Nter=max; Cter=max;[ Xter(motif)=max;]* |
Examples:
Chop: Nter=0; Cter=3; Chop: Nter=1; Cter=unlimited; Chop: Nter=0; Cter=0; Nter(Signal)=50; |
The Size line indicates the size relevant to a protein family or motif.
For entries of the data class `Protein', the minimal and maximal size of
proteins matching the rule are listed. For entries of the data class
`Domain', this line contains the size range of the complete domains
annotated in UniProtKB/Swiss-Prot. Members that are strongly divergent in size may be
excluded from the range. A size may be specified as `unlimited'.
Application:
|
Format:
Size: minimal_size-maximal_size; Size: fixed_size; |
Examples:
Size: 176-239; Size: 13-unlimited; Size: unlimited; |
Lists UniRules that are known to be similar in sequence, and risk to produce cross-matches. If the string `!' or `!!' is appended to the name of a rule, it means that the rule listed in the Related line supersedes the current rule, i.e. that matches to the current rule should be disregarded if a match with the listed rule is found: `!' in an overlapping region; `!!' anywhere on the protein.
The marker `!' is particularly useful when two different rules exist for a
`short' and a `long' version of the same protein (as occurs sometimes in HAMAP
families). `Long' proteins will match both profiles; under these circumstances
the `longer' UniRule should contain the `!' marker to supersede the
shorter UniRule.
Application:
|
Format:
Related: None; Related: Protein[!][!];[ Protein[!][!];]... |
Example:
Related: MF_00492; MF_00493; MF_00494; Related: MF_00344!; Related: ANA00003!!; |
The observed number of repetitions of a domain or site in a UniProtKB/Swiss-Prot
entry. The number may be specified as `unlimited'.
Application:
|
Format:
Repeats: value;[ no keyword;] Repeats: min-max; |
The optional attribute `no keyword' indicates that the presence of several copies of a rule of the type `Domain' should not trigger the addition of the keyword `Repeat' (See section 7.1 The keyword Repeat).
Examples:
Repeats: 1; Repeats: 2-4; Repeats: unlimited; no keyword; |
Specifies the subcellular location(s) in which a domain or site may occur.
Application:
|
Formats:
Topology: Undefined; Topology: location; |
Values for this topic are restricted to `Undefined', `Cytoplasmic' or `Not cytoplasmic'.
Example:
Topology: Not cytoplasmic; |
Lists accession number(s) of characterized proteins which were used to
build the UniRule (Note: indicative only). Uncharacterized protein families do not
necessarily have a template, this is noted as `Template: None;'. Note that
in many cases the propagated annotation is a subset of that found in the
characterized entries.
Application:
|
Format:
Template: accession_number;[ accession number;]... Template: None; Template: Undefined; |
Examples:
Template: P12345; Template: None; Template: Undefined; |
One or more example entry targeted by the rule.
Application:
|
Format:
Example: accession_number;[ accession number;]... Example: Undefined; |
Examples:
Example: P12345; Example: Undefined; |
Lists the taxonomic classes in which a rule match may be found. Application:
|
Format:
Scope: kingdom[; sub-taxon] [except sub-taxon ...] [not in taxcode[, taxcode]...] ... |
The kingdom line is indented by one space, while the subsequent lines are indented by two spaces.
Example:
Scope: Bacteria; Proteobacteria except Enterobacteriales except Pasteurellales Bacteria; Actinobacteria Archaea not in ARCFU, HALN1, METTH, METJA, PYRAB, PYRHO, SULSO, SULTO, THEAC, THEVO Plastid |
If it has been assessed with certainty that a UniRule is not represented in:
Lists UniRules to which a given UniRule may be fused in some instances.
Application:
|
Format:
Fusion: NT: None CT: None Fusion: NT: Protein[; Protein]... CT: Protein[; Protein]... |
Protein may be either a UniRule accession followed by an identifiers between round brackets (e.g. `MF_00222 (aroE)'), or a designation between angle brackets (e.g. `<Thioredoxin domain>') if no UniRule is available.
Example:
Fusion: NT: None CT: MF_00222 (aroE); <Unknown> |
Lists the organisms which the triggered motif is found in multiple copies.
Application:
|
Format:
Duplicate: None Duplicate: in taxcode[, taxcode]... |
Example:
Duplicate: in ANASP, CAUCR, LACLA, RHILO, RHIME, STAAU, SYNY3 |
Lists the organisms in which a triggered motif is found encoded on a plasmid.
Application:
|
Format:
Plasmid: None Plasmid: in taxcode[, taxcode]... |
Example:
Plasmid: in RHIME |
Comments concerning the rule, which should be made visible to the public.
Application:
|
Format:
Comments: None Comments: comment_text |
Example:
Comments: NUDIX-like C-terminal domain in SYNY3 |
Whenever curators commit a UniRule file, they are asked for a short description of the modifications which have been made. This description as well as the revision number, date and author of the modification are added automatically by CVS into the file, whenever the special keyword `$'Log$ is encountered.
Example:
# $Log: PRU00001.dat,v $ # Revision 1.2 2002/09/09 16:41:42 boeckman # format changes # # Revision 1.1 2002/08/20 12:16:52 gattiker # Created # |
The last (uppermost) Revision line indicates the current revision number and revision date of the UniRule. The revision number is composed of two integers connected by a period. The first integer is the version of the UniRule format specification used (Currently always 1). The second integer is the version number of the rule, which normally starts from 1 and is increased each time a modification to the rule is committed. The value 0 for the version number is used for rules created before UniRule version control was implemented (i.e. old HAMAP rules).
If a major modification which breaks format compatibility is ever made to the UniRule specification (i.e. this document), the first integer of the revision number will be increased. Thus a rule which used to be `Revision 1.4' will become `Revision 2.5'.
The entire content of the History section consists of lines prefixed by a hash sign (`#'). It is managed entirely by CVS, and it is strictly forbidden to alter its contents manually or by programs, even to correct typos.
Only the last Revision line appears in published version.
Format:
case <condition>[ and|or [not] [defined] <condition>]... else case <condition>[ and|or [not] [defined] <condition>]... else end case |
The `case' and `else case' lines include conditions that must be met for the lines below it to be applied, until the next `else case', `else' or `end case' statement. Condition lines (c! and c?, see below) do not break the latest case statement.
Note: It is not possible to use a `case' statement within a `case' statement, but it is possible to use the condition lines c! or c?.
Types of cases:
case <OG:Chloroplast> or <OC:Cyanobacteria> case not <OG:Chloroplast> and not <OG:Cyanelle> case <OC:Archaea> case <OC:Bacteria> case <OS:Staphylococcus aureus> |
Note for conditions on organism names (`case <OS:taxon>'): the organism name matches also subspecies, i.e. organisms with the same name followed by a space and then any text. For example, `Staphylococcus aureus' matches `Staphylococcus aureus RF122', but `Salmonella typhi' does not match `Salmonella typhimurium'.
case <FT:1> case <FT:1=A-x-x> |
Note: In the FT lines, this condition can only be used if the targeted FT is itself not in a FT[Group] case!
Note: The targeted feature number corresponds to its position/order in the FT block. Caution: FT lines in a FT[Group] case are also numbered according to their relative position in the FT lines, but the numbering starts at 1 + the number of FT element - not in a FT[Group] case (it is therefore simpler to put those FT elements at the bottom of FT lines)...
case <FTGroup:1>
==> all features in Group 1 must be propagated
|
Note: In the FT lines, this condition can only be used if the targeted FTGroup is itself not in a FT[Group] case!
case <Feature:PS50084>
case <AnyFeature:PS50084>
case <Feature:Transmembrane>2>
==> feature must be present over two times
|
The difference between `Feature' and `AnyFeature' is as follows. `Feature' only refers to the (rule) triggering feature + features that overlaps (by at least 50%) with it. `AnyFeature' refers to all features matching the sequence.
Those conditions refer to both selected and unselected features, except if an operator (>,<,==,>=,<=) is present: here only selected features will be examined. For example to test that there is no selected Signal_anchor feature (somewhere in the sequence), use:
case <AnyFeature:Signal_anchor<1> ==> selected feature must be absent |
case not <AnyFeature:Signal_anchor> ==> feature must be absent (whether selected or not) |
case <Feature:PS50084:5=E>
case <Feature:PS99999:10-13=N-{P}-[ST]-{P}>
|
case <Triggered> |
case <Property:Membrane=2> case <Property:NITROGEN_FIXATION> case <Property:NODULATION> case not <Property:METHANOG> |
case <Length<256> case <Length>420> |
UniRule conditions should be evaluated using ternary logic, where conditions evaluate to one of three values: true, false, or undef. Operators are defined as follows, consistently with their implementation in the Perl programming language. Note that certain rules are counterintuitive.
Binary operators: `and' and `or'
| i | j | i and j | i or j |
| true | true | true | true |
| true | false | false | true |
| true | undef | undef | true |
| false | true | false | true |
| false | false | false | false |
| false | undef | false | undef |
| undef | true | undef | true |
| undef | false | undef | false |
| undef | undef | undef | undef |
Unary operators: `not' and `defined'
| i | not i | defined i |
| true | false | true |
| false | true | true |
| undef | undef | false |
Operator associativity and precedence
The precedence order from highest to lowest and associativity are as follows.
| associativity | operator |
| right | defined |
| right | not |
| left | and |
| left | or |
Example application: if the number of membranes is known and equal to 2, then apply a given annotation item. Otherwise, apply a less specific annotation item.
case defined <Property:Membrane> and <Property:Membrane=2> CC -!- SUBCELLULAR LOCATION: Inner membrane-associated (By similarity). else CC -!- SUBCELLULAR LOCATION: Membrane-associated (By similarity). end case |
The condition line c! or c? contains additional constraints for the propagation of the line immediately below it. The format of this line is either:
c! condition c? condition |
where condition has the same syntax as in a case line, or, before a FT line, it can also include a PROSITE pattern expression.
Note: In FT lines, FT[Group] condition cannot be used (use case instead!).
The condition line differs from the case line in that
The condition of the c! line must be true, otherwise an error is expected. Tools using UniRules are recommended to produce an error message.
Example:
c! <Feature:PS00013> KW ATP-binding |
The condition of the c? line may be true or not, as the feature does not appear in all matches of the UniRule.
Example:
c? <Feature:PS99999:10-13=N-{P}-[ST]-{P}> and <OC_Eukaryota>
FT CARBOHYD 10 13 N-linked (Potential).
|
Exceptions: See section 7. Hidden information.
Transition: Condition lines should get automatically replaced by c! lines, and some of them later by c? lines. Mandatory conditions for disulfides should be suppressed, optional ones replaced by c? lines.
UniRules strive to include all information relevant to a motif. However, to avoid repetition, we did not include the information below, which is implicitly `known' by the automatic annotation pipeline tools.
The keyword Repeat is relevant to all rules of the data class Domain. This keyword applies when a domain or repeat is found at least twice in a protein. The corresponding part of the rule would be:
case <Feature:current_rule_accession_number>1> KW Repeat end case |
This behavior can be prevented by using the attribute `no keyword' in the Repeats line (See section 4.5 Repeats line).
For features with the key DISULFID, the constraint that the From and To positions both need to be a cysteine is implicit. The corresponding line would be the second line of the following example:
FT DISULFID 4 23 By similarity. FT Condition: C-x*-C |
Introduction
UniAln is a format for protein sequence alignments that complement the UniRules collection. Some UniRules are developed based on predictors from specialized databases such as PROSITE. However, other UniRules are based on curated alignments that constitute the UniAln collection. That is the method used in the HAMAP annotation project.
Format
UniAln alignments are in a format similar to that produced by the CLUSTAL suite of programs. Each alignment is composed of:
The alignment is subject to the following constraints:
The alignment header line
The first line of the alignment must start with the string `CLUSTAL' or `MUSCLE' or `T_COFFEE'. The rest of the line is free-text, but special tags are recognized by programs. The tags may be repeated. Tags are:
indicate that a sequence in the alignment is a template for feature propagation in the UniRule that uses the alignment. It is mandatory to indicate template sequences in alignments to allow alignment-based feature propagation in UniRules.
indicate the method that should be used to generate a profile from the alignment. Allowed values for method are:
(default) A profile should be generated using `pfw' and `pfmake' from the PFTOOLS package. There is no need to indicate this method, as it is the default.
A Hidden Markov Model should be generated using `hmmbuild' from the HMMER package and converted to a profile using `htop' from the PFTOOLS package. Profiles generated with `pfmake' are usually more sensitive than those generated with `hmmbuild'. In some cases this means that they are less discriminating. If it is observed that the default method causes false positives, it can be attempted to use the `hmmbuild' method to see if it solves the problem. See HAMAP 2003 paper for a discussion.
In a few HAMAP families, `hmmbuild' was able to avoid false positives and negatives, while `pfmake' was not:
Example header lines:
CLUSTAL CLUSTAL W (1.83) multiple sequence alignment template=XYLA_ECOLI template=XYLA_ACTMI CLUSTAL W (1.83) multiple sequence alignment template=XYLA_ECOLI profile_method=hmmbuild MUSCLE (3.52) multiple sequence alignment |
Example of a 'Domain' UniRule
AC PRU00241; DC Domain; TR PROSITE; PS50903; RUBREDOXIN_LIKE; 1; level=0 XX Names: Rubredoxin-like domain Function: It is involved in electron transfer processes. XX CC -!- SIMILARITY: Contains # rubredoxin-like domain. DR PROSITE; PS00202; RUBREDOXIN; 0-1; trigger=no case <FTGroup:1> GO GO:0009490; F:mononuclear iron electron carrier GO GO:0006810; P:transport GO GO:0006118; P:electron transport KW Transport KW Electron transport KW Metal-binding KW Iron end case XX FT From: PS50903 FT DOMAIN from to Rubredoxin-like #. FT METAL 6 6 Iron #1 (By similarity). FT Group: 1; Condition: C FT METAL 9 9 Iron #1 (By similarity). FT Group: 1; Condition: C FT METAL 38 38 Iron #1 (By similarity). FT Group: 1; Condition: C FT METAL 41 41 Iron #1 (By similarity). FT Group: 1; Condition: C XX Chop: Nter=0; Cter=0; Size: 34-54; Related: None; Repeats: 2; Topology: Cytoplasmic; Example: Q9V099; Scope: Bacteria Archaea # $Log: ex_domain.txt,v $ # Revision 1.5 2007/07/25 09:51:28 lesaux # Doc updated version 1 LSV/CR # // |
AC MF_00198; DC Protein; auto TR HAMAP; MF_00198; -; 1; level=0 XX ID SPEE case <OC:Bacteria> DE Spermidine synthase (EC 2.5.1.16) (Putrescine aminopropyltransferase) DE (PAPT) (SPDSY). end case case <OC:Archaea> DE Probable spermidine synthase (EC 2.5.1.16) (Putrescine DE aminopropyltransferase) (PAPT) (SPDSY). end case GN Name=speE; XX CC -!- FUNCTION: Catalyzes the production of spermidine from putrescine CC and decarboxylated S-adenosylmethionine (dcSAM), which serves as CC an aminopropyl donor (By similarity). CC -!- CATALYTIC ACTIVITY: S-adenosylmethioninamine + putrescine = 5'-S- CC methyl-5'-thioadenosine + spermidine. CC -!- PATHWAY: Amine and polyamine biosynthesis; spermidine CC biosynthesis; spermidine from putrescine: step 1/1. case <OC:Proteobacteria> CC -!- SUBUNIT: Homodimer (By similarity). else case <OC:Thermotogales> CC -!- SUBUNIT: Homotetramer (By similarity). else CC -!- SUBUNIT: Homodimer or homotetramer (By similarity). end case CC -!- SIMILARITY: Belongs to the spermidine/spermine synthase family. XX DR Pfam; PF01564; Spermine_synth; 1; trigger=no DR TIGRFAMs; TIGR00417; speE; 1; trigger=no DR PROSITE; PS01330; SPERMIDINE_SYNTHASE_1; 1; trigger=no DR PROSITE; PS51006; SPERMIDINE_SYNTHASE_2; 1; trigger=no XX KW Polyamine biosynthesis KW Spermidine biosynthesis KW Transferase XX GO GO:0004766; F:spermidine synthase activity GO GO:0008295; P:spermidine biosynthetic process XX FT From: SPEE_THEMA (Q9WZC2) FT REGION 152 153 S-adenosylmethioninamine binding (By FT similarity). FT Condition: [DN]-[AGV] FT BINDING 46 46 S-adenosylmethioninamine (By similarity). FT Condition: [QHNR] FT BINDING 101 101 S-adenosylmethioninamine (By similarity). FT Condition: [DE] FT BINDING 121 121 S-adenosylmethioninamine (By similarity). FT Condition: [ED] FT BINDING 170 170 S-adenosylmethioninamine (By similarity). FT Condition: D FT BINDING 173 173 Putrescine (By similarity). FT Condition: [DE] XX Size: 261-366; Related: None; Template: P09158; P70998; Q9WZC2; Q8U4G1; O25503; Scope: Bacteria not in AGRT5, ANASP, BACTN, BORBR, BORBU, BORPA, BORPE, BRAJA, BRUME, BRUSU, BUCBP, CAMJE, BLOFL, CAUCR, CHLCV, CHLMU, CHLPN, CHLTE, CHLTR, CORGL, COXBU, DEIRA, ENTFA, FUSNN, GLOVI, HAEDU, HAEIN, HELHP, LACLA, LACPL, LISIN, LISMO, MYCGA, MYCGE, MYCLE, MYCPE, MYCPN, MYCPU, PASMU, PORGI, PSEPK, RHILO, RHIME, RICCN, RICPR, STAAM, STAAN, STAAW, STAES, STRA3, STRA5, STRMU, STRP3, STRP8, STRP1, SYNEL, SYNY3, TREPA, TROW8, TROWT, UREPA, VIBCH, VIBPA, WIGBR Archaea not in HALSA, METAC, METKA, METMA, METTH Fusion: NT: <Unknown> CT: <Unknown> Duplicate: in AQUAE, BACAN, BACCR, LEPIN, PSEAE, RALSO, STRCO, THETN Plasmid: in RALSO Comments: None **In Buchnera sp. only speE and speD are present, neither the pathway from ornithine **nor the pathway from arginine are complete. XX # $Log: ex_protein.txt,v $ # Revision 1.1 2007/07/25 09:51:28 lesaux # Doc updated version 1 LSV/CR # // |
| [Top] | [Contents] | [Index] | [ ? ] |
| [Top] | [Contents] | [Index] | [ ? ] |
1. Introduction to UniRule format
2. Header section
3. Annotation section
4. Computing section
5. History section
6. Control statements
7. Hidden information
A. UniAln
B. Sample UniRules entries
| [Top] | [Contents] | [Index] | [ ? ] |
| Button | Name | Go to | From 1.2.3 go to |
|---|---|---|---|
| [ < ] | Back | previous section in reading order | 1.2.2 |
| [ > ] | Forward | next section in reading order | 1.2.4 |
| [ << ] | FastBack | previous or up-and-previous section | 1.1 |
| [ Up ] | Up | up section | 1.2 |
| [ >> ] | FastForward | next or up-and-next section | 1.3 |
| [Top] | Top | cover (top) of document | |
| [Contents] | Contents | table of contents | |
| [Index] | Index | concept index | |
| [ ? ] | About | this page |
ExPASy Home page |
Site Map | Search ExPASy | Contact us |
| Hosted by | Mirror sites: | Australia | Brazil | Canada | China | Switzerland |