|
Description |
C2 domain signature. |
Pattern |
[ACG]-x(2)-L-x(2,3)-D-x(1,2)-[NGSTLIF]-[GTMR]-x-[STAP]-D-[PA]-[FY]. |
The Table below summarizes the comparison of ConSeq with the WebLogo server and PROSITE in identifying the annotated functional residues as highly conserved, in the eight active sites that were analyzed in the Validation section.
The ConSeq and the WebLogo servers were uploaded with the same MSA. The PROSITE server was scanned using the query sequence that was used for the ConSeq server.
The links to the results are highlighted.
Protein/Domain |
|
ConSeq |
WEBLOGO |
PROSITE |
Functional |
Highly conserved |
Highly conserved |
Highly conserved |
|
|
residues |
Residues |
Residues |
Residues |
|
(by annotation) |
(8+9 color grades) |
(~4 bits) |
(presented in a signature) |
SH2 domain |
|
|
|
|
Peptide binding site |
9 |
7 out of the 9 residues are highly conserved. |
6 out of the 9 residues are highly conserved. |
*profile |
SH3 interface |
3 |
2 of the 3 residues are highly conserved. |
1out of the 3 residues is highly conserved |
*profile |
Pyruvate Kinase |
|
|
|
|
Active site (pyruvate, K+, Mn2+, ATP binding) |
11 |
All 11 residues are highly conserved |
7 out of the 11 residues are highly conserved |
2 out of the 11 residues are highly conserved |
FBP binding site (allosteric regulation) |
4 |
All 4 residues are highly conserved. |
None of the residues is highly conserved. |
No pattern has been detected |
C2 domain |
|
|
|
|
Active site (Ca2+/ membrane binding) |
5 |
All 5 residues are highly conserved. |
3 out of the 5 residues are highly conserved. |
2 out of the 5 residues are highly conserved. |
HIV1- reverse transcriptase |
|
|
|
|
Catalytic site |
5 |
All 5 residues are highly conserved. |
2 out of the 5 residues are highly conserved. |
No pattern has been detected for the sequence |
RNA:DNA binding site |
16 |
12 of the 16 residues are highly conserved. |
5 out of the 16 residues are highly conserved. |
No pattern has been detected for the sequence |
SH3 domain |
|
|
|
|
Peptide binding site |
5 |
4 of the 5 residues are highly conserved. The fifth is color-graded 7. |
1 out of the 5 residues is highly conserved. |
*profile |
* PROSITE successfully identifies the sequences that were uploaded as SH2 and SH3 domains, using a profile. However, the profile does not discriminate between the highly conserved residues and the other residues in the domain; thus we could not compare these examples to the other servers.
Conclusion:
When comparing the ability of the three servers to identify the important functional residues as highly conserved, we found that ConSeq identifies most of the residues as highly conserved in the 8 functional sites. For example, in the active site of Pyruvate Kinase (PK), ConSeq identified all the 11 known residues as highly conserved, while Weblogo identified only 7 of them, and PROSITE successfully identified a motif that corresponds to the PK active site that highlights only 2 of the 11 residues.
Another advantage of the ConSeq server is the user-friendly visualization of the results. The results are easy to deduce and interpret, as compared to the results of the WebLogo server, for example.
REFERENCES
1. Falquet, L., Pagni, M., Bucher, P., Hulo, N., Sigrist, C.J, Hofmann, K. and Bairoch, A. The PROSITE database, its status in 2002. Nucleic Acids Res. 30, 235-238, 2002.
2. Glaser, F., Pupko, T., Paz, I., Bell, R.E., Bechor, D., Martz, E. and Ben-Tal, N. ConSurf: Identification of Functional Regions in Proteins by Surface-Mapping of Phylogenetic Information. Bioinformatics 19:1-3, 2002.
3. Pupko T., Bell R.E., Mayrose I., Glaser F. and Ben-Tal N. (2002). Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues. Bioinformatics 18 Suppl. 1 S1-S7
4. Sridharan, S., Nicholls, A. and Honig, B. A new vertex algorithm to calculate solvent accessible surface area. Biophysical Journal 61: A174, 1992.
5. Thompson, J.D, Higgins, D.G. and Gibson, T.J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.Nucleic Acids Res. 22:4673-4680. 1994.