TY - JOUR
T1 - Enhanced statistics for local alignment of multiple alignments improves prediction of protein function and structure
AU - Frenkel-Morgenstern, Milana
AU - Voet, Hillary
AU - Pietrokovski, Shmuel
PY - 2005/7/1
Y1 - 2005/7/1
N2 - Motivation: Improved comparisons of multiple sequence alignments (profiles) with other profiles can identify subtle relationships between protein families and motifs significantly beyond the resolution of sequence-based comparisons. Results: The local alignment of multiple alignments (LAMA) method was modified to estimate alignment score significance by applying a new measure based on Fisher's combining method. To verify the new procedure, we used known protein structures, sequence annotations and cyclical relations consistency analysis (CYRCA) sets of consistently aligned blocks. Using the new significance measure improved the sensitivity of LAMA without altering its selectivity. The program performed better than other profile-to-profile methods (COMPASS and Prof_sim) and a sequence-to-profile method (PSI-BLAST). The testing was large scale and used several parameters, including pseudo-counts profile calculations and local ungapped blocks or more extended gapped profiles. This comparison provides guidelines to the relative advantages of each method for different cases. We demonstrate and discuss the unique advantages of using block multiple alignments of protein motifs.
UR - http://www.scopus.com/inward/record.url?scp=21444456479&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bti462
DO - 10.1093/bioinformatics/bti462
M3 - Article
C2 - 15870168
AN - SCOPUS:21444456479
SN - 1367-4803
VL - 21
SP - 2950
EP - 2956
JO - Bioinformatics
JF - Bioinformatics
IS - 13
ER -