Cathepsin L-like Protein
BLASTp Hit to Protein of Known or Suspected Function:
Organism: Tetrahymena thermophila
GI Number: 24474971
E-value: 2e-172
Protein and Coding Sequences:
TGD ID Number: 5.m00542
Evidence: TIGR Preliminary Gene Prediction 08/2004: predicted gene structure is fully supported by ESTs.
Coding Sequence:
ATGAACAAACTCATCCTCCTCGCCCTCGCTGGTACCGTCCTCTTAGGTGCCACCCTCTTA
TTGGTCAACCATAAGAGAGTCTCCGATGATGTCCCCATCACCGAAGAAGTCATCCTTAAA
TGGAAGTAATTCAAGCAAACCTACAACAAGAAGTTCTCTGATCCCGATCAAGAAGTTTAC
AGAATGGAAGTTTTCGCCTAAAACCTCGAAGTCGTCAAGTAGGATACCACTGGTACCTTC
GGTGTTACCCAATTCTTCGATTTAACCGCCCAAGAATTCGCTTCCATCTACCTCACCCTC
CAAGTTGAAAATGCTGAAGAAGCTGTCCACGATGCTGAACTCAACGGTGACATTAACTGG
GTTACCGCTGGTAAGGTTACTGCTGTTAAGAACTAAGGTTAATGCGGTTCATGTTGGGCT
TTCTCCACCACTGGTGCCCTCGAATCTGCTTTGATTGTCGCTGGTCAAGCCACCAACACC
ATCAATCTCTCTGAATAACAATTGGTTGACTGTTCCACCAGCTACGGTAACCAAGGTTGC
AACGGTGGTTTAATGGATAACGCCTTCAAGTACATCAAGGCTAACTAACTCACCACTGAA
AGTAACTATCCTTACACTGGCAAGGACGGAAAATGCAACTCAGCTGCCATCAAGGCCCCC
TTATACAGTCTTAAGGGTTTCACTGATGTCGCCAAGACCACCTCTGCCCTCCAAGCTGCT
ATCTAAAAGCAACCCGTCGCCATCGCTGTCGATGCTTCCAAGTGGTCTTACTACACTGGT
GGTGTCTTCTCTAACTGCGCTACTCAATTAAACCACGGTGTCCTCTTAGTCGGTATCGTT
AACGGTAACTGGTTAGTCAAGAACTCTTGGGGTGCCTCTTGGGGTGAAAACGGTTACATT
ACCTTAAAGGCTGGTAACACTTGCGGTCTTGCCAACGCTGCTTCTTATCCTACTGAGTGA
Protein Sequence:
MNKLILLALAGTVLLGATLLLVNHKRVSDDVPITEEVILKWKQFKQTYNKKFSDPDQEVY
RMEVFAQNLEVVKQDTTGTFGVTQFFDLTAQEFASIYLTLQVENAEEAVHDAELNGDINW
VTAGKVTAVKNQGQCGSCWAFSTTGALESALIVAGQATNTINLSEQQLVDCSTSYGNQGC
NGGLMDNAFKYIKANQLTTESNYPYTGKDGKCNSAAIKAPLYSLKGFTDVAKTTSALQAA
IQKQPVAIAVDASKWSYYTGGVFSNCATQLNHGVLLVGIVNGNWLVKNSWGASWGENGYI
TLKAGNTCGLANAASYPTE*
TTGGTCAACCATAAGAGAGTCTCCGATGATGTCCCCATCACCGAAGAAGTCATCCTTAAA
TGGAAGTAATTCAAGCAAACCTACAACAAGAAGTTCTCTGATCCCGATCAAGAAGTTTAC
AGAATGGAAGTTTTCGCCTAAAACCTCGAAGTCGTCAAGTAGGATACCACTGGTACCTTC
GGTGTTACCCAATTCTTCGATTTAACCGCCCAAGAATTCGCTTCCATCTACCTCACCCTC
CAAGTTGAAAATGCTGAAGAAGCTGTCCACGATGCTGAACTCAACGGTGACATTAACTGG
GTTACCGCTGGTAAGGTTACTGCTGTTAAGAACTAAGGTTAATGCGGTTCATGTTGGGCT
TTCTCCACCACTGGTGCCCTCGAATCTGCTTTGATTGTCGCTGGTCAAGCCACCAACACC
ATCAATCTCTCTGAATAACAATTGGTTGACTGTTCCACCAGCTACGGTAACCAAGGTTGC
AACGGTGGTTTAATGGATAACGCCTTCAAGTACATCAAGGCTAACTAACTCACCACTGAA
AGTAACTATCCTTACACTGGCAAGGACGGAAAATGCAACTCAGCTGCCATCAAGGCCCCC
TTATACAGTCTTAAGGGTTTCACTGATGTCGCCAAGACCACCTCTGCCCTCCAAGCTGCT
ATCTAAAAGCAACCCGTCGCCATCGCTGTCGATGCTTCCAAGTGGTCTTACTACACTGGT
GGTGTCTTCTCTAACTGCGCTACTCAATTAAACCACGGTGTCCTCTTAGTCGGTATCGTT
AACGGTAACTGGTTAGTCAAGAACTCTTGGGGTGCCTCTTGGGGTGAAAACGGTTACATT
ACCTTAAAGGCTGGTAACACTTGCGGTCTTGCCAACGCTGCTTCTTATCCTACTGAGTGA
RMEVFAQNLEVVKQDTTGTFGVTQFFDLTAQEFASIYLTLQVENAEEAVHDAELNGDINW
VTAGKVTAVKNQGQCGSCWAFSTTGALESALIVAGQATNTINLSEQQLVDCSTSYGNQGC
NGGLMDNAFKYIKANQLTTESNYPYTGKDGKCNSAAIKAPLYSLKGFTDVAKTTSALQAA
IQKQPVAIAVDASKWSYYTGGVFSNCATQLNHGVLLVGIVNGNWLVKNSWGASWGENGYI
TLKAGNTCGLANAASYPTE*
