-
Notifications
You must be signed in to change notification settings - Fork 2
/
sample_alignment.msa
executable file
·126 lines (126 loc) · 7.48 KB
/
sample_alignment.msa
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
>sp|P48449|ERG7_HUMAN Lanosterol synthase OS=Homo sapiens GN=LSS PE=1 SV=1
MTEG-TCLRRRGGPYKTEPATDLGRWRLNCERGRQTWTYLQDERA-GREQTGLEAYALGL
DTKNYFKDLPKAHTAFEGALNGMTFYVGLQAEDGHWTGDYGGPLFLLPGLLITCHVARIP
LPAGYREEIVRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVSLRILGVGPDDPDLVRARN
ILHKKGGAVAIPSWGKFWLAVLNVYSWEGLNTLFPEMWLFPDWAPAHPSTLWCHCRQVYL
PMSYCYAVRLSAAEDPLVQSLRQELYVEDFASIDWLAQRNNVAPDELYTPHSWLLRVVYA
LLNLYEHHHSAHLRQRAVQKLYEHIVADDRFTKSISIGPISKTINMLVRWYVDGPASTAF
QEHVSRIPDYLWMGLDGMKMQGTNGSQIWDTAFAIQALLEAGGHHRPEFSSCLQKAHEFL
RLSQVPDNPPDYQKYYRQMRKGGFSFSTLDCGWIVSDCTAEALKAVLLLQEKCPHVTEHI
PRERLCDAVAVLLNMRNPDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVM
QALKYFHKRFPEHRAAEIRETLTQGLEFCRRQQRADGSWEGSWGVCFTYGTWFGLEAFAC
MGQTYRDGTACAEVSRACDFLLSRQMADGGWGEDFESCEERRYLQSAQSQIHNTCWAMMG
LMAVRHPDIEAQERGVRCLLEKQLPNGDWPQENIAGVFNKSCAISYTSYRNIFPIWALGR
---------FSQLYPERALAG--HP-------------
>UPI0000E25965 status=active
MRGGPGCLRRRGGPYKTEPATDLGRWRLNCERGRQTWTYLQDERA-GREQTGLEAYALGL
DTKNYFKDLPKAHTAFEGALNGMTFYVGLQAEDGHWTGDYGGPLFLLPGLLITCHVARIP
LPAGYREEIVRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVSLRILGVGPDDPDLVRARN
ILHKKGGAVAIPSWGKFWLAVLNVYSWEGLNTLFPEMWLFPDWAPAHPSTLWCHCRQVYL
PMSYCYAVRLSAAEDPLVQSLRQELYVEDFASIDWLAQRNNVAPDELYTPHSWLLRVVYA
LLNLYEHHHSAHLRQRAVQKLYEHIVADDRFTKSISIGPISKTINMLVRWYVDGPTSTAF
QEHVSRIPDYLWMGLDGMKMQGTNGSQIWDTAFAIQALLEAGGHHRPEFSSCLQKAHEFL
RLSQVPDNPPDYQKYYRQMRKGGFSFSTLDCGWIVSDCTAEALKAVLLLQEKCPHVTEHI
PRERLCDAVAVLLNMRNPDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVM
QALKCFHKHFPEHRAAEIRETLTQGLEFCRRQQRADGSWEGSWGVCFTYGTWFGLEAFAC
MGQTYRDGTACAEVSRACDFLLSRQMADGGWGEDFESCEERRYVQSAQSQIHNTCWAMMG
LMAVRHPDIEAQERGVRCLLEKQLPNGDWPQENIAGVFNKSCAISYTSYRNIFPIWALGR
---------FSQLYTERALAG--HP-------------
>UPI0001CE251D status=active
MTEG-TSLRRRGGPYKTEPATDLSRWRLRSELGRQTWTYVGDTEAPERAQTALEAHSVGL
DTTSYFKDLPKAHTALEGALNGITFYVGLQAEDGHWTGDYGGPLFLLPGLLITCHTARIP
LPAGYREEMVRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVSLRILGVGPDDPDLVRARN
LLHQKGGAVAIPSWGKFWLAILNVYSWEGLNTLFPEMWLFPDWVPAHPSTLWCHCRQVYL
PMSYCYATRLSASEDPLIQSLRQELYVEDYASINWPAQRNNVAPDDLYTPHSWLLHVVYA
VLNLYESYHSTSLRQRAVRKLYAHIDADDRFTKGISIGPISKTINMLVRWFVDGPASPAF
QEHVSRIPDYLWLGLDGMKMQGTNGSQIWDTSFAIQAMLEAGAHHRPEFASCLQKAHEFL
RLSQIPDNPPDYQKYYRQMSKGGFCFSTLDCGWIVADCTAEALKSVLLLQETCPFVTEHV
PRERLCDAVAVLLNMRNPDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVM
QALRHFHAQFPDHRAQEIRETLQQGLEFCRQKQRPDGSWEGSWGVCFTYGTWFGLEAFAC
MGQRYQDGTACAEVSRACDFLLSRQMADGGWGEDFESCEQRRYVQSAQSQIHNTCWALMG
LMAVRHPDLEAQERGVRCLLDKQLPNGDWPQENISGVFNKSCAISYTSYRNVFPIWTLGR
---------FSRLYPERALAG--RP-------------
>tr|B0S5M5|B0S5M5_DANRE Novel protein similar to vertebrate lanosterol synthase (2,3-oxidosqualene-lanosterol cyclase) (LSS) OS=Danio rerio GN=lss PE=4 SV=1
MTEG-TCLRRRGGPYKTEPATDLSRWRLSNVDGRQSWRYIEETDSLDRPQSMLERHSLGL
DTSEFISASPAAHTAVEAALKGMDFYSRLQAEDGHWAGDYGGPLFLLPGLLITCHIAKIP
LPDAWKKEMVRYLRSVQLPDGGWGLHIEDKSTVFGTALSYTTLRILGVGPDDPDMVRARN
ALHNRGGAVGIPSWGKFWLAILNVYNWEGMNTLFPEMWLLPSWMPAHPSTLWCHCRQVYL
PMSYCYAVRLSADEDPLVLSLRQELYVQDYSTIDWPAQRNNVAACDLYTPHSNLLTFAYF
FLNVYEAHHSTILREKAVKELYDHIKADDRFTKCISIGPISKTINMLVRWYVDGPTSPAF
QKHVSRIPDYLWLGLDGMKMQGTNGSQLWDTAFAVQAFLEAGAQDIPRFTECLTQAHHFL
DLTQVKDNPPEYEKYYRQMNKGGFPFSTRDCGWIVADCVSEGLKSVMLLQEQCNFLKENI
PKERLFDAVNVLLSMRNPDGGFATYETKRGGKLLELLNPSEVFGDIMIDYTYVECTSAVL
QALKHFHSVYPEHRAEEIRSTLQRGLDYCRRVQRPDGSWEGSWGVCFTYGAWFGLEAFAC
MGHTFQNGSVCEEVKRACEFLLAKQMEDGGWGEDFESCEQRRYVQSSSSQIHNTCWALLG
LMAVRYPGTKVIERGIQLLIDKQLPNGDWPQENISGVFNKSCAISYTSYRNVFPVWTLGR
---------FTRLYPCNALTGKLKL-------------
>sp|P84466|ERG7_BOVIN Lanosterol synthase OS=Bos taurus GN=LSS PE=1 SV=2
MTEG-TCLRRRGGPYKTEPATDLSRWRLSNQVGRQTWTYSQEEDP-VREQSGLEAHLLGL
DTKSFFKDLPKAHTACRGALNGVTFYAALQTEDGHWAGDYGGPLFLLPGLLITCHVANIP
LPAGYREEIIRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVSLRILGVGPDDPDLVRARN
LLHKKGGAVFIPSWGKFWLAVLNVYSWEGLNTLFPEMWLFPDWMPAHPSTIWCHCRQVYL
PMAYCYSTRLSAEEGPLVQSLRQELYLEDYSCIDWAAHRNSVAPDDLYTPHSWLLHVVYA
ILNLYERHHSTSLRQWATQKLYEHIAADDRFTKCISIGPISKTINMLVRWHVDGPASAVF
QEHVSRIPDYLWLGLDGMKMQGTNGSQIWDTAFAIQALLEARAQHRPEFWSCLRKAHEYL
RISQVPDNFPDYQKYYRHMSKGGFSFSTLDCGWIVADCTAEALKSILLLQEKCPFVSNHV
PRERLFDTVAVLLSLRNPDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVM
QALKTFHKQFPDHRAGEIRETLEQGLQFCRQKQRPDGSWEGSWGVCFTYGAWFGLEAFAC
MGHTYHNGVACAEISRACDFLLSRQMADGGWGEDFESCKQRRYVQSAQSQIHNTCWALMG
LMAVRHPDVAALERGVSYLLEKQLPNGDWPQENISGVFNKSCAISYTSYRNVFPIWTLGR
---------FSRLHPDPALAG--HP-------------
>sp|P48450|ERG7_RAT Lanosterol synthase OS=Rattus norvegicus GN=Lss PE=1 SV=2
MTEG-TCLRRRGGPYKTEPATDLTRWRLHNELGRQRWTYYQAEEDPGREQTGLEAHSLGL
DTTSYFKNLPKAQTAHEGALNGVTFYAKLQAEDGHWAGDYGGPLFLLPGLLITCHIAHIP
LPAGYREEMVRYLRSVQLPDGGWGLHIEDKSTVFGTALSYVSLRILGIGPDDPDLVRARN
ILHKKGGAVAIPSWGKFWLAVLNVYSWEGINTLFPEMWLLPEWFPAHPSTLWCHCRQVYL
PMSYCYATRLSASEDPLVQSLRQELYVEDYASIDWPAQKNNVCPDDMYTPHSWLLHVVYG
LLNLYERFHSTSLRKWAIQLLYEHVAADDRFTKCISIGPISKTVNMLIRWSVDGPSSPAF
QEHVSRIKDYLWLGLDGMKMQGTNGSQTWDTSFAVQALLEAGAHRRPEFLPCLQKAHEFL
RLSQVPDNNPDYQKYYRHMHKGGFPFSTLDCGWIVADCTAEALKAVLLLQERCPSITEHV
PRERLYDAVAVLLSMRNSDGGFATYETKRGGYLLELLNPSEVFGDIMIDYTYVECTSAVM
QALRHFREYFPDHRATESRETLNQGLDFCRKKQRADGSWEGSWGVCFTYGTWFGLEAFAC
MGHIYQNRTACAEVAQACHFLLSRQMADGGWGEDFESCEQRRYVQSAGSQVHSTCWALLG
LMAVRHPDISAQERGIRCLLGKQLPNGDWPQENISGVFNKSCAISYTNYRNIFPIWALGR
---------FSSLYPDNTLAG--HI-------------
>sp|Q8BLN5|ERG7_MOUSE Lanosterol synthase OS=Mus musculus GN=Lss PE=2 SV=2
MTEG-TCLRRRGGPYKTEPATDLTRWRLQNELGRQRWTYYQAEDDPGREQTGLEAHSLGL
DTRSYFTDLPKAQTAHEGALNGVTFYAKLQAEDGHWAGDYGGPLFLLPGLLITCHISHIS
LPAGYREEMVRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVALRILGIGPDDPDLVRARN
VLHKKGGAVAIPSWGKFWLAVLNVYSWEGLNTLFPEMWLFPEWVPAHPSTLWCHCRQVYL
PMSYCYATRLSASEDPLVQSLRQELYVQDYASIDWPAQRNNVSPDEMYTPHSWLLHVVYG
LLNLYERFHSTSLRKWAVQMLYEHIAADDCFTKCISIGPISKTINMLVRWSVDGPSSPAF
QEHVSRIKDYLWLGLDGMKMQGTNGSQIWDTSFAIQALLEAGAHHRPEFLPCLQKAHEFL
RLSQVPENCPDYQKYYRHMRKGGFSFSTLDCGWIVADCTAEGLKAVLLLQNQCPSITEHI
PRERLCDAVDVLLSLRNADGGFATYEKKRGGYLLELLNPSEVFGDIMIDYTYVECTSAVM
QALKHFHEHFPDYRAAEVRETLNQGLDFCRRKQRADGSWEGSWGVCFTYGTWFGLEAFAC
MGHTYQDGAACAEVAQACNFLLSQQMADGGWGEDFESCEQRRYVQSARSQVHSTCWALMG
LMAVRHPDITAQERGIRCLLGKQLPNGDWPQENISGVFNKSCAISYTSYRNIFPIWALGR
---------FSNLYPDNTLAG--HI-------------
>tr|Q0IHW7|Q0IHW7_XENTR Lanosterol synthase (2,3-oxidosqualene-lanosterol cyclase) OS=Xenopus tropicalis GN=lss PE=2 SV=1
MS-GETCLRRRGGPYKTAPATDLTHWRLSCTEGRQTWCYVEEED---RKQTVLEAHSLGL
ETSDLLKDLPPPQTAYDGAYNGITFYSALQAEDGHWAGDYGGPLFLLPGLLIACHVTKTS
LPDATKKEMIRYLRSVQLPDGGWGLHIEDKSTVFGTALSYTSLRLLGVSQDDLDLTRARN
NLLSKGGAVGIPSWGKFWLAVLNVYSWEGMNTLFPEMWLLPHWFPAHPSTLWCHCRQVYL
PMSYCYATRLSAHEDDLIRSLRQELYLEDYSSINWPAQRNNVASCDIYTPHSTLLHIAYA
FLNVYESYHIPALRRRAVHELYDHIAADDRFTKCISIGPISKVINMLVRWHVDGSESSVF
REHVDRIPDYLWLGLDGMKMQGTNGSQLWDTAFAVQAYLEAGAHRRKEFQNCLEKAHEFL
RISQIPDNPPDYKKYYRQMNKGGFPFSTRDCGWIVADCTAEGLKSVMLLQEQCPFLTDLV
PPERLRDAVDVLLSMRNSDRGFATYETKRGGLLLELLNPSEVFGDIMIDYTYVECTSAVM
QALKHFQARDPNYRAQEIRETLQKGLDYCCSVQRQDGSWEGSWGVCFTYGIWFGLEAFAC
MGHTYKEG--CPEIIRACNFLLSHQMEDGGWGEDFESCEQRRYVQSAGSQIHNTCWALMG
LMAVGFPDVTVLERGVRLLLDKQLSNGDWPQENISGVFNKSCAISYTSYRNVFPIWTLGR
---------FFHLHPESSLAGLLKN-------------
>tr|F1NED1|F1NED1_CHICK Uncharacterized protein OS=Gallus gallus GN=LSS PE=4 SV=1
MR----------------------------------------------------------
------------------------FYATLQAEDGHWAGDYGGPLFLLPGLLIVCHTARIP
LPDGFRREMVRYLRSVQLPDGGWGLHVEDKSTVFGTALNYVALRILGLGPDDPDIVRARV
NLHSKGGAVGIPSWGKFWLAVLNVYSWEGMNTLLPEMWLLPTWFPAHPSRLWCHCRQVYL
PMSYCYAKRLSAEEDELIRSLQQELYVQDYASIDWPAQRNNVAACDVYTPHSWLLGIAYA
IMNVYEAHHSTYLRQRAITELYDHIKADDRFTKCISIGPISKTINMLVRWFVDGENSPAF
QEHVSRIPDYLWLGLDGMKMQGTNGSQLWDTAFAVQAFLEAEAQKIPEFMSCLQNAHEFL
RFTQIPENPPDYQKYYRHMNKGGFPFSTRDCGWIVADCTAEGLKSIMLLQEKCPFIANPV
PAERLFDAVNVLLSMKNSDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVM
QALRHFQDVYPEHRAPEIRETLQKGLDFCRKKQQADGSWEGSWGVCFTYGTWFGLEAFAS
MQHVYRDGVACREVARACQFLLSKQMTDGGWGEDFESCEQRTYVQSSTSQIHNTCWALLG
LMAVRYPDTGVLERGIKLLIDKQLPNGDWPQENVAGVFNKSCAISYTAYRNVFPIWTLGR
LGCIPTALLLSTCNPDPWL-GWGRPQRQHCLIEPMLWL