fig|1040638.4.peg.4212
Escherichia coli O104:H4 str. LB226692
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585055.6.peg.1629
Escherichia coli 55989
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585055.8.peg.1632
Escherichia coli 55989
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|550672.3.peg.2335
Escherichia coli B088
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340184.3.peg.97
Escherichia coli B7A
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340184.6.peg.100
Escherichia coli B7A
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|595495.4.peg.4477
Escherichia coli KO11
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585395.4.peg.1675
Escherichia coli O103:H2 str. 12009
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|573235.3.peg.2101
Escherichia coli O26:H11 str. 11368
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|566546.3.peg.4544
Escherichia coli W
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|566546.4.peg.1586
Escherichia coli W
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|562.375.peg.3796
Escherichia coli EC4100B
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVILTLAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|358709.5.peg.959
Escherichia coli 101-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|670888.3.peg.2136
Escherichia coli 1827-70
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|481805.3.peg.2350
Escherichia coli ATCC 8739
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|481805.6.peg.2341
Escherichia coli ATCC 8739
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|413997.3.peg.1500
Escherichia coli B str. REL606
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|511693.5.peg.1539
Escherichia coli BL21
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|469008.4.peg.2235
Escherichia coli BL21(DE3)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|595496.3.peg.1423
Escherichia coli BW2952
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|536056.3.peg.2289
Escherichia coli DH1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656414.3.peg.1756
Escherichia coli H736
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|83333.1.peg.1454
Escherichia coli K12
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749540.3.peg.707
Escherichia coli MS 146-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749547.3.peg.1800
Escherichia coli MS 187-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749533.3.peg.5054
Escherichia coli MS 84-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|316407.3.peg.1427
Escherichia coli W3110
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|316385.5.peg.1581
Escherichia coli str. K-12 substr. DH10B
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|316385.7.peg.1621
Escherichia coli str. K-12 substr. DH10B
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|511145.12.peg.1534
Escherichia coli str. K-12 substr. MG1655
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|511145.6.peg.1520
Escherichia coli str. K-12 substr. MG1655
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|679207.4.peg.4687
Escherichia coli MS 107-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNG
A
IGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|679206.4.peg.4222
Escherichia coli MS 119-7
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNG
A
IGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|679204.3.peg.3255
Escherichia coli MS 145-7
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNG
A
IGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656443.3.peg.1808
Escherichia coli TA271
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNG
A
IGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340185.3.peg.2588
Escherichia coli E22
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
P
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340185.4.peg.2731
Escherichia coli E22
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
P
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585396.4.peg.1935
Escherichia coli O111:H- str. 11128
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKL
D
NGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|409438.11.peg.1698
Escherichia coli SE11
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
E
G
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585034.4.peg.1456
Escherichia coli IAI1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELS
L
PCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585034.5.peg.1452
Escherichia coli IAI1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELS
L
PCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340186.3.peg.174
Escherichia coli E110019
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
A
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340186.5.peg.186
Escherichia coli E110019
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
A
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|316401.4.peg.1766
Escherichia coli ETEC H10407
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
E
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQGK
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749545.3.peg.3740
Escherichia coli MS 182-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVT
S
MRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749532.3.peg.203
Escherichia coli MS 78-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVT
S
MRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749548.3.peg.200
Escherichia coli MS 196-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
H
ALKSG
D
LR
M
ACEQPD
S
S
SNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|331111.12.peg.1932
Escherichia coli E24377A (1-1245/1256)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVA
A
KAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAK
fig|331111.3.peg.4092
Escherichia coli E24377A (1-1245/1256)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
S
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVA
A
KAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAK
fig|749537.3.peg.2509
Escherichia coli MS 115-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFG
S
IENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMN
N
PGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|344610.3.peg.3567
Escherichia coli 53638
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
I
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
E
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQGK
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|344610.7.peg.5142
Escherichia coli 53638
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
I
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
E
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQGK
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|344601.3.peg.239
Escherichia coli B171
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLA
R
ASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
N
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|344601.5.peg.237
Escherichia coli B171
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLA
R
ASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
N
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|562.376.peg.3035
Escherichia coli WV_060327
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EAD
N
AGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
M
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLA
Q
HKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|550676.3.peg.1805
Escherichia coli B185
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYF
A
GIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|670897.3.peg.4873
Escherichia coli 2362-75
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLS
T
AVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|439855.10.peg.1864
Escherichia coli SMS-3-5
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
L
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656419.3.peg.2138
Escherichia coli M718
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSR
A
GPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585057.4.peg.1805
Escherichia coli IAI39
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAP
E
QGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
I
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
L
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGT
I
GSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585057.6.peg.1804
Escherichia coli IAI39
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAP
E
QGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
I
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
L
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGT
I
GSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656393.3.peg.2220
Escherichia coli H299
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
I
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLG
C
NPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|550677.3.peg.2883
Escherichia coli B354
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLV
I
LEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
D
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
I
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|362663.8.peg.1482
Escherichia coli 536
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|362663.9.peg.1487
Escherichia coli 536
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340197.3.peg.2972
Escherichia coli F11
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|340197.5.peg.3105
Escherichia coli F11
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|216593.1.peg.281
Escherichia coli E2348/69 (42-1287/1287)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
Q
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGS
S
RDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|574521.7.peg.1640
Escherichia coli O127:H6 str. E2348/69
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
Q
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGS
S
RDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656437.3.peg.1684
Escherichia coli TA143
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
N
N
PVLVRQLP
I
KNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|216592.1.peg.2007
Escherichia coli 042 (42-1287/1287)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PV
Q
VRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QAL
N
SG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
A
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|216592.3.peg.1656
Escherichia coli 042
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PV
Q
VRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QAL
N
SG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
A
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585397.7.peg.1634
Escherichia coli ED1a
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
L
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQ
V
YQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585397.9.peg.1627
Escherichia coli ED1a
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
L
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQ
V
YQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|331112.3.peg.1455
Escherichia coli HS
MSK
L
LDRFRYFKQKG
A
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRG
C
GGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PN
D
FPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDE
D
RDQVQEAKK
fig|331112.6.peg.1515
Escherichia coli HS
MSK
L
LDRFRYFKQKG
A
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRG
C
GGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTL
V
DG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PN
D
FPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDE
D
RDQVQEAKK
fig|749527.3.peg.3732
Escherichia coli MS 21-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVT
R
EIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
L
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749550.3.peg.672
Escherichia coli MS 200-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWG
K
KGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|431946.3.peg.1438
Escherichia coli SE15
MSK
L
LDRFRYFKQKGE
I
FADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFA
F
DWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749531.3.peg.4237
Escherichia coli MS 69-1
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAG
A
AFPYFGGIENPHFR
S
VK
N
N
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|655817.3.peg.1788
Escherichia coli ABU 83972
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRT
H
PDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSA
V
VDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|199310.1.peg.1841
Escherichia coli CFT073 (42-1287/1287)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRT
H
PDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSA
V
VDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|199310.4.peg.1770
Escherichia coli CFT073
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRT
H
PDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSA
V
VDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749528.3.peg.1218
Escherichia coli MS 45-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRT
H
PDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSA
V
VDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749546.3.peg.776
Escherichia coli MS 185-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRT
H
PDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHK
A
HGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSA
V
VDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656417.3.peg.1729
Escherichia coli M605
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLS
L
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEK
V
LNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656379.3.peg.3505
Escherichia coli FVEC1302
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLV
N
GLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
N
N
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QAL
N
SG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
A
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|656380.3.peg.2858
Escherichia coli FVEC1412
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLV
N
GLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
N
N
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QAL
N
SG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
A
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|749549.3.peg.894
Escherichia coli MS 198-1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLV
N
GLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
N
N
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QAL
N
SG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
A
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585056.7.peg.1910
Escherichia coli UMN026
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLV
N
GLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
N
N
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QAL
N
SG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
Q
E
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQ
A
CVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKREGPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|701177.3.peg.1819
Escherichia coli O55:H7 str. CB9615
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|562.371.peg.1740
Escherichia coli 1044A
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|562.373.peg.5086
Escherichia coli 1125A
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|562.372.peg.1224
Escherichia coli 1212A
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|562.374.peg.2359
Escherichia coli 536A
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|83334.1.peg.2101
Escherichia coli O157:H7
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|155864.1.peg.1996
Escherichia coli O157:H7 EDL933
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|155864.8.peg.1821
Escherichia coli O157:H7 EDL933
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444454.5.peg.979
Escherichia coli O157:H7 str. EC4024
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444449.5.peg.305
Escherichia coli O157:H7 str. EC4042
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444448.5.peg.4661
Escherichia coli O157:H7 str. EC4045
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444453.5.peg.2859
Escherichia coli O157:H7 str. EC4076
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444452.5.peg.1959
Escherichia coli O157:H7 str. EC4113
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444450.8.peg.2123
Escherichia coli O157:H7 str. EC4115
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444451.5.peg.1950
Escherichia coli O157:H7 str. EC4196
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|444447.5.peg.5573
Escherichia coli O157:H7 str. EC4206
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|478004.5.peg.2848
Escherichia coli O157:H7 str. EC4401
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|478005.5.peg.2974
Escherichia coli O157:H7 str. EC4486
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|478006.5.peg.1941
Escherichia coli O157:H7 str. EC4501
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|478007.5.peg.2145
Escherichia coli O157:H7 str. EC508
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|386585.9.peg.2176
Escherichia coli O157:H7 str. Sakai
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|544404.4.peg.1985
Escherichia coli O157:H7 str. TW14359
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|502346.5.peg.5295
Escherichia coli O157:H7 str. TW14588
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|525281.3.peg.1049
Escherichia coli 83972
M
DRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRT
H
PDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
R
VK
HN
PVLVRQLPVKNLTLADG
N
TCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSA
V
VDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|478008.5.peg.3666
Escherichia coli O157:H7 str. EC869
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
H
H
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|637388.3.peg.1650
Escherichia coli O157:H7 str. FRIK2000
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
H
H
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|570506.3.peg.3021
Escherichia coli O157:H7 str. FRIK966
MSK
L
LDRFRYFKQKG
D
TFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNW
P
ELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
I
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
K
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
H
H
PVLVRQLPVKNLTLA
G
GSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKV
S
AQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
G
IK
A
EADKAGLSP
T
EFT
A
QALKSG
D
LR
M
ACEQPD
S
GSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGTESGIQG
EE
LG
ASD
GIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDL
S
PGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRICPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|753642.3.peg.1683
Escherichia coli NC101
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|685038.3.peg.1468
Escherichia coli O83:H1 str. NRG 857C
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
C
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPA
Q
GRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVW
I
SETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|405955.13.peg.1593
Escherichia coli APEC O1
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|405955.9.peg.1297
Escherichia coli APEC O1 (42-1287/1287)
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|714962.3.peg.1660
Escherichia coli IHE3034
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|585035.6.peg.1552
Escherichia coli S88
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|869729.3.peg.2056
Escherichia coli UM146
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|364106.7.peg.1729
Escherichia coli UTI89
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|364106.8.peg.1731
Escherichia coli UTI89
MSK
L
LDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDP
Q
K
S
LSYKQVRGRGGFIRSNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASP
M
TWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHLDNPS
D
YFINYCRRYSDMPMLVMLEPRDDGSYVPGRM
V
RASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKGKWNLESI
--
AAG
T
ETELSLTLLGQHDAVAGVAFPYFGGIENPHFR
S
VK
HN
PVLVRQLPVKNLTLADGSTCPVVSVYDL
V
LANYGLDRGL
E
DENSAKDYAEIKPYTPAWGEQITGVPRQYIE
T
IAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVTAQELLSPLADASKYSGHLIDFNVRAERMGWLPSAPQLGRNPL
SL
K
A
EADKAGLSP
A
EFTVQALKSG
E
LR
M
ACEQPDNGSNHPRNLFVWRSNLLGSSGKGHEYMQKYLLGT
K
SGIQG
EE
LG
PTE
GI
Q
PEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVC
I
GHLGKETD
V
VLQPLLHDSPAELSQPCEVLDWRKGECDLIPGKTAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKR
D
GPAKGRPLIDTAIDASEVIL
A
LAPETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSGRQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEM
R
QI
P
PNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWLSETDA
R
ELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTR
V
CPKPTHMIGGYAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
fig|679205.4.peg.3294
Escherichia coli MS 124-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
A
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749533.3.peg.4566
Escherichia coli MS 84-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
A
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|362663.8.peg.1285
Escherichia coli 536
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|362663.9.peg.1285
Escherichia coli 536
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|405955.13.peg.1275
Escherichia coli APEC O1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|405955.9.peg.1058
Escherichia coli APEC O1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|199310.1.peg.1633
Escherichia coli CFT073
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|199310.4.peg.1566
Escherichia coli CFT073
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585397.7.peg.1384
Escherichia coli ED1a
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585397.9.peg.1377
Escherichia coli ED1a
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340197.3.peg.1054
Escherichia coli F11
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340197.5.peg.1111
Escherichia coli F11
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|714962.3.peg.1404
Escherichia coli IHE3034
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656417.3.peg.1527
Escherichia coli M605
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749550.3.peg.4511
Escherichia coli MS 200-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|753642.3.peg.1895
Escherichia coli NC101
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|685038.3.peg.1259
Escherichia coli O83:H1 str. NRG 857C
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|431946.3.peg.1246
Escherichia coli SE15
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656440.3.peg.1134
Escherichia coli TA206
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|869729.3.peg.2335
Escherichia coli UM146
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|364106.7.peg.1476
Escherichia coli UTI89
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|364106.8.peg.1474
Escherichia coli UTI89
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|550677.3.peg.2683
Escherichia coli B354
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
S
Q
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656437.3.peg.1370
Escherichia coli TA143
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|344601.3.peg.47
Escherichia coli B171
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|344601.5.peg.44
Escherichia coli B171
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585057.4.peg.1634
Escherichia coli IAI39
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585057.6.peg.1634
Escherichia coli IAI39
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|595495.4.peg.1164
Escherichia coli KO11
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585395.4.peg.1398
Escherichia coli O103:H2 str. 12009
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585396.4.peg.1617
Escherichia coli O111:H- str. 11128
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|573235.3.peg.1769
Escherichia coli O26:H11 str. 11368
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|566546.3.peg.278
Escherichia coli W
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|566546.4.peg.1315
Escherichia coli W
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749531.3.peg.4137
Escherichia coli MS 69-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656379.3.peg.3318
Escherichia coli FVEC1302
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
I
L
LHK
LPVK
R
L
Q
LADG
R
T
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
D
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
K
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656380.3.peg.2161
Escherichia coli FVEC1412
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
I
L
LHK
LPVK
R
L
Q
LADG
R
T
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
D
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
K
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749549.3.peg.3610
Escherichia coli MS 198-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
I
L
LHK
LPVK
R
L
Q
LADG
R
T
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
D
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
K
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585056.7.peg.1722
Escherichia coli UMN026
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
I
L
LHK
LPVK
R
L
Q
LADG
R
T
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
D
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
K
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|1040638.4.peg.4544
Escherichia coli O104:H4 str. LB226692
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|6666666.5357.peg.1028
Escherichia coli TY-2482
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|358709.5.peg.1557
Escherichia coli 101-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|562.371.peg.1116
Escherichia coli 1044A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|562.373.peg.2818
Escherichia coli 1125A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|562.372.peg.5118
Escherichia coli 1212A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|670888.3.peg.1908
Escherichia coli 1827-70
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|562.374.peg.5820
Escherichia coli 536A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585055.6.peg.1355
Escherichia coli 55989
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585055.8.peg.1354
Escherichia coli 55989
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|481805.3.peg.2572
Escherichia coli ATCC 8739
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|481805.6.peg.2561
Escherichia coli ATCC 8739
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|413997.3.peg.1268
Escherichia coli B str. REL606
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|550672.3.peg.663
Escherichia coli B088
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|469008.4.peg.2467
Escherichia coli BL21(DE3)
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|331111.12.peg.1664
Escherichia coli E24377A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|331111.3.peg.3837
Escherichia coli E24377A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656414.3.peg.1461
Escherichia coli H736
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|331112.3.peg.1248
Escherichia coli HS
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|331112.6.peg.1301
Escherichia coli HS
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585034.4.peg.1239
Escherichia coli IAI1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585034.5.peg.1235
Escherichia coli IAI1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|679207.4.peg.1931
Escherichia coli MS 107-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749540.3.peg.3368
Escherichia coli MS 146-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749545.3.peg.4375
Escherichia coli MS 182-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749532.3.peg.1875
Escherichia coli MS 78-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|83334.1.peg.1774
Escherichia coli O157:H7
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|155864.1.peg.1780
Escherichia coli O157:H7 EDL933
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444449.5.peg.5045
Escherichia coli O157:H7 str. EC4042
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444448.5.peg.4308
Escherichia coli O157:H7 str. EC4045
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444453.5.peg.3925
Escherichia coli O157:H7 str. EC4076
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444452.5.peg.3700
Escherichia coli O157:H7 str. EC4113
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444450.8.peg.1770
Escherichia coli O157:H7 str. EC4115
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444451.5.peg.4359
Escherichia coli O157:H7 str. EC4196
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|444447.5.peg.4557
Escherichia coli O157:H7 str. EC4206
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|478004.5.peg.4751
Escherichia coli O157:H7 str. EC4401
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|478005.5.peg.4786
Escherichia coli O157:H7 str. EC4486
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|478006.5.peg.4678
Escherichia coli O157:H7 str. EC4501
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|478007.5.peg.4146
Escherichia coli O157:H7 str. EC508
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|478008.5.peg.2943
Escherichia coli O157:H7 str. EC869
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|637388.3.peg.2072
Escherichia coli O157:H7 str. FRIK2000
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|570506.3.peg.3067
Escherichia coli O157:H7 str. FRIK966
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|386585.9.peg.1830
Escherichia coli O157:H7 str. Sakai
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|544404.4.peg.1631
Escherichia coli O157:H7 str. TW14359
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|502346.5.peg.2561
Escherichia coli O157:H7 str. TW14588
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|701177.3.peg.1486
Escherichia coli O55:H7 str. CB9615
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|409438.11.peg.1414
Escherichia coli SE11
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|585035.6.peg.1293
Escherichia coli S88
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VC
A
GHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656419.3.peg.1566
Escherichia coli M718
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
A
D
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|525281.3.peg.203
Escherichia coli 83972
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
K
FHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|655817.3.peg.1576
Escherichia coli ABU 83972
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
K
FHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|562.376.peg.3239
Escherichia coli WV_060327
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEK
V
LNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749528.3.peg.219
Escherichia coli MS 45-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPR
A
RPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|550676.3.peg.709
Escherichia coli B185
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
T
N
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
D
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQR
I
P
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|656393.3.peg.1990
Escherichia coli H299
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
D
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749527.3.peg.4102
Escherichia coli MS 21-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
A
S
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|679206.4.peg.774
Escherichia coli MS 119-7
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTL
T
GRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340185.3.peg.3191
Escherichia coli E22
MSKFLDRFRYFKQ
S
GETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340185.4.peg.3356
Escherichia coli E22
MSKFLDRFRYFKQ
S
GETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|670897.3.peg.2154
Escherichia coli 2362-75
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWL
G
E
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|216593.1.peg.1788
Escherichia coli E2348/69
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWL
G
E
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|574521.7.peg.1381
Escherichia coli O127:H6 str. E2348/69
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWL
G
E
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|439855.10.peg.2072
Escherichia coli SMS-3-5
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
A
S
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
N
D
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340184.3.peg.1744
Escherichia coli B7A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
N
D
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340184.6.peg.1832
Escherichia coli B7A
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
N
D
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|679204.3.peg.1464
Escherichia coli MS 145-7
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
N
D
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|216592.1.peg.1658
Escherichia coli 042
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
T
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMG
Y
V
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
S
Q
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|216592.3.peg.1331
Escherichia coli 042
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
T
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSLLGGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMG
Y
V
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IA
R
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
S
Q
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749546.3.peg.900
Escherichia coli MS 185-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EF
P
LDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
M
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAKEL
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|155864.8.peg.1584
Escherichia coli O157:H7 EDL933
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKAL
X
FLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340186.3.peg.3188
Escherichia coli E110019
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
N
Q
LMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|340186.5.peg.3322
Escherichia coli E110019
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
R
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
N
Q
LMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|344610.3.peg.4256
Escherichia coli 53638
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|344610.7.peg.4898
Escherichia coli 53638
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|595496.3.peg.1157
Escherichia coli BW2952
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|83333.1.peg.1213
Escherichia coli K12
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|316407.3.peg.1188
Escherichia coli W3110
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|316385.5.peg.1283
Escherichia coli str. K-12 substr. DH10B
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|316385.7.peg.1309
Escherichia coli str. K-12 substr. DH10B
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|511145.12.peg.1276
Escherichia coli str. K-12 substr. MG1655
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|511145.6.peg.1265
Escherichia coli str. K-12 substr. MG1655
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749547.3.peg.103
Escherichia coli MS 187-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
R
C
Y
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
PEKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|316401.4.peg.1539
Escherichia coli ETEC H10407
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LIAA
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
I
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749538.3.peg.4982
Escherichia coli MS 116-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LI
V
A
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749548.3.peg.3848
Escherichia coli MS 196-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LI
V
A
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AEL
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
fig|749544.3.peg.1687
Escherichia coli MS 175-1
MSKFLDRFRYFKQKGETFADGHGQ
LLNT
NRDWED
G
YRQRWQ
H
DKIVRSTHGVNCTGSCSWKIYVKNGLVTWE
T
QQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYP
MM
RKRL
MKM
WREA
KAL
HSDPV
E
AWASI
IE
D
A
DK
A
K
S
F
KQ
A
RGRGGF
V
RS
S
WQE
V
N
E
LI
V
A
S
NV
Y
TIK
N
YGPDRVAGFSPIPAMSMVSYA
S
G
A
RYLSL
I
GGTCLSFYDWYCDLPPASPQTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKT
V
A
V
TPDY
A
E
I
AKLCD
L
WLAPKQGTD
A
A
M
A
L
AMGHV
M
L
R
EFHLDNPSQYF
TD
Y
V
RRY
T
DMPMLVMLE
E
R
-
DG
Y
Y
AA
GRMLRA
A
DLVD
A
LG
QE
NNP
E
WKTVA
F
NT
N
GE
M
V
A
PNGSIGFRWGEKGKWNLE
QRDGKT
G
E
ETEL
Q
L
S
LLG
SQ
D
EI
A
E
V
G
FPYFGG
DGTE
HF
N
KV
E
LE
N
VL
LHK
LPVK
R
L
Q
LADGST
AL
V
TT
VYDLTLANYGL
E
RGLND
V
N
C
A
TS
Y
DDV
K
A
YTPAW
A
EQITGV
S
R
SQ
I
I
RIAREFAD
N
A
D
KTHGRSMII
V
GAG
L
NHWYH
L
DMNYRG
L
INMLIFCGCVGQSGGGWAHYVGQEKLRPQTGW
Q
PLAFALDW
Q
RP
A
R
H
MNSTS
Y
FYNHSSQWRYE
T
VTA
E
ELLSP
M
AD
K
S
R
Y
T
GHLIDFNVRAERMGWLPSAPQLG
T
NPLTI
A
G
EA
E
KAG
MN
PV
DY
TV
KS
LK
E
GS
I
RFA
A
EQP
E
NG
K
NHPRNLF
I
WRSNLLGSSGKGHE
F
M
L
KYLLGTE
H
GIQGKDLGQQGG
V
KPEEV
D
WQ
DNGL
EGKLDL
V
VTLDFR
L
SSTCL
Y
SDI
I
LPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWE
AK
SDWEIYK
A
IAK
K
FS
E
VCVGHLGKETDIV
TL
P
IQ
HDS
A
AE
R
A
QP
LD
V
K
DW
K
KGECDLIPGKTAP
H
I
MV
VERDYPATYERFTS
I
GPLM
E
K
I
GNGGKGI
A
WNTQ
S
E
M
D
L
L
R
KLNYTK
A
EGPAKG
Q
P
MLN
TAIDA
A
E
M
ILTLAPETNG
Q
VAVKAW
A
AL
S
E
F
TGR
D
HTHLAL
N
KEDEKIRFRDIQAQPRKIISSPTWSGLE
DE
HVSYNAGYTNVHELIPWRTLSGRQQLYQDH
Q
WMR
D
FGESL
LV
YRPPIDTRSV
K
E
V
I
GQ
K
S
NG
N
Q
EKALNFLTPHQKWGIHSTYS
D
NLLMLTL
G
RGGP
V
VWLSE
A
DAK
D
L
G
I
A
DNDW
I
EVFN
S
NGALTARAVVSQRVP
A
GMTMMYHAQERI
V
N
L
PGSE
I
T
QQ
RGGIHNSVTRI
T
PKPTHMIGGYA
H
LA
Y
GFNYYGTVGSNRDEF
VVV
RKMKN
ID
WLD
G
EG
N
DQVQE
SV
K
Consen1
Primary consensus
MSKfLDRFRYFKQKGeTFADGHGQvmhsNRDWEDsYRQRWQfDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEiQQTDYPRTRPDLPNHEPRGCPRGASYSWYLYSANRLKYPliRKRLielWREAlkqHSDPVlAWASImnDpdK
lSyKQvRGRGGFiRSnWqElNqLIAAaNVwTIKtYGPDRVAGFSPIPAMSMVSYAaGtRYLSLlGGTCLSFYDWYCDLPPASPqTWGEQTDVPESADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTiAiTPDYsEvAKLCDqWLAPKQGTDsAlAmAMGHViLkEFHLDNPSqYFinYcRRYsDMPMLVMLEpRdDGsYvpGRMlRAsDLVDgLGesNNPqWKTVAvNTaGElVvPNGSIGFRWGEKGKWNLEsi
--
aaG
ETELsLtLLGqhDavAgVaFPYFGGienpHFrkVklepVLvrqLPVKnLtLadGsTcpVvsVYDLtLANYGLdRGLnDeNsAkdYaeiKpYTPAWgEQITGVpRqyIerIAREFADtAhKTHGRSMIIlGAGvNHWYHmDMNYRGmINMLIFCGCVGQSGGGWAHYVGQEKLRPQTGWlPLAFALDWnRPpRqMNSTSfFYNHSSQWRYEkVtAqELLSPlADaSkYsGHLIDFNVRAERMGWLPSAPQLGrNPLtIk
EAdKAGlsPvefTvqaLKsGslRfAcEQPdnGsNHPRNLFvWRSNLLGSSGKGHEyMqKYLLGTEsGIQGkdLGqqgGiKPEEVeWQtaaiEGKLDLlVTLDFRmSSTCLfSDIvLPTATWYEKDDMNTSDMHPFIHPLSAAVDPAWEsrSDWEIYKgIAKaFSqVCVGHLGKETDiVlqPllHDSpAELsQPceVlDWrKGECDLiPGKTAPnIvaVERDYPATYERFTSlGPLMdKlGNGGKGIsWNTQdEiDfLgKLNYTKreGPAkGrPlidTAIDAsEvILtLAPETNGhVAVKAWqALgEiTGReHTHLALhKEDEKIRFRDIQAQPRKIISSPTWSGLEsdHVSYNAGYTNVHELIPWRTLSGRQQLYQDHpWMRaFGESLvaYRPPIDTRSVsEm
qikpNGfPEKALNFLTPHQKWGIHSTYSeNLLMLTLsRGGPiVWlSEtDAkeLtIvDNDWvEVFNaNGALTARAVVSQRVPpGMTMMYHAQERImNiPGSEvTgmRGGIHNSVTRicPKPTHMIGGYAqLAwGFNYYGTVGSNRDEFimiRKMKNvnWLDdEGrDQVQEakK
Consen2
Secondary consensus
l
d
llnt
g
h
t
mm
mkm
kal
e
ie
aq
k
f
a
v
s
p
v
e
s
y
n
s
a
i
m
v
v
a
i
l
a
m
l
m
r
d
td
v
t
e
-
y
aa
a
a
qe
e
f
n
m
a
qrdgkt
q
s
sq
ei
e
g
dgte
ns
ehnn
lhk
r
q
vg
n
al
tt
v
e
e
v
c
ts
ddv
a
a
s
sq
it
n
d
v
l
l
l
q
q
a
h
y
t
s
e
m
k
r
t
t
g
a
e
mn
tdy
aks
e
di
m
a
es
k
i
f
l
h
ee
asd
v
d
dngl
v
l
y
i
ak
a
k
e
v
tl
iq
a
a
ld
k
k
s
h
mv
i
e
i
a
s
m
l
r
ad
q
q
mln
a
m
a
q
a
s
f
d
n
de
q
d
lv
k
v
gqps
n
d
g
v
i
a
rd
g
a
i
s
a
v
l
i
qq
vt
h
y
vvv
id
g
n
sv
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character