fig|1040638.4.peg.2282
Escherichia coli O104:H4 str. LB226692
MKRHLNTSYRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
D
NTGGQ
Q
I
AR
GGTA
R
NTTVT
A
NG
L
Q
D
V
MA
GG
S
A
T
DTVI
SA
GGGQ
N
L
R
G
Q
A
YG
T
V
L
-
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
Q
S
GYQ
T
I
K
H
G
G
Q
ATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQV
I
K
E
GG
T
T
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSV
L
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
T
LTLSGKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQK
T
VNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
Q
VKNLNGQNGTISLRV
C
PDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDD
X
QDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDS
T
KH
R
VSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
X
NGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYN
S
QATLNVTF
fig|216592.1.peg.2454
Escherichia coli 042 (3-1041/1041)
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
D
NTGGQ
Q
I
AR
GGTA
R
NTTVT
A
NG
L
Q
D
V
MA
GG
S
A
T
DTVI
SA
GGGQ
N
L
R
G
Q
A
YG
T
V
L
-
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
Q
S
GYQ
T
I
K
H
G
G
Q
ATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQV
I
K
E
GG
T
T
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSV
L
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
T
LTLSGKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQK
T
VNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARGWGWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYN
S
QATLNVTF
fig|585055.6.peg.3393
Escherichia coli 55989
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
D
NTGGQ
Q
I
AR
GGTA
R
NTTVT
A
NG
L
Q
D
V
MA
GG
S
A
T
DTVI
SA
GGGQ
N
L
R
G
Q
A
YG
T
V
L
-
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
Q
S
GYQ
T
I
K
H
G
G
Q
ATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQV
I
K
E
GG
T
T
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSV
L
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARGWGWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYN
S
QATLNVTF
fig|585055.8.peg.3397
Escherichia coli 55989
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
D
NTGGQ
Q
I
AR
GGTA
R
NTTVT
A
NG
L
Q
D
V
MA
GG
S
A
T
DTVI
SA
GGGQ
N
L
R
G
Q
A
YG
T
V
L
-
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
Q
S
GYQ
T
I
K
H
G
G
Q
ATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQV
I
K
E
GG
T
T
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSV
L
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARGWGWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYN
S
QATLNVTF
fig|585056.7.peg.3580
Escherichia coli UMN026
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
D
NTGGQ
Q
I
AR
GGTA
R
NTTVT
A
NG
L
Q
D
V
MA
GG
S
A
T
DTVI
SA
GGGQ
N
L
R
G
Q
A
YG
T
V
L
-
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
Q
S
GYQ
T
I
K
H
G
G
Q
ATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNS
A
RLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSL
V
LQAGLEARVRENITLGVQAGYAHS
IN
GSSAE
S
YN
S
QATLNVTF
fig|216592.3.peg.4683
Escherichia coli 042
M
VASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
D
NTGGQ
Q
I
AR
GGTA
R
NTTVT
A
NG
L
Q
D
V
MA
GG
S
A
T
DTVI
SA
GGGQ
N
L
R
G
Q
A
YG
T
V
L
-
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
Q
S
GYQ
T
I
K
H
G
G
Q
ATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQV
I
K
E
GG
T
T
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSV
L
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
T
LTLSGKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQK
T
VNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARGWGWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYN
S
QATLNVTF
fig|340197.3.peg.1382
Escherichia coli F11 (3-1042/1042)
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
TGQ
M
V
G
GTAESTTIN
N
NGRQ
V
I
WS
SG
V
S
RDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVN
V
K
NNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSH
S
DMTFGEGTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|199310.1.peg.1230
Escherichia coli CFT073 (53-1091/1091)
MKRHLNTSYRLVWNHITG
AF
VVASELAR
A
RGKRAGVAVALSLAA
A
TS
L
PALAAD
S
VV
P
AGETVNGGTL
I
NHD
R
Q
F
V
S
GTA
D
GMT
V
STGLELG
A
DS
D
N
NTGGQ
Q
I
AR
GGTA
R
NT
R
VT
A
NG
L
Q
D
V
MA
GG
S
T
SDTVI
S
T
GGGQ
N
L
R
G
K
A
S
G
T
V
L
-
N
G
G
D
QW
I
H
A
GG
R
A
S
GT
V
IN
Q
DGYQ
T
I
K
H
G
G
L
V
TGTIVNTGAEGGPDSEN
VS
TGQ
M
V
G
G
I
AESTTINKNGRQ
V
I
WS
SG
I
ARDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
R
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTAT
R
T
L
VDDGGTLDVRNGG
T
AT
A
VSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
T
F
TL
A
GKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTD
I
IA
H
RGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
A
RTGKFVP
T
T
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHS
IN
GSSAEGYN
S
QATLNVTF
fig|362663.8.peg.328
Escherichia coli 536
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
TGQ
M
V
G
GTAESTTIN
N
NGRQ
V
I
WS
SG
V
S
RDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVN
V
K
NNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSH
S
DMTFGEGTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|362663.9.peg.327
Escherichia coli 536
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
TGQ
M
V
G
GTAESTTIN
N
NGRQ
V
I
WS
SG
V
S
RDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVN
V
K
NNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSH
S
DMTFGEGTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|340197.5.peg.1463
Escherichia coli F11
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
TGQ
M
V
G
GTAESTTIN
N
NGRQ
V
I
WS
SG
V
S
RDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVN
V
K
NNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSH
S
DMTFGEGTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|655817.3.peg.3471
Escherichia coli ABU 83972
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
C
ETVNGGTL
V
NHDNQ
F
V
S
GTA
D
G
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
NG
G
T
A
T
E
T
L
IN
R
D
GWQV
I
K
E
GG
T
A
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLD
I
RNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
T
LTLSGKTVNNDTLTIREGDALLQGG
A
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
A
RTGKFVP
T
T
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGI
V
RGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARG
R
GW
Q
GSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDS
T
KH
G
VSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEARVRENITLGVQAGYAHSVSG
N
SAEGYNGQATLNVTF
fig|585397.7.peg.5145
Escherichia coli ED1a
MKRHLNT
C
YRLVWNHITG
AF
VV
V
SELAR
T
RGKR
G
GVAVALSLAAVTS
L
P
V
L
S
AD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTANG
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
E
G
K
ATGTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
TGQ
M
V
G
GTAESTTIN
N
NGRQ
V
I
WS
SG
V
S
RDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGL
L
TARGGSLAGTTTLNNGA
T
LTLSGKTVNNDTLTIREGDALLQGG
T
LTGNG
R
VEKSGSGTLTVSNTTLTQK
T
VNLNEGTLTLNDSTVTTDVIAQRGT
T
LKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
A
RTGKFVP
T
T
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHS
IN
GSSAEGYN
S
QATLNVTF
fig|585397.9.peg.5146
Escherichia coli ED1a
MKRHLNT
C
YRLVWNHITG
AF
VV
V
SELAR
T
RGKR
G
GVAVALSLAAVTS
L
P
V
L
S
AD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTANG
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
E
G
K
ATGTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
TGQ
M
V
G
GTAESTTIN
N
NGRQ
V
I
WS
SG
V
S
RDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
K
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGL
L
TARGGSLAGTTTLNNGA
T
LTLSGKTVNNDTLTIREGDALLQGG
T
LTGNG
R
VEKSGSGTLTVSNTTLTQK
T
VNLNEGTLTLNDSTVTTDVIAQRGT
T
LKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
A
RTGKFVP
T
T
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHS
IN
GSSAEGYN
S
QATLNVTF
fig|199310.1.peg.3573
Escherichia coli CFT073 (3-1042/1042)
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVT
P
L
P
V
L
S
AD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTANG
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
NG
G
T
A
T
E
T
L
IN
R
D
GWQV
I
K
E
GG
T
A
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLD
I
RNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
S
D
ATWNIPDNATVQSVVDDLSHAGQIHFTS
S
RTG
T
FVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRT
D
VAGMS
V
T
A
G
I
YGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|199310.4.peg.3441
Escherichia coli CFT073
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVT
P
L
P
V
L
S
AD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTANG
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
NG
G
T
A
T
E
T
L
IN
R
D
GWQV
I
K
E
GG
T
A
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLD
I
RNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
S
D
ATWNIPDNATVQSVVDDLSHAGQIHFTS
S
RTG
T
FVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRT
D
VAGMS
V
T
A
G
I
YGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|749531.3.peg.3947
Escherichia coli MS 69-1 (14-1043/1043)
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVT
P
L
P
V
L
S
AD
I
VV
HP
GETVNGGTL
V
NHDNQ
F
V
S
GTANG
V
T
V
STGLELGPDS
D
E
NTGGQWI
KA
GGT
G
R
NTTVT
A
NGRQ
I
V
QA
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
D
NRGEQWVH
G
GG
K
A
A
GTIIN
Q
DGYQ
T
I
K
H
G
G
LATGTIVNTGAEGGP
E
SEN
VS
S
GQ
M
V
G
GTAESTTINKNGRQ
V
I
WS
SG
M
ARDTLIYAGGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
NG
G
T
A
T
E
T
L
IN
R
D
GWQV
I
K
E
GG
T
A
AH
TTINQ
K
G
K
LQV
N
AGG
K
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
A
GKADNVVLENGGRLDVL
S
GHTATNTRVDDGGTLDVRNGG
A
ATTVSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAID
S
TNVTL
A
S
D
ATWNIPDNATVQSVVDDLSHAGQIHFTS
S
RTG
T
FVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRT
D
VAGMS
V
T
A
G
I
YGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASS
G
NNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGY
fig|655817.3.peg.1298
Escherichia coli ABU 83972
MKRHLNTSYRLVWNHITG
AF
VVASELAR
A
RGKRAGVAVALSLAA
A
TS
L
PALAAD
S
VV
P
AGETVNGGTL
I
NHD
R
Q
F
V
S
GTA
D
GMT
V
STGLELG
A
DS
D
N
NTGGQ
Q
I
AR
GGTA
R
NT
R
VT
A
NG
L
Q
D
V
MA
GG
S
T
SDTVI
S
T
GGGQ
N
L
R
G
K
A
S
G
T
V
L
-
N
G
G
D
QW
I
H
A
GG
R
A
S
GT
V
IN
Q
DGYQ
T
I
K
H
G
G
L
V
TGTIVNTGAEGGPDSEN
VS
TGQ
M
V
G
G
I
AESTTINKNGRQ
V
I
WS
SG
I
ARDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
R
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTAT
R
T
L
VDDGGTLDVRNGG
T
AT
A
VSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
T
F
TL
A
GKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTD
I
IA
H
RGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
A
RTGKFVP
T
T
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHS
IN
GSSAEGYN
S
QATLNVTF
fig|199310.4.peg.1208
Escherichia coli CFT073
MKRHLNTSYRLVWNHITG
AF
VVASELAR
A
RGKRAGVAVALSLAA
A
TS
L
PALAAD
S
VV
P
AGETVNGGTL
I
NHD
R
Q
F
V
S
GTA
D
GMT
V
STGLELG
A
DS
D
N
NTGGQ
Q
I
AR
GGTA
R
NT
R
VT
A
NG
L
Q
D
V
MA
GG
S
T
SDTVI
S
T
GGGQ
N
L
R
G
K
A
S
G
T
V
L
-
N
G
G
D
QW
I
H
A
GG
R
A
S
GT
V
IN
Q
DGYQ
T
I
K
H
G
G
L
V
TGTIVNTGAEGGPDSEN
VS
TGQ
M
V
G
G
I
AESTTINKNGRQ
V
I
WS
SG
I
ARDTLIY
T
GGDQTVHG
E
A
H
NT
R
L
E
GG
N
QYVH
KY
GLAL
N
TVINEGGWQVVK
A
GG
T
AGNTTINQNGEL
R
VHAGG
E
A
S
D
VTQNTGGALVTSTAATVTGTNRLGAFSVV
E
GKADNVVLENGGRLDVL
S
GHTAT
R
T
L
VDDGGTLDVRNGG
T
AT
A
VSMGNGGVLLADSGAAVSGTRSDG
T
AF
R
IGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGGSLAGTTTLNNGA
T
F
TL
A
GKTVNNDTLTIREGDALLQGG
A
LTGNG
R
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTD
I
IA
H
RGTALKLTGSTVLNGAIDPTNVTL
T
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
A
RTGKFVP
T
T
---
L
Q
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
GT
GLAT
T
GKGIQVVEAINGATTEEGAF
V
QGN
M
LQAGAFNYTLNRDSDESWYLRSE
ER
YRAEVPLYASMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
M
NL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARG
R
GWLGSLETGLPFSITDNLMLEP
R
LQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHS
IN
GSSAEGYN
S
QATLNVTF
fig|216592.1.peg.2790
Escherichia coli 042 (53-1091/1091)
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGV
S
GENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQL
H
YTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDS
T
KH
G
VSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|316385.5.peg.2115
Escherichia coli str. K-12 substr. DH10B (53-1091/1091)
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|216592.3.peg.2339
Escherichia coli 042
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGV
S
GENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQL
H
YTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDS
T
KH
G
VSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|753642.3.peg.40
Escherichia coli NC101
MKRHLNTSYRLVWNHITGTLVVASELARSRGK
G
AGVAVALSLAAVTSVPALAAD
S
I
VQAGETVNGGTL
E
NHDNQIV
L
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
I
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
KD
GLAL
N
TVINEGGWQVVK
A
GG
A
V
GNTT
V
NQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFT
V
RGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVA
L
GTRHSMKAS
T
DNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|595496.3.peg.1966
Escherichia coli BW2952
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|316407.3.peg.1935
Escherichia coli W3110
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|316385.7.peg.2163
Escherichia coli str. K-12 substr. DH10B
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|511145.12.peg.2076
Escherichia coli str. K-12 substr. MG1655
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|511145.6.peg.2060
Escherichia coli str. K-12 substr. MG1655
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|83333.1.peg.1974
Escherichia coli K12
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAG
C
LGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|679204.3.peg.4331
Escherichia coli MS 145-7
MKRHLNT
C
YRLVWNHITG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RT
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVA
L
GTRHSMKAS
T
DNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|573235.3.peg.2962
Escherichia coli O26:H11 str. 11368
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RT
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVA
L
GTRHSMKAS
T
DNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|679207.4.peg.3519
Escherichia coli MS 107-1
MKRHLNT
C
YRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RT
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNN
T
DRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVA
L
GTRHSMKAS
T
DNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|573235.3.peg.5674
Escherichia coli O26:H11 str. 11368
MKRHLNTSYRLVWNH
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RT
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVA
L
GTRHSMKAS
T
DNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|749540.3.peg.3461
Escherichia coli MS 146-1
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDM
S
FGEGTSSRDTLRDSAKH
R
V
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|585056.7.peg.5046
Escherichia coli UMN026
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
T
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDM
S
FGEGTSSRDTLRDSAKH
R
V
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|656419.3.peg.8
Escherichia coli M718
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNN
N
FRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGS
T
QHVRAGFRLGSHNDMTFGEGTSSRDTLRDS
T
KH
R
VSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|656443.3.peg.2194
Escherichia coli TA271
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAAD
T
VVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GGTA
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
D
G
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGV
S
GENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFG
K
GTSSRDTLR
G
SAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGY
V
HSVSGSSAEGYNGQATLNVTF
fig|670897.3.peg.1699
Escherichia coli 2362-75
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGV
S
GENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVAQGTRHSMKASSDNNDFR
V
RGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSH
H
DM
N
FG
K
GTSSRDTLR
G
SAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGY
V
HSVSGSSAEGYNGQATLNVTF
fig|340184.3.peg.2219
Escherichia coli B7A
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGV
S
GENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVAQGTRHSMKASSDNNDFR
V
RGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNA
S
YVKFGHGSAQHVRAGFRLGSH
H
DM
N
FG
K
GTSSRDTLR
G
SAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGY
V
HSVSGSSAEGYNGQATLNVTF
fig|340184.6.peg.2331
Escherichia coli B7A
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
SDTVI
SA
GGGQSL
Q
G
Q
AVNTTL
-
N
G
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
I
K
S
G
A
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
T
V
Y
G
D
A
VR
TTINKNGRQI
V
AA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
G
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
I
QGN
K
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRILAGSRSHQTGV
S
GENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
I
H
NA
SGLWADIVAQGTRHSMKASSDNNDFR
V
RGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNA
S
YVKFGHGSAQHVRAGFRLGSH
H
DM
N
FG
K
GTSSRDTLR
G
SAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGY
V
HSVSGSSAEGYNGQATLNVTF
fig|536056.3.peg.1746
Escherichia coli DH1
M
TG
AF
VVASELAR
A
RGKR
G
GVAVALSLAAVTS
L
P
V
LAAD
I
VV
HP
GETVNGGTL
A
NHDNQIV
F
GT
T
NGMTISTGLE
Y
GPD
N
E
A
NTGGQW
V
QD
GGTA
N
K
TTVT
S
G
G
L
Q
R
V
NP
GG
SV
SDTVI
SA
GGGQSL
Q
G
R
AVNTTL
-
N
G
GEQW
M
H
E
G
A
I
ATGT
V
IN
D
K
G
W
Q
V
VK
P
G
T
V
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
D
A
VR
TTINKNGRQI
V
RA
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
N
GG
V
AGNTT
V
NQ
K
G
R
LQV
D
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
I
NRLGAFSVV
E
GKADNVVLENGGRLDVL
T
GHTATNTRVDDGGTLDVRNGG
T
ATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGG
--
GQADALMLEKGSSFTLNAGDTATDTTV
--
NGGLFTARGG
T
LAGTTTLNNGA
I
LTLSGKTVNNDTLTIREGDALLQGG
S
LTGNG
S
VEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL
A
SGATWNIPDNATVQSVVDDLSHAGQIHFTS
T
RTGKFVPAT
---
L
K
VKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNS
AS
GLATSGKGIQVVEAINGATTEEGAF
V
QGN
R
LQAGAFNY
S
LNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRI
V
AGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDL
M
RTEVAGMS
V
T
A
GVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDG
K
DNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSR
AP
LRDSAKHSVSELPVNWWVQPSVIRTFSSRGDM
RV
GT
ST
AGS
G
MTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
fig|562.371.peg.730
Escherichia coli 1044A
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|562.373.peg.4700
Escherichia coli 1125A
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|562.372.peg.469
Escherichia coli 1212A
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|562.374.peg.1696
Escherichia coli 536A
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|340186.5.peg.2781
Escherichia coli E110019
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|83334.1.peg.1445
Escherichia coli O157:H7
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|155864.8.peg.1079
Escherichia coli O157:H7 EDL933
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|155864.8.peg.5585
Escherichia coli O157:H7 EDL933
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444454.5.peg.5882
Escherichia coli O157:H7 str. EC4024
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444449.5.peg.4736
Escherichia coli O157:H7 str. EC4042
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444448.5.peg.4004
Escherichia coli O157:H7 str. EC4045
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444453.5.peg.2620
Escherichia coli O157:H7 str. EC4076
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444452.5.peg.373
Escherichia coli O157:H7 str. EC4113
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444450.8.peg.1464
Escherichia coli O157:H7 str. EC4115
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444451.5.peg.1517
Escherichia coli O157:H7 str. EC4196
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|444447.5.peg.4238
Escherichia coli O157:H7 str. EC4206
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|478004.5.peg.1303
Escherichia coli O157:H7 str. EC4401
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|478006.5.peg.1194
Escherichia coli O157:H7 str. EC4501
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|478007.5.peg.2034
Escherichia coli O157:H7 str. EC508
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|478008.5.peg.4025
Escherichia coli O157:H7 str. EC869
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|637388.3.peg.4455
Escherichia coli O157:H7 str. FRIK2000
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|570506.3.peg.540
Escherichia coli O157:H7 str. FRIK966
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|386585.9.peg.1494
Escherichia coli O157:H7 str. Sakai
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|544404.4.peg.1328
Escherichia coli O157:H7 str. TW14359
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|502346.5.peg.2893
Escherichia coli O157:H7 str. TW14588
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|573235.3.peg.1379
Escherichia coli O26:H11 str. 11368
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVN
D
GTL
T
NHDNQIV
F
GTANGMTISTGLELGPDSE
E
NTGGQWI
QN
GG
I
A
G
NTTVT
T
NGRQ
V
V
LE
GGTASDTVIRDGGGQSL
N
G
L
AVNTTL
N
NRGEQWVH
E
GG
V
ATGTIIN
R
DGYQ
S
VK
S
G
G
LATGTI
I
NTGAEGGPDS
D
N
SY
TGQ
K
V
Q
GTAESTTINKNGRQII
LF
SG
L
ARDTLIYAGGDQ
S
VHG
R
ALNTTLNGGYQYVH
RD
GLAL
N
TVINEGGWQVVK
A
GG
A
AGNTTINQNGEL
R
VHAGG
E
AT
A
VTQNTGGALVTSTAATV
I
GTNRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGGTL
A
V
SA
GG
K
AT
S
V
TITS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
TS
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
P
NA
A
LSRA
V
AKSN
S
---------
-
------
P
V
T
FHK
L
T
TT
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
AS
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
AT
T
G
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEAR
I
RENITLGVQAGYAHSVSGSSAEGYNGQATLN
M
TF
fig|679207.4.peg.1630
Escherichia coli MS 107-1
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVA
I
ALSLAAVTSVPALAAD
T
VVQAGETV
S
GGTL
T
NHDNQIV
L
GTANGMTISTGLE
Y
GPD
N
E
A
NTGGQWI
QN
GG
I
A
N
NTTVT
G
G
G
L
Q
R
V
NA
GG
SV
S
N
TVI
SA
GGGQSL
Q
G
Q
AVNTTL
N
G-
GEQWVH
E
GG
I
ATGT
V
IN
E
K
G
W
Q
A
VK
S
G
A
M
AT
D
T
V
VNTGAEGGPD
A
EN
GD
TGQ
F
V
R
G
N
A
VR
TTINKNGRQI
V
AV
E
G
T
A
NT
T
VV
YAGGDQTVHG
H
AL
D
TTLNGGYQYVH
NG
G
T
A
S
D
TV
V
N
SD
GWQ
I
VK
E
GG
L
A
DF
TT
V
NQ
K
G
K
LQV
N
AGG
T
AT
N
VT
LKQ
GGALVTSTAATVTG
S
NRLG
N
F
T
V
E
N
GKAD
G
VVLE
S
GGRLDVL
E
S
H
S
A
Q
NT
L
VDDGG
I
L
V
V
SA
GG
K
AT
D
V
T
M
TS
GG
A
L
I
ADSGA
T
V
E
GT
NAS
GK
-
FSI
D
G
I
S
GQA
SG
L
L
LE
N
G
G
SFT
V
NAG
GQ
A
GN
TTV
GHR
G
T
L
TL
A
A
GG
N
L
S
G
R
T
Q
L
SK
GA
S
MV
L
N
G
--------------------
-
-----
-
-------------------------------------
DV
VS
--------
TG
DI
V
NA
G
E
I
RFD
N
Q
T
-
-
------
T
PD
A
A
LSRA
V
AK
GD
S
---------
-
------
P
V
T
FHK
L
T
T
S
NL
T
GQ
G
GTI
NM
RVR
L
D
-
GS
N
T
S
D
Q
LVI
N
GG
Q
ATGKT
W
L
AFT
N
V
GNS
NL
G
V
ATSG
Q
GI
R
VV
D
A
Q
NGATTEEGAF
A
LSR
P
LQAGAFNYTLNRDSDE
D
WYLRSENAYRAEVPLY
T
SMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNL
V
HTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDM
N
FG
K
GTSSRDTLRDSAKHSV
R
ELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPS
R
NGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
Consen1
Primary consensus
MKRHLNTsYRLVWNHiTGtlVVASELARsRGKRaGVAVALSLAAVTSvPaLAADkVVqaGETVNgGTL
NHDNQiV
GTanGmTiSTGLElGPDse
NTGGQwi
GGta
nTTVT
nGrQ
V
GGtaSDTVIrdGGGQsL
G
AvnTtL
NrGEQWvH
Gg
AtGTiIN
dGyQ
vK
G
lATgTivNTGAEGGPdseN
tGQ
V
GtAesTTINKNGRQii
sG
ArdTliYaGGDQtVHG
AlnTtLnGGyQYVH
GlAl
TViNegGWQvvK
GG
AgnTTiNQnGeLqVhAGG
At
VTqntGGALVTSTAATVtGtNRLGaFsVv
GKADnVVLEnGGRLDVL
gHtAtNTrVDDGGTLdVrnGG
ATtVsmgnGGvLlADSGAaVsGTrsdGkaFsIgG
--
GQAdaLmLEkGsSFTlNAGdtAtdTTV
--
nGgLftArGGsLaGtTtLnnGA
ltLsGktvnndtltiregdallqgg
ltgng
veksgsgtltvsnttltqkavnlnegtltlndstvttDViaqrgtalklTGstVlnGaIdptNvTl
sgatwniPdnAtvqsVvddlShagqihfts
rtgkfvPaT
---
L
vkNLnGQnGTIslRVRpDmaqNnaDrLVIdGGrATGKTiLnlvNaGNS
GlATsGkGIqVVeAiNGATTEEGAF
qgn
LQAGAFNYtLNRDSDEsWYLRSEnaYRAEVPLYaSMLTQAMDYDRIlAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLlRTEVAGMSlTtGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYlNL
HtsSGLWADIVAQGTRHSMKASSdNNDFRARGwGWLGSLETGLPFSITDNLMLEPqLQYTWQGLSLDDGqDNAGYVKFGHGSAQHVRAGFRLGSHnDMTFGEGTSSRdtLRDSAKHsVsELPVNWWVQPSVIRTFSSRGDMsmGTaaAGSnMTFSPSqNGTSLDLQAGLEARvRENITLGVQAGYAHSVSGSSAEGYNgQATLNvTF
Consen2
Secondary consensus
c
m
af
a
g
l
v
i
hp
d
f
td
v
v
y
nd
qv
ig
k
g
l
sv
sa
n
yg
v
g
a
v
k
w
i
d
vi
ead
s
d
vr
vv
e
nt
vv
t
s
hd
r
e
n
t
s
v
sd
ii
ah
v
k
s
lkq
i
i
n
t
e
g
s
s
s
q
l
a
sa
s
tits
a
i
t
e
nas
t
-
r
d
ts
sg
l
n
g
v
gq
gn
ghr
t
tl
a
t
s
r
q
sk
mv
n
--------------------
-----
-------------------------------------
vs
--------
di
na
e
rfd
q
-
------
t
na
lsra
aksn
---------
------
v
fhk
tt
t
g
nm
l
-
gs
as
q
n
q
w
aft
v
v
t
q
r
d
q
lsr
s
d
er
t
v
m
v
a
m
na
g
r
r
k
s
ap
r
r
rv
st
g
r
i
s
m
Legend: Consensus 1 (when a gap), Conservative difference, Consensus 2 (when a gap), Nonconservative diff., Other character