fig|1040638.4.peg.5524
Escherichia coli O104:H4 str. LB226692
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPG
X
RI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|585055.6.peg.239
Escherichia coli 55989
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|585055.8.peg.239
Escherichia coli 55989
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|585396.4.peg.244
Escherichia coli O111:H- str. 11128
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|340184.3.peg.4752
Escherichia coli B7A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|340184.6.peg.4976
Escherichia coli B7A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|679204.3.peg.5161
Escherichia coli MS 145-7
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|344610.3.peg.2571
Escherichia coli 53638
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|344610.7.peg.1445
Escherichia coli 53638
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|344601.3.peg.1934
Escherichia coli B171
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|344601.5.peg.2015
Escherichia coli B171
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|340185.3.peg.1518
Escherichia coli E22
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|340185.4.peg.1605
Escherichia coli E22
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|585034.4.peg.238
Escherichia coli IAI1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|585034.5.peg.238
Escherichia coli IAI1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|585395.4.peg.239
Escherichia coli O103:H2 str. 12009
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|340186.3.peg.755
Escherichia coli E110019
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQ
I
DGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVR
L
KFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|340186.5.peg.782
Escherichia coli E110019
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQ
I
DGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVR
L
KFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|331112.3.peg.234
Escherichia coli HS
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSP
M
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|331112.6.peg.239
Escherichia coli HS
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSP
M
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|679206.4.peg.3199
Escherichia coli MS 119-7
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
S
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|331111.12.peg.570
Escherichia coli E24377A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FS
D
PHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|331111.3.peg.2806
Escherichia coli E24377A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FS
D
PHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|409438.11.peg.354
Escherichia coli SE11
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|595495.4.peg.4417
Escherichia coli KO11
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|566546.3.peg.4452
Escherichia coli W
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|566546.4.peg.233
Escherichia coli W
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|679207.4.peg.4763
Escherichia coli MS 107-1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQA
I
PGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|679205.4.peg.4255
Escherichia coli MS 124-1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKG
M
VTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
L
S
ESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|670888.3.peg.823
Escherichia coli 1827-70
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
Q
E
SSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
IQNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|701177.3.peg.241
Escherichia coli O55:H7 str. CB9615
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|562.373.peg.2699
Escherichia coli 1125A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|562.372.peg.2870
Escherichia coli 1212A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|562.374.peg.1330
Escherichia coli 536A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|155864.1.peg.236
Escherichia coli O157:H7 EDL933
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|155864.8.peg.238
Escherichia coli O157:H7 EDL933
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444454.5.peg.4699
Escherichia coli O157:H7 str. EC4024
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444449.5.peg.4154
Escherichia coli O157:H7 str. EC4042
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444448.5.peg.2909
Escherichia coli O157:H7 str. EC4045
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444453.5.peg.4271
Escherichia coli O157:H7 str. EC4076
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444452.5.peg.3165
Escherichia coli O157:H7 str. EC4113
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444450.8.peg.378
Escherichia coli O157:H7 str. EC4115
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444451.5.peg.3678
Escherichia coli O157:H7 str. EC4196
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|444447.5.peg.3085
Escherichia coli O157:H7 str. EC4206
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|478004.5.peg.3910
Escherichia coli O157:H7 str. EC4401
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|478005.5.peg.3842
Escherichia coli O157:H7 str. EC4486
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|478007.5.peg.3044
Escherichia coli O157:H7 str. EC508
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|478008.5.peg.4738
Escherichia coli O157:H7 str. EC869
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|637388.3.peg.781
Escherichia coli O157:H7 str. FRIK2000
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|570506.3.peg.1714
Escherichia coli O157:H7 str. FRIK966
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|544404.4.peg.240
Escherichia coli O157:H7 str. TW14359
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|562.371.peg.802
Escherichia coli 1044A
MSTGLRFTLEVDGLPPDAFAVV
F
FHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|83334.1.peg.323
Escherichia coli O157:H7
MSTGLRFTLEVDGLPPDAFAVV
F
FHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|478006.5.peg.3845
Escherichia coli O157:H7 str. EC4501
MSTGLRFTLEVDGLPPDAFAVV
F
FHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|386585.9.peg.338
Escherichia coli O157:H7 str. Sakai
MSTGLRFTLEVDGLPPDAFAVV
F
FHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|502346.5.peg.1025
Escherichia coli O157:H7 str. TW14588
MSTGLRFTLEVDGLPPDAFAVV
F
FHL
N
QSLSSLFSL
A
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
A
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
LY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAE
K
IGNNQ
A
ITVTN
N
Q
ILNI
G
V
------
NQIQTVGVNQ
V
ETVGSNQII
K
VGS
N
QVE
K
VG
II
RALTVGVAYQTTVGGIMNTSVAL
L
QSSQ
V
GLHKSL
M
VG
M
GY
S
V
N
VGNNVTF
S
VGKT
M
K
EN
TGQTA
V
YSAGEHLELCCGKARLVLTKDG
S
IFL
-
NGT
H
IHL
E
----------
G
ES
DVNGD
A
P
V
INWNCGA
T
QPV
P
D
A
P
-------------------
VP
K
D
L
P
PGM
PDMR
QF
fig|562.375.peg.4345
Escherichia coli EC4100B
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|749548.3.peg.5104
Escherichia coli MS 196-1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
Q
QILDKMAYLTI
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQ
K
LYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SILGTIL
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|585034.4.peg.1441
Escherichia coli IAI1
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIP
X
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|316401.4.peg.1754
Escherichia coli ETEC H10407
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|679205.4.peg.4675
Escherichia coli MS 124-1
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|566546.4.peg.1574
Escherichia coli W
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|344610.3.peg.1045
Escherichia coli 53638
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
V
G
VN
KSL
L
VG
K
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|344610.7.peg.5119
Escherichia coli 53638
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
V
G
VN
KSL
L
VG
K
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|340186.3.peg.188
Escherichia coli E110019
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRV
N
GVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGW
P
GRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
S
G
S
GTTL
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|340186.5.peg.202
Escherichia coli E110019
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRV
N
GVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGW
P
GRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
S
G
S
GTTL
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|481805.3.peg.2363
Escherichia coli ATCC 8739
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|481805.6.peg.2354
Escherichia coli ATCC 8739
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|701177.3.peg.1805
Escherichia coli O55:H7 str. CB9615
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|595495.4.peg.4816
Escherichia coli KO11
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|566546.3.peg.3
Escherichia coli W
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
T
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|344601.3.peg.227
Escherichia coli B171
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|344601.5.peg.224
Escherichia coli B171
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|562.371.peg.1755
Escherichia coli 1044A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|562.373.peg.5100
Escherichia coli 1125A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|562.372.peg.1239
Escherichia coli 1212A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|562.374.peg.2345
Escherichia coli 536A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|83334.1.peg.2090
Escherichia coli O157:H7
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444454.5.peg.965
Escherichia coli O157:H7 str. EC4024
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444449.5.peg.291
Escherichia coli O157:H7 str. EC4042
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444448.5.peg.4647
Escherichia coli O157:H7 str. EC4045
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444453.5.peg.2844
Escherichia coli O157:H7 str. EC4076
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444452.5.peg.1973
Escherichia coli O157:H7 str. EC4113
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444450.8.peg.2109
Escherichia coli O157:H7 str. EC4115
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444451.5.peg.1965
Escherichia coli O157:H7 str. EC4196
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|444447.5.peg.5559
Escherichia coli O157:H7 str. EC4206
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|478004.5.peg.2834
Escherichia coli O157:H7 str. EC4401
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|478005.5.peg.2986
Escherichia coli O157:H7 str. EC4486
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|478006.5.peg.1956
Escherichia coli O157:H7 str. EC4501
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|478007.5.peg.2159
Escherichia coli O157:H7 str. EC508
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|478008.5.peg.3680
Escherichia coli O157:H7 str. EC869
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|637388.3.peg.1480
Escherichia coli O157:H7 str. FRIK2000
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|386585.9.peg.2161
Escherichia coli O157:H7 str. Sakai
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|544404.4.peg.1971
Escherichia coli O157:H7 str. TW14359
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|502346.5.peg.5310
Escherichia coli O157:H7 str. TW14588
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|670888.3.peg.2125
Escherichia coli 1827-70
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVV
V
S
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|409438.11.peg.1683
Escherichia coli SE11
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
Q
L
LSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|573235.3.peg.2091
Escherichia coli O26:H11 str. 11368
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
P
P
-------------------
--
-
D
E
K
QDTPDMREY
fig|340185.3.peg.1090
Escherichia coli E22
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
K
S
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|340185.4.peg.1145
Escherichia coli E22
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
K
S
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|585395.4.peg.1663
Escherichia coli O103:H2 str. 12009
MSTGLRFTLEVDGLPPDAFAVVSFHL
T
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
K
S
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|155864.1.peg.2012
Escherichia coli O157:H7 EDL933 (13-714/714)
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLT
V
--
WQGD
D
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
C
PPLWR
T
GL
----
R
------
QNFRIFQNEDI
E
SIL
A
TIL
K
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Q
KS
--
I
DQSLVLCDTV
RY
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
Y
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
R
G
S
GTTL
X
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQ
X
VIVDFLNGDPDQPIIMGRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSK
N
YKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|331111.12.peg.1920
Escherichia coli E24377A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYE
D
HA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|331111.3.peg.4080
Escherichia coli E24377A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYE
D
HA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|585396.4.peg.1923
Escherichia coli O111:H- str. 11128
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIR
S
SSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|413997.3.peg.1487
Escherichia coli B str. REL606
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
QNFRIF
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|749547.3.peg.1258
Escherichia coli MS 187-1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
QNFRIF
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|749532.3.peg.3640
Escherichia coli MS 78-1
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYE
D
HA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRF
E
QEGQ
H
QDYQRTQYEVYDYPGRF
------
K
S
-
AHGQNFARWQMDGWRNNAE
T
ARG
M
SRSPEIWPGRRI
V
LTGHPQANLNREWQVVAS
E
LHGEQPQAVPGR
Q
G
A
GT
A
L
E
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
AD
QDSSCWIRVAQAWAGTGFG
H
LAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
N
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|340184.3.peg.109
Escherichia coli B7A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|340184.6.peg.113
Escherichia coli B7A
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
R
G
S
GTTL
D
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCWIRVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
H
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVYIH
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
I
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|1040638.4.peg.4200
Escherichia coli O104:H4 str. LB226692
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCW
S
RVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPII
L
GRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVY
N
H
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
M
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|585055.6.peg.1618
Escherichia coli 55989
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCW
S
RVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPII
L
GRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVY
N
H
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
M
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|585055.8.peg.1621
Escherichia coli 55989
MSTGLRFTLEVDGLPPDAFAVVSFHL
N
QSLSSLFSL
D
LSLV
-
SQQFLSLEF
A
Q
V
LDKMAYLTI
--
WQGD
E
VQ
---
RRVKGVVTWFELGENDKNQMLYSMKV
H
PPLWR
A
GL
----
R
------
QNFRIFQNEDI
K
SILGT
M
L
Q
ENGVTEWSPL
---
FSEPHPSREFCVQYGETDYDFLCRMAAEEGIFFYEEHA
Y
KS
--
T
DQSLVLCDTV
RH
LPESFEIPW
-
NPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGRFDQEGQ
H
QDYQRTQYEVYDYPGRF
------
KG
-
AHGQNFARWQMDGWRNNAE
V
ARG
T
SRSPEIWPGRRI
A
LTGHPQANLNREWQVVAS
D
LHGEQPQAVPGR
S
G
S
GTTL
N
NHFAVIP
-
ADR
-
TWRPQPLLKPLVDGPQSAVVTGPAGEEIFC
--
DEHGRVRVKFNWDRYNP
SN
QDSSCW
S
RVAQAWAGTGFGNLAIPRV
-
GQEVIVDFLNGDPDQPII
L
GRTY
---
H
Q
EN
-----------
RTPGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNELKFD
---------------
DATGKEQVY
N
H
----------------------------------
AQKNMNTEVLNNRTTDVINNHAETIGNNQ
M
I
A
VTN
-
-
----
-
-
------
NQIQTVGVNQ
I
ETVGSNQII
K
VGS
V
QVE
T
I
G
LV
RALTVGVAYQTTVGGIMNTSVAL
M
QSSQ
M
GLHKSL
R
VG
L
GY
D
V
K
VGNNVTF
T
VGKT
K
K
DD
TGQTA
I
YSAGEHLELCCGKARLVLTKDG
Q
IFL
-
NGT
K
IHLQ
----------
G
KE
Q
VNGD
S
L
L
INWNC
A
AS
KSP
P
K
TP
-------------------
--
-
D
E
K
QDTPDMREY
fig|362663.8.peg.1472
Escherichia coli 536 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DATG
G
EQVYIH
----------------------------------
AQKNM
D
TEVLNNRTTDV
K
A
D
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|362663.9.peg.1477
Escherichia coli 536 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DATG
G
EQVYIH
----------------------------------
AQKNM
D
TEVLNNRTTDV
K
A
D
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|340197.3.peg.2981
Escherichia coli F11 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DATG
G
EQVYIH
----------------------------------
AQKNM
D
TEVLNNRTTDV
K
A
D
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|340197.5.peg.3114
Escherichia coli F11 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PGSLP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DATG
G
EQVYIH
----------------------------------
AQKNM
D
TEVLNNRTTDV
K
A
D
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|656379.3.peg.3492
Escherichia coli FVEC1302 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
A
V
A
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
S
R
LW
AEEG
L
FF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLY
QM
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
M
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
N
QPQA
LH
G
S
Q
G
K
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQ
N
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRV
Q
F
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
SN
EQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
V
S
GSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|656380.3.peg.2846
Escherichia coli FVEC1412 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
A
V
A
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
S
R
LW
AEEG
L
FF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLY
QM
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
M
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
N
QPQA
LH
G
S
Q
G
K
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQ
N
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRV
Q
F
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
SN
EQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
V
S
GSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|585056.7.peg.1896
Escherichia coli UMN026 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
A
V
A
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
S
R
LW
AEEG
L
FF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLY
QM
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
M
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
N
QPQA
LH
G
S
Q
G
K
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQ
N
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRV
Q
F
H
WDRYNP
AT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
SN
EQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
V
S
GSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|753642.3.peg.1470
Escherichia coli NC101 (5-684/743)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GM
P
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
A
S
T
GAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
V
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
V
V
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
GL
G
Q
TVNV
G
S
K
KEGGHD
Q
KVI
V
ANDR
C
I
TV
RND
Q
M
L
K
V
TN
D
RTV
S
V
S
HD
D
S
L
Y
V
RNDR
W
V
TV
K
G
K
L
E
-----
-
----
-
--
H
R
T
-
-
--
-
--
-
-
-
T
GN
H
I
S
Q
V
E
GK
H
S
L
EV
K
G
DL
A
R
K
I
S
G
A
----------
L
G
M
KV
RD
E
I
V
L
ES
G
G
K
I
T
MK
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
A
P
-------------------
VP
T
L
Q
P
fig|216592.1.peg.1994
Escherichia coli 042 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
A
V
A
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
S
R
LW
AEEG
L
FF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLY
QM
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
M
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
N
QPQA
LH
G
S
Q
G
K
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQ
N
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
SN
EQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
V
S
GSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|216592.3.peg.1645
Escherichia coli 042 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
A
V
A
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
S
R
LW
AEEG
L
FF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLY
QM
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
M
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
N
QPQA
LH
G
S
Q
G
K
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQ
N
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
E
A
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQPIIMGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
SN
EQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
V
S
GSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|655817.3.peg.1774
Escherichia coli ABU 83972 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
M
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
Q
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
R
I
TV
RND
Q
TL
K
V
TN
D
RTV
S
V
S
HD
DG
L
Y
V
RNDR
R
V
TV
K
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
E
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
T
L
Q
P
fig|199310.1.peg.1829
Escherichia coli CFT073 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
M
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
Q
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
R
I
TV
RND
Q
TL
K
V
TN
D
RTV
S
V
S
HD
DG
L
Y
V
RNDR
R
V
TV
K
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
E
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
T
L
Q
P
fig|199310.4.peg.1762
Escherichia coli CFT073 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
M
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
Q
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
R
I
TV
RND
Q
TL
K
V
TN
D
RTV
S
V
S
HD
DG
L
Y
V
RNDR
R
V
TV
K
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
E
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
T
L
Q
P
fig|749528.3.peg.1206
Escherichia coli MS 45-1 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
M
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
Q
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
R
I
TV
RND
Q
TL
K
V
TN
D
RTV
S
V
S
HD
DG
L
Y
V
RNDR
R
V
TV
K
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
E
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
T
L
Q
P
fig|585397.7.peg.1624
Escherichia coli ED1a (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VD
A
A
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
I
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQV
I
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
V
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGG
Q
D
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
S
Q
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|585397.9.peg.1617
Escherichia coli ED1a (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VD
A
A
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
N
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
I
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LTGHPQ
KM
LNREWQV
I
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
V
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGG
Q
D
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
I
S
Q
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|585397.7.peg.241
Escherichia coli ED1a (5-684/743)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HL
C
I
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LT
E
HPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
V
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
V
V
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
GL
G
Q
TVNV
G
S
K
KEGGHD
Q
KVI
V
ANDR
C
I
TV
RND
Q
M
L
K
V
TN
D
RTV
S
V
S
HD
D
S
L
Y
V
RNDR
W
V
TV
K
G
K
L
E
-----
-
----
-
--
H
R
T
-
-
--
-
--
-
-
-
T
GN
H
I
S
Q
V
E
GK
H
S
L
EV
K
G
DL
A
R
K
I
S
G
A
----------
L
G
M
KV
RD
E
I
V
L
ES
G
G
K
I
T
MK
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
A
P
-------------------
VP
T
L
Q
P
fig|585397.9.peg.241
Escherichia coli ED1a (5-684/743)
GLRFTLEVDG
QE
PD
T
FAVV
N
F
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
K
A
T
LTI
--
WQG
V
I
P
Q
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HL
C
I
E
PPLWR
C
GL
----
R
------
QNFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
F
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
R
F
T
LT
E
HPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
R
V
QS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
V
V
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
GL
G
Q
TVNV
G
S
K
KEGGHD
Q
KVI
V
ANDR
C
I
TV
RND
Q
M
L
K
V
TN
D
RTV
S
V
S
HD
D
S
L
Y
V
RNDR
W
V
TV
K
G
K
L
E
-----
-
----
-
--
H
R
T
-
-
--
-
--
-
-
-
T
GN
H
I
S
Q
V
E
GK
H
S
L
EV
K
G
DL
A
R
K
I
S
G
A
----------
L
G
M
KV
RD
E
I
V
L
ES
G
G
K
I
T
MK
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
IN
L
N
S
G
G
S
---
P
G
A
P
-------------------
VP
T
L
Q
P
fig|405955.13.peg.1582
Escherichia coli APEC O1 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|405955.9.peg.1287
Escherichia coli APEC O1 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|714962.3.peg.1649
Escherichia coli IHE3034 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|585035.6.peg.1541
Escherichia coli S88 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|869729.3.peg.2068
Escherichia coli UM146 (5-684/782)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|364106.7.peg.1718
Escherichia coli UTI89 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|364106.8.peg.1720
Escherichia coli UTI89 (5-684/801)
GLRFTLEVDG
QE
PD
T
FAVVSF
R
L
I
Q
NQ
S
YP
F
VM
S
VDVA
-
S
DS
F
MQTA
-
E
M
L
L
E
K
N
A
T
LTI
--
WQG
V
I
P
L
---
R
Y
V
T
GVV
AG
F
GMQ
EN
NGW
QM
R
Y
HLRI
E
PPLWR
C
GL
----
R
------
R
NFRIFQ
QQ
DI
R
T
I
SA
T
L
L
N
ENGVTEW
T
PL
---
F
Y
E
D
HP
A
REFCVQYGE
S
D
LA
FL
A
R
LW
AEEGIFF
F
E
RF
A
A
D
S
--
P
E
Q
K
L
T
LCD
D
V
AG
L
SQAG
E
L
P
F
-
NP
D
T
SAGAE
T
E
C
V
S
M
FRY
E
A
HV
RPSSV
QSQ
DYTFK
V
P
D
W
P
G
MYE
Q
Q
G
E
S
LNG
Q
LE
QYE
IF
DYPGR
Y
------
K
DEQ
HG
KD
F
TLYR
M
ESL
R
SD
AE
K
A
T
G
Q
S
N
SP
KL
WPG
T
W
F
T
LTGHPQ
KM
LNREWQVV
Q
S
I
L
S
G
D
QPQA
LH
G
S
Q
G
R
GTTL
G
N
QLE
VIP
-
ADR
-
TWRP
RLQS
KP
K
VDGPQSA
I
VTGPAGEEIFC
--
DEHGRVRVKF
H
WDRY
HG
MT
EE
SSCW
V
RV
S
QAWAG
P
GFGNLAIPRV
-
GQEVIVDFLNGDPDQP
LV
MGRTY
---
H
E
D
N
-----------
R
S
PG
D
LP
------------------
GTKTQMTI
-----
RSKTYKGSGFNEL
R
F
E
---------------
DAT
D
KEQVYIH
----------------------------------
AQKNM
D
TEVLN
D
RTTDV
KHD
H
T
ETIGN
D
Q
K
ITV
VK
G
Q
TVQV
G
T
RKEGGHD
Q
SI
TV
ANDR
C
I
TV
RND
Q
TL
Q
V
TN
D
RTV
S
V
S
ND
DG
L
Y
V
RNDRKV
TV
E
G
KQE
-----
-
----
-
--
HK
T
-
-
--
-
--
-
-
-
T
GN
H
V
SL
V
E
GK
H
S
L
VV
K
G
DL
A
R
KVS
G
A
----------
L
GIKV
DG
D
I
V
L
ESSS
R
I
S
L
K
VGGSFVVIHS
G
GV
D
IV
G
-
-
P
K
I
S
L
N
S
G
G
S
---
P
G
TP
-------------------
VP
A
L
Q
P
fig|656440.3.peg.2847
Escherichia coli TA206 (13-756/790)
L
KI
R
GL
K-SPVD
V
LT
F
TG
H
E
Q
LSS
P
F
RY
D
I
QFTS
S
D
K
A
IA
P
E
S
V
L
M
Q
D
GAFS
LT
APPV
QG
M
P
VQ
TAL
R
T
L
H
GV
I
T
G
F
K
HLS
S
S
QDE
A
R
Y
E
V
R
L
E
P
---
R
M
A
L
LTRS
R
------
QN
-A
I
Y
QN
Q
T
V
P
Q
I
V
EK
IL
R
E
RHQMRGQDFVFNLKSE
Y
P
A
RE
QV
M
QYGE
D
D
L
T
F
V
S
R
L
L
S
E
V
GI
W
F
--
RF
A
T
D
A
RL
K
I
E
V
I
EFY
D
D
Q
SG
YERGLT
L
P
LR
H
P
S
GLF
D
G
E
T
E
A
V
WGLNT
A
YS
V
VEK
N
V
T
T
R
DY
N
YR
TATAEM
M
T
E
Q
HDA
T
GGDNT
T
YG
E
A
Y
H
Y
ADN
F
LQKGDK
E
AAES
G
AF
Y
AR
I
R
H
E
R
Y
L
N
EQA
I
L
K
G
Q
S
T
S
SL
L
M
PG
LE
I
R
V
Q
G
DDAPA
V
F
R
K
GV
LI
T-
-
-------G
V
TA
S
A
A
R
DR
S
Y
E
LT
F
TA
IP
Y
SE
R
YG
Y
RP
A
L
I
P
R
P
VM
A
G
TLP
A
R
VT
STVKN
D
I
Y
AHI
D
K
D
GR
Y
RV
NL
DF
DR
DTW
KP
GYE
S
L
W
V
R
Q
S
R
P
Y
AG
DT
Y
G
-
L
H
L
P
L
L
A
G
T
EV
S
I
A
F
EE
G
N
PD
R
P
Y
I
A
G
VK
H
DSA
H
T
D
H
VTIQNYKRNVL
RTP
A
N
NKIRLDDERGKEHIKVSTEY
G
G
K
S
Q
L
N
L
GHLVDAG
K
QQ
R
G
E
GF
-
EL
R
T
D
LWGAVRAKKGIFISA
DA
Q
D
K
A
Q
GQ
V
REMADIISELNGLSDKIQKLSDDATTANADPADMA
AQ
IA
L
I
T
SR
I
N
D
L
T
AS
VI
LM
HA
P-----
K
G
V
A
V
A
S
G
E
HLQL
A
A
V
K
----
N
LQ
I
NA
G
N
N
A
D
I
G
V
VK
N
MF
I
G
VG
-
-
---
-
--
--
RAL
S
V
F
V
-----------
-----
-
----
-
--
R
K
A
-
-
--
-
--
-
-
-
-------
-
-
G
IK
L
I
AN
K
G
AV
S
V
Q
A
QH
D
L
M
EL
LAK
K
S
IE
IVS
T
E
D
E
I
R
I
SAKK
K
I
T
I
NG
GGS
YIR
I
EGS
GI
E
---
-
-
-
-
--------
---
P
G
TP
GDYNVKAVHYGRQPKASEK
VP
Consen1
Primary consensus
MSTGLRFTLEVDGlpPDaFAVVsFhL
QslSslFsl
lslv
-
SqqFlslef
qiLdKmAyLTi
--
WQGd
vQ
---
RrVkGvVtwFelgENdknQmlYsmkv
PPLWR
GL
----
R
------
qNFRIFQneDI
sIlgTiL
ENGVTEWsPL
---
FsEpHPsREFCVQYGEtDydFLcRmaAEEGIFFyEehA
kS
--
dQsLvLCDtV
LpesfEiPw
-
NPnTrtevsTlCiSqFRYsAqiRPSSVvtkDYTFKrPgWaGrfdQeGq
qdyQrtQYEvyDYPGRf
------
Kg
-
aHGqnFarwqMdgwRnnAE
ArG
SrSPeiWPGrRi
LTGHPQanLNREWQVVaS
LhGeQPQAvpGr
G
GTtL
NhfaVIP
-
ADR
-
TWRPqpllKPlVDGPQSAvVTGPAGEEIFC
--
DEHGRVRVKFnWDRYnp
qdSSCWiRVaQAWAGtGFGnLAIPRV
-
GQEVIVDFLNGDPDQPiiMGRTY
---
H
eN
-----------
RtPGsLP
------------------
GTKTQMTI
-----
RSKtYKGSGFNELkFd
---------------
DATgKEQVYIH
----------------------------------
AQKNMnTEVLNnRTTDVinnHaEtIGNnQ
ItVtn
q
g
------
nQiqTVgvnq
eTVgsnQii
Vgs
qve
vg
raLtVgvayqtTVgGimntsval
qssq
glHKsl
vg
gy
v
vGNnVtf
vGKt
k
tGqtA
ysaGehlelccgkarLvltkDG
IfL
-
ngt
IhLq
----------
G
dvnGd
p
INwNcgas
P
tP
-------------------
vp
d
pqdtPDMRey
Consen2
Secondary consensus
qe
t
n
r
nq
yp
vm
vdva
ds
mqta
-
m
e
n
t
v
v
p
y
t
m
ag
gmq
ngw
kr
hlri
r
qq
t
sa
t
y
d
a
s
la
a
lw
f
rf
d
e
k
t
d
sqag
l
f
d
sagae
e
v
m
e
hv
qsq
v
d
p
mye
q
e
lng
le
if
y
deq
kd
tlyr
esl
sd
t
n
kl
t
f
km
q
s
d
lh
s
a
qle
rlqs
k
i
h
hg
ee
v
s
p
h
lv
d
s
d
n
r
e
d
d
d
khd
t
k
d
a
vk
-
-
rkegghd
si
andr
i
rnd
tl
tn
rtv
is
dg
y
rndrkv
e
kqe
-----
----
--
t
-
--
--
-
t
h
sl
e
h
l
k
dl
kvs
a
----------
gikv
v
esss
s
vggsfvvihs
qiv
-
l
l
sagt
a
--
l
kpgm
qf
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character