fig|1040638.4.peg.5683
Escherichia coli O104:H4 str. LB226692
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|6666666.5357.peg.376
Escherichia coli TY-2482
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585055.6.peg.124
Escherichia coli 55989
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585055.8.peg.124
Escherichia coli 55989
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340186.3.peg.4468
Escherichia coli E110019
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340186.5.peg.4694
Escherichia coli E110019
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|316401.4.peg.134
Escherichia coli ETEC H10407
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|573235.3.peg.127
Escherichia coli O26:H11 str. 11368
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|679207.4.peg.1167
Escherichia coli MS 107-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDE
V
LKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749532.3.peg.4396
Escherichia coli MS 78-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDE
V
LKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585396.4.peg.124
Escherichia coli O111:H- str. 11128
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPD
I
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|550672.3.peg.4394
Escherichia coli B088
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|344601.3.peg.2573
Escherichia coli B171
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|344601.5.peg.2676
Escherichia coli B171
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340185.3.peg.1635
Escherichia coli E22
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340185.4.peg.1724
Escherichia coli E22
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|331111.12.peg.455
Escherichia coli E24377A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|331111.3.peg.2694
Escherichia coli E24377A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656408.3.peg.5079
Escherichia coli H591
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|331112.3.peg.122
Escherichia coli HS
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|331112.6.peg.127
Escherichia coli HS
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|679206.4.peg.1642
Escherichia coli MS 119-7
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585395.4.peg.123
Escherichia coli O103:H2 str. 12009
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656443.3.peg.258
Escherichia coli TA271
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749545.3.peg.1258
Escherichia coli MS 182-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSV
P
LNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDE
V
LKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
N
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585034.4.peg.123
Escherichia coli IAI1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585034.5.peg.123
Escherichia coli IAI1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
L
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|562.375.peg.2525
Escherichia coli EC4100B
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|550676.3.peg.4790
Escherichia coli B185
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|701177.3.peg.127
Escherichia coli O55:H7 str. CB9615
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|550677.3.peg.511
Escherichia coli B354
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
N
N
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|216592.1.peg.4283
Escherichia coli 042 (14-529/529)
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
N
T
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|216592.3.peg.132
Escherichia coli 042
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
N
T
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|670888.3.peg.704
Escherichia coli 1827-70
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|344610.3.peg.4025
Escherichia coli 53638
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|344610.7.peg.1616
Escherichia coli 53638
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|481805.3.peg.3800
Escherichia coli ATCC 8739 (14-529/529)
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|481805.6.peg.3779
Escherichia coli ATCC 8739
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340184.3.peg.1432
Escherichia coli B7A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340184.6.peg.1512
Escherichia coli B7A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|595496.3.peg.126
Escherichia coli BW2952
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|536056.3.peg.3677
Escherichia coli DH1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656414.3.peg.246
Escherichia coli H736
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|83333.1.peg.123
Escherichia coli K12
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749538.3.peg.3139
Escherichia coli MS 116-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|679204.3.peg.3244
Escherichia coli MS 145-7
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749540.3.peg.2902
Escherichia coli MS 146-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749544.3.peg.1764
Escherichia coli MS 175-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749548.3.peg.4295
Escherichia coli MS 196-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|316407.3.peg.120
Escherichia coli W3110
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|316385.5.peg.103
Escherichia coli str. K-12 substr. DH10B
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|316385.7.peg.103
Escherichia coli str. K-12 substr. DH10B
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|511145.12.peg.125
Escherichia coli str. K-12 substr. MG1655
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|511145.6.peg.125
Escherichia coli str. K-12 substr. MG1655
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|595495.4.peg.1596
Escherichia coli KO11
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVK
I
EGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|566546.3.peg.1604
Escherichia coli W
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVK
I
EGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|566546.4.peg.119
Escherichia coli W
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVK
I
EGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|439855.10.peg.306
Escherichia coli SMS-3-5
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656419.3.peg.288
Escherichia coli M718
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
Q
QLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|413997.3.peg.128
Escherichia coli B str. REL606
MQRRDFLKYS
-
VALGVA
S
ALPLW
N
RAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|511693.5.peg.129
Escherichia coli BL21
MQRRDFLKYS
-
VALGVA
S
ALPLW
N
RAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|469008.4.peg.3616
Escherichia coli BL21(DE3)
MQRRDFLKYS
-
VALGVA
S
ALPLW
N
RAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749547.3.peg.3813
Escherichia coli MS 187-1
MQRRDFLKYS
-
VALGVA
S
ALPLW
N
RAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|562.373.peg.2139
Escherichia coli 1125A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|562.372.peg.2989
Escherichia coli 1212A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444454.5.peg.4589
Escherichia coli O157:H7 str. EC4024
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444449.5.peg.4040
Escherichia coli O157:H7 str. EC4042
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444448.5.peg.2797
Escherichia coli O157:H7 str. EC4045
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444453.5.peg.3400
Escherichia coli O157:H7 str. EC4076
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444452.5.peg.5288
Escherichia coli O157:H7 str. EC4113
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444450.8.peg.267
Escherichia coli O157:H7 str. EC4115
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444451.5.peg.4313
Escherichia coli O157:H7 str. EC4196
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|444447.5.peg.2972
Escherichia coli O157:H7 str. EC4206
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|478004.5.peg.2499
Escherichia coli O157:H7 str. EC4401
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|478005.5.peg.4207
Escherichia coli O157:H7 str. EC4486
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|478007.5.peg.4067
Escherichia coli O157:H7 str. EC508
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|478008.5.peg.4318
Escherichia coli O157:H7 str. EC869
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|637388.3.peg.3964
Escherichia coli O157:H7 str. FRIK2000
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|570506.3.peg.1128
Escherichia coli O157:H7 str. FRIK966
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|544404.4.peg.129
Escherichia coli O157:H7 str. TW14359
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|749531.3.peg.4489
Escherichia coli MS 69-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
N
N
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656437.3.peg.188
Escherichia coli TA143
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
N
N
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|405955.13.peg.135
Escherichia coli APEC O1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|405955.9.peg.108
Escherichia coli APEC O1 (14-529/529)
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|714962.3.peg.132
Escherichia coli IHE3034
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585035.6.peg.136
Escherichia coli S88
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|869729.3.peg.5039
Escherichia coli UM146
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|364106.7.peg.265
Escherichia coli UTI89
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|364106.8.peg.264
Escherichia coli UTI89
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585057.4.peg.134
Escherichia coli IAI39
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
T
DGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
N
SLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585057.6.peg.133
Escherichia coli IAI39
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
T
DGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
N
SLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749537.3.peg.3221
Escherichia coli MS 115-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHM
E
H
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656417.3.peg.197
Escherichia coli M605
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|562.376.peg.1179
Escherichia coli WV_060327
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|562.371.peg.922
Escherichia coli 1044A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|562.374.peg.2180
Escherichia coli 536A
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|83334.1.peg.214
Escherichia coli O157:H7
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|155864.1.peg.127
Escherichia coli O157:H7 EDL933
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|155864.8.peg.126
Escherichia coli O157:H7 EDL933
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|478006.5.peg.5021
Escherichia coli O157:H7 str. EC4501
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|386585.9.peg.225
Escherichia coli O157:H7 str. Sakai
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|502346.5.peg.1138
Escherichia coli O157:H7 str. TW14588
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
E
KTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
V
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
R
AYMAHCHLLEHEDTGMMLGFTV
fig|679205.4.peg.2288
Escherichia coli MS 124-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQ
H
G
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749533.3.peg.3148
Escherichia coli MS 84-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQ
H
G
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|358709.5.peg.3820
Escherichia coli 101-1
MQRRDFLKYS
-
VALGVA
S
ALPLW
N
RAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQF
L
ILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656379.3.peg.365
Escherichia coli FVEC1302
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
S
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
E
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656380.3.peg.301
Escherichia coli FVEC1412
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
S
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
E
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749549.3.peg.4694
Escherichia coli MS 198-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
S
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
E
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585056.7.peg.312
Escherichia coli UMN026
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
S
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
E
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749527.3.peg.3228
Escherichia coli MS 21-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIY
S
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASG
T
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656393.3.peg.740
Escherichia coli H299
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
N
N
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|753642.3.peg.3500
Escherichia coli NC101
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340197.3.peg.2931
Escherichia coli F11
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|340197.5.peg.3063
Escherichia coli F11
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749550.3.peg.2648
Escherichia coli MS 200-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|685038.3.peg.127
Escherichia coli O83:H1 str. NRG 857C
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|525281.3.peg.1141
Escherichia coli 83972
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|655817.3.peg.147
Escherichia coli ABU 83972
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|199310.1.peg.146
Escherichia coli CFT073 (14-529/529)
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|199310.4.peg.145
Escherichia coli CFT073
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749546.3.peg.3757
Escherichia coli MS 185-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|749528.3.peg.4693
Escherichia coli MS 45-1
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656440.3.peg.5193
Escherichia coli TA206
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GN
V
NHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656444.3.peg.462
Escherichia coli TA280
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
S
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
S
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
F
PS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
I
LMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|431946.3.peg.137
Escherichia coli SE15
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLE
L
PGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GH
I
GH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|670897.3.peg.1942
Escherichia coli 2362-75
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SS
Q
P
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|216593.1.peg.3449
Escherichia coli E2348/69 (14-529/529)
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SS
Q
P
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|574521.7.peg.130
Escherichia coli O127:H6 str. E2348/69
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SS
Q
P
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|362663.8.peg.133
Escherichia coli 536
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NAR
L
LNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|362663.9.peg.133
Escherichia coli 536
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNGV
I
YPQHAAPRGW
---------
LRLRLLNG
C
NAR
L
LNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585397.7.peg.133
Escherichia coli ED1a
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHM
H
H
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|585397.9.peg.133
Escherichia coli ED1a
MQRRDFLKYS
-
VALGVA
S
ALPLWSRAVFAAERPTLP
I
PDLLTTDA
R
NRIQLTIGAGQSTF
G
-
GKTATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
S
GGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLLTNG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
N
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
T
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
T
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHM
H
H
S
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
T
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
D
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|409438.11.peg.240
Escherichia coli SE11
MQRRDFLKYS
-
VALGVA
S
ALPLWSR
T
VFAAERPTLP
I
PDLLT
S
D
V
R
G
RI
R
L
S
I
MQ
GQSTF
A
-
GK
S
ATTWG
Y
NGNLLGPAV
K
LQRG
K
AVT
V
DIYN
Q
LTEET
T
LHWHGLEVPGEVDGGPQGI
---
I
P
PGGKRSVTLNVDQ
P
AATCW
F
HPHQHGKTGRQVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQD
-----------
KK
F
N
ADGQIDYQLDVM
T
AA
V
G
W
FGD
----------------------------------
TLL
I
NG
AI
YPQHAAPRGW
---------
LRLRLLNG
C
NARSLNFATSD
N
RPLYVIASDGGLLPEPVKV
S
ELPV
L
MGERFEVLVEVND
NK
PFD
L
-------------
V
TLPVSQMGMAI
A
PFDKPHPV
M
-
RIQPIAISASGA
--------------------------
LPDT
-
L
SSLP
A
-------
LPS
--
LE
GL
M
-
VR
K
LQLSM
-
-
DPMLDMMGMQ
---------
MLMEKYGDQAM
A
GMDHSQMM
GHMGH
GNMNHMNH
G
GKFDFHHANKINGQA
F
DMNK
P
MFAAAKGQY
----
ERWVISGVGD
-
MMLHPFHIHGTQFRILSENGK
---------
PPA
A
HR
A
GWKDTVKVEGNV
S
EVLVKFNH
E
APKE
H
AYMAHCHLLEHEDTGMMLGFTV
fig|656444.3.peg.4263
Escherichia coli TA280 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
EDD
VSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749532.3.peg.653
Escherichia coli MS 78-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTV
S
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|1040638.4.peg.2361
Escherichia coli O104:H4 str. LB226692 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|6666666.5357.peg.2242
Escherichia coli TY-2482 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|216592.1.peg.3927
Escherichia coli 042 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|216592.3.peg.3436
Escherichia coli 042 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|562.371.peg.2315
Escherichia coli 1044A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|562.373.peg.1792
Escherichia coli 1125A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|562.372.peg.2511
Escherichia coli 1212A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|362663.8.peg.3132
Escherichia coli 536 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|362663.9.peg.3143
Escherichia coli 536 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|562.374.peg.4587
Escherichia coli 536A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585055.6.peg.3469
Escherichia coli 55989 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585055.8.peg.3472
Escherichia coli 55989 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|525281.3.peg.3537
Escherichia coli 83972 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|655817.3.peg.3580
Escherichia coli ABU 83972 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|405955.13.peg.3436
Escherichia coli APEC O1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|405955.9.peg.2851
Escherichia coli APEC O1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|550672.3.peg.3264
Escherichia coli B088 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|344601.3.peg.457
Escherichia coli B171 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|344601.5.peg.456
Escherichia coli B171 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|550677.3.peg.3577
Escherichia coli B354 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340184.3.peg.2622
Escherichia coli B7A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340184.6.peg.2744
Escherichia coli B7A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340186.3.peg.516
Escherichia coli E110019 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340186.5.peg.539
Escherichia coli E110019 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340185.3.peg.385
Escherichia coli E22 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340185.4.peg.424
Escherichia coli E22 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|331111.12.peg.3756
Escherichia coli E24377A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|331111.3.peg.1173
Escherichia coli E24377A (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|562.375.peg.652
Escherichia coli EC4100B (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585397.7.peg.3671
Escherichia coli ED1a (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585397.9.peg.3669
Escherichia coli ED1a (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340197.3.peg.2640
Escherichia coli F11 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|340197.5.peg.2766
Escherichia coli F11 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656393.3.peg.4055
Escherichia coli H299 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656408.3.peg.3419
Escherichia coli H591 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585034.4.peg.3103
Escherichia coli IAI1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585034.5.peg.3101
Escherichia coli IAI1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|714962.3.peg.3476
Escherichia coli IHE3034 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|595495.4.peg.913
Escherichia coli KO11 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656417.3.peg.3844
Escherichia coli M605 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656419.3.peg.3991
Escherichia coli M718 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|679207.4.peg.3730
Escherichia coli MS 107-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|679206.4.peg.1373
Escherichia coli MS 119-7 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|679205.4.peg.494
Escherichia coli MS 124-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|679204.3.peg.739
Escherichia coli MS 145-7 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749545.3.peg.1779
Escherichia coli MS 182-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749546.3.peg.1918
Escherichia coli MS 185-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749550.3.peg.4067
Escherichia coli MS 200-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749527.3.peg.2297
Escherichia coli MS 21-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749528.3.peg.508
Escherichia coli MS 45-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749531.3.peg.78
Escherichia coli MS 69-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749533.3.peg.3626
Escherichia coli MS 84-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|753642.3.peg.4957
Escherichia coli NC101 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585395.4.peg.3863
Escherichia coli O103:H2 str. 12009 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585396.4.peg.3978
Escherichia coli O111:H- str. 11128 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|83334.1.peg.3877
Escherichia coli O157:H7 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|155864.1.peg.3902
Escherichia coli O157:H7 EDL933 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|155864.8.peg.3824
Escherichia coli O157:H7 EDL933 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444454.5.peg.2950
Escherichia coli O157:H7 str. EC4024 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444449.5.peg.2408
Escherichia coli O157:H7 str. EC4042 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444448.5.peg.1161
Escherichia coli O157:H7 str. EC4045 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444453.5.peg.3289
Escherichia coli O157:H7 str. EC4076 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444452.5.peg.615
Escherichia coli O157:H7 str. EC4113 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444450.8.peg.4245
Escherichia coli O157:H7 str. EC4115 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444451.5.peg.1035
Escherichia coli O157:H7 str. EC4196 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|444447.5.peg.1318
Escherichia coli O157:H7 str. EC4206 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|478004.5.peg.846
Escherichia coli O157:H7 str. EC4401 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|478006.5.peg.735
Escherichia coli O157:H7 str. EC4501 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|478007.5.peg.468
Escherichia coli O157:H7 str. EC508 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|478008.5.peg.1218
Escherichia coli O157:H7 str. EC869 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|637388.3.peg.5110
Escherichia coli O157:H7 str. FRIK2000 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|570506.3.peg.714
Escherichia coli O157:H7 str. FRIK966 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|386585.9.peg.4069
Escherichia coli O157:H7 str. Sakai (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|544404.4.peg.4054
Escherichia coli O157:H7 str. TW14359 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|502346.5.peg.3358
Escherichia coli O157:H7 str. TW14588 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|573235.3.peg.4218
Escherichia coli O26:H11 str. 11368 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|701177.3.peg.3756
Escherichia coli O55:H7 str. CB9615 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|685038.3.peg.3061
Escherichia coli O83:H1 str. NRG 857C (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585035.6.peg.3337
Escherichia coli S88 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|409438.11.peg.3470
Escherichia coli SE11 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|431946.3.peg.2981
Escherichia coli SE15 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|439855.10.peg.3465
Escherichia coli SMS-3-5 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656437.3.peg.3427
Escherichia coli TA143 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656440.3.peg.3244
Escherichia coli TA206 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656443.3.peg.4029
Escherichia coli TA271 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|869729.3.peg.267
Escherichia coli UM146 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|364106.7.peg.3411
Escherichia coli UTI89 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|364106.8.peg.3412
Escherichia coli UTI89 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|566546.3.peg.590
Escherichia coli W (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|566546.4.peg.3250
Escherichia coli W (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|562.376.peg.3875
Escherichia coli WV_060327 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|550676.3.peg.3173
Escherichia coli B185 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
I
WLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TV
H
ADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|199310.1.peg.3671
Escherichia coli CFT073 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
L
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|199310.4.peg.3537
Escherichia coli CFT073 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
L
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|670897.3.peg.4304
Escherichia coli 2362-75 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
G
N
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|216593.1.peg.5130
Escherichia coli E2348/69 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
G
N
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|358709.5.peg.1223
Escherichia coli 101-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|670888.3.peg.3240
Escherichia coli 1827-70 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|344610.3.peg.829
Escherichia coli 53638 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|344610.7.peg.3232
Escherichia coli 53638 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|481805.3.peg.722
Escherichia coli ATCC 8739 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|481805.6.peg.719
Escherichia coli ATCC 8739 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|413997.3.peg.3027
Escherichia coli B str. REL606 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|511693.5.peg.3036
Escherichia coli BL21 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|469008.4.peg.738
Escherichia coli BL21(DE3) (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|595496.3.peg.2995
Escherichia coli BW2952 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|316401.4.peg.3735
Escherichia coli ETEC H10407 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656379.3.peg.3754
Escherichia coli FVEC1302 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656380.3.peg.3675
Escherichia coli FVEC1412 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|656414.3.peg.3479
Escherichia coli H736 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|331112.3.peg.2987
Escherichia coli HS (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|331112.6.peg.3122
Escherichia coli HS (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585057.4.peg.3632
Escherichia coli IAI39 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585057.6.peg.3640
Escherichia coli IAI39 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|83333.1.peg.2964
Escherichia coli K12 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749537.3.peg.697
Escherichia coli MS 115-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749540.3.peg.283
Escherichia coli MS 146-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749547.3.peg.3959
Escherichia coli MS 187-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749549.3.peg.5088
Escherichia coli MS 198-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|585056.7.peg.3676
Escherichia coli UMN026 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|316407.3.peg.2903
Escherichia coli W3110 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|316385.7.peg.3214
Escherichia coli str. K-12 substr. DH10B (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|511145.12.peg.3111
Escherichia coli str. K-12 substr. MG1655 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|511145.6.peg.3096
Escherichia coli str. K-12 substr. MG1655 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749548.3.peg.734
Escherichia coli MS 196-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VP
S
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQM
SDGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
LP
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749538.3.peg.3891
Escherichia coli MS 116-1 (3-459/470)
LS
RR
Q
F
IQA
S
GI
AL
-
C
AGA
V
PL
--
K
A
SA
A
GQQQP
LPVP
P
LL
ESRR
G
QPLFM
T
VQRAHWS
FT
P
G
TR
A
SV
WGING
RY
LGP
TI
R
VWK
GD
D
V
K
L
IYS
NRLTE
NV
S
MTVA
GL
Q
VPG
PLM
GGP
ARM
---
M
SP
NADWAPV
L
PIR
QNAAT
L
WYH
ANTPNR
T
AQ
QV
YN
GLAG
MWLV
ED
EVSKS
L
PI
P
NHY
G
V
DD
F
PVI
I
QD
-----------
K
R
LD
NF
G
TPE
Y
N
---
E
P
GS
GGF
V
GD
----------------------------------
TLL
V
NGVQ
S
P
YVEVS
RGW
---------
V
RLRLLN
A
SN
S
R
RYQLQMN
DGRPL
H
VI
SG
D
Q
G
F
LP
A
PV
S
V
K
Q
L
SL
A
P
GER
R
E
I
LV
DMSN
GD
EVS
I
-------------
T
CGEAASIVDR
IR
G
F
FE
P
SSI
L
VSTLVLTLRPT
G
L
--------------------------
LP
LV
-
T
D
SLPM
RL
-----
L
S
TEI
MAG
S
P
-
I
RS
RDI
S
LG
DDP
--------
---------
-----------
-
--------
-----
--------
-
---------
G
INGQ
L
WD
V
N
R
I
DVT
A
QQ
G
TW
----
ERW
TVRADEP
-
---
QA
FHI
E
G
VM
F
Q
I
RNV
NG
A
---------
M
P
F
P
ED
RGWKDTV
W
V
D
G
Q
V
-
E
L
LV
Y
F
GQ
P
SWAH
F
PFYFNSQT
LE
MA
D
R
G
fig|749537.3.peg.106
Escherichia coli MS 115-1 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AR
E
------
P
E
P
F
AY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMARMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|585055.6.peg.3938
Escherichia coli 55989 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AG
E
------
P
E
P
F
TY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|585055.8.peg.3940
Escherichia coli 55989 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AG
E
------
P
E
P
F
TY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|405955.13.peg.5322
Escherichia coli APEC O1 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AG
E
------
P
E
P
F
TY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|405955.9.peg.4438
Escherichia coli APEC O1 (9-599/607)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AG
E
------
P
E
P
F
TY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|481805.6.peg.3651
Escherichia coli ATCC 8739 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AR
E
------
P
E
P
F
AY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|481805.3.peg.3673
Escherichia coli ATCC 8739 (9-599/607)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSL
S
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AR
E
------
P
E
P
F
AY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|1040638.4.peg.2089
Escherichia coli O104:H4 str. LB226692 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSLX
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AG
E
------
P
E
P
F
TY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
fig|6666666.5357.peg.3588
Escherichia coli TY-2482 (7-597/605)
RR
T
FLK
GLT
LS
-
GVAG
S
L
G
V
WS
--FN
A
RSSLX
LPV
AAS
L
Q---
G
T
Q
FD
LTIG
ETAVNI
T
-
G
SERQAKT
ING
G
L
P
GP
V
L
R
W
K
E
GD
T
I
TL
K
V
K
NRL
N
E
Q
TS
I
HWHG
I
I
L
P
AN
M
DG
V
P
GLSFMG
I
E
P
DDTYVY
T
FK
V
K
QN
-G
T
Y
WYH
S
H
----
S
G
L
Q
EQE
G
V
Y
G
A
II
I
D
AG
E
------
P
E
P
F
TY
D
REH
V
VM
LSDWTDENPHSLL
KKL
KK--
Q
S
DY
YNFNK
P
TV
G
S
FF
R
D
VNTRGLSATIADRKMWAEMKMNPTDLADVSGYTY
T
Y
L
M
NG
-
Q
A
P
L----
K
N
W
TGLFRPGEK
I
RLR
F
I
NGS
AMTYF
D
IRIP-
G
LK
M
T
V
V
A
A
DG
QY
V
-N
PV
T
V
D
E
FR
I
A
V
A
E
T
YD
V
I
VE
-PQ
G
E
A
Y
T
I
FAQSMDRTGYARG
T
LATREG
L
SA
A
V
P
P
L
D
-
P
R
P
L
L
-
T
ME
D
M
G
M
GGM
G
HDMAGMDHSQMGGMDNSGEMMSMDGAD
LPD
S
G
TSS
A
PM
DHSSMAG
M
DHSR
MAG
M
P
G
MQ
S
HPA
S
ET
D
N
P
LV
DM
QA
M
SVSPKLNDPG
I
G
L
RNN
G
R
K
V
L
TYA
D
LKSRF
-----
E
D
P
D
GREP
G
RTI
E
L
H
LTGH
M
EKF
AW
SF
N
G
I
K
F
S
D
A
APVLLKYG
ER
LR
I
T
L
I
N
D
T
MM
T
HP
I
H
L
HG
MWSD
L
ED
ENG
NFMVRKHTIDV
P
P
G
T
K
R
S
YR
V
T
ADAL
G
R-
-
--------
-
----
-
-
W
AY
HCHLL
Y
H
M
E
M
GM
Consen1
Primary consensus
mqRRdFlkyS
-
vALgvAgAlPLwsrAvfAaerptLPvPdLLttdagnriqlTigagqstFt
-
gktAttWGiNGnlLGPavrlqrGdaVtldiyNrLTEetslhwhGLeVPGevdGGPqgi
---
isPggkrsvtLnvdQnAATcWyHphqhgkTgrQVamGLAGlvviEDdeilkLmlPkqwGiDDvPVIvQD
-----------
KkldadGqidYqldvmpaagGffGD
----------------------------------
TLLtNGvqyPqhaapRGW
---------
lRLRLLNgsNaRslnfatsDgRPLyVIasDgGlLPePVkV
eLpvamGERfEvLVevndgdpfdi
-------------
ttlpvsqmgmaIrpFdkPhpvl
-
riqpiaisasGa
--------------------------
LPdt
-
tsSLPm
-------
LPs
--
maGlp
-
vRslqlSm
-
dDPmldmmgmq
---------
mlmekygdqam
-
gmdhsqmm
-----
gnmnhmnh
gkfdfhhankINGQawDmNkimfaAakGqy
----
ERWvisgvgd
-
mmlhpFHIhGtqFrIlseNGk
---------
pPa
hrrGWKDTVkVeGnV
-
EvLVkFnh
apke
aymahchlLEheDtGMMLGFTV
Consen2
Secondary consensus
ls
q
iqa
gi
-
c
s
v
--
k
sa
gqqqp
i
p
esrrrqplfm
vqrahws
gpetr
sv
y
ry
tikvwk
kd
kviys
q
nvtmtva
q
plm
arm
mp
nadwapv
pir
p
l
f
antpnr
aq
yn
mwlv
evsks
pi
nhy
v
f
i
rf
nf
tpe
n
---
etgsv
wv
v
ais
yvevs
v
ac
s
ryqlqmn
n
h
sg
q
f
a
s
q
sllp
r
i
dmsnnkevsl
vcgeaasivdr
ag
fe
ssimvstlvltlrpt
l
lv
ld
arl
teile
st
i
krdi
lg
-
--------
-----------
a
--------
ghmgh
--------
---------
g
lf
v
rpdvt
qq
tw
tvradep
---
qa
e
vm
q
rnv
a
m
f
eda
w
d
q
s
l
y
gq
swah
pfyfnsqt
ma
r
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character