fig|1040638.4.peg.637
Escherichia coli O104:H4 str. LB226692
M
ATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADY
K
LLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|595495.4.peg.327
Escherichia coli KO11
M
ATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340186.3.peg.2493
Escherichia coli E110019 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340186.5.peg.2574
Escherichia coli E110019 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|656393.3.peg.3140
Escherichia coli H299 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340184.3.peg.493
Escherichia coli B7A (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340184.6.peg.513
Escherichia coli B7A (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340185.3.peg.1269
Escherichia coli E22 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340185.4.peg.1339
Escherichia coli E22 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749545.3.peg.2543
Escherichia coli MS 182-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749532.3.peg.3760
Escherichia coli MS 78-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|585395.4.peg.2710
Escherichia coli O103:H2 str. 12009 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|550672.3.peg.1831
Escherichia coli B088 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|409438.11.peg.2536
Escherichia coli SE11 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|566546.3.peg.4636
Escherichia coli W (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|566546.4.peg.2296
Escherichia coli W (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|344601.3.peg.3203
Escherichia coli B171 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCL
L
RRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|344601.5.peg.3347
Escherichia coli B171 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCL
L
RRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|679205.4.peg.3629
Escherichia coli MS 124-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEA
K
RNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749533.3.peg.398
Escherichia coli MS 84-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEA
K
RNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|585034.4.peg.2163
Escherichia coli IAI1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
T
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|585034.5.peg.2158
Escherichia coli IAI1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
T
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749537.3.peg.854
Escherichia coli MS 115-1 (345-1264/1264)
ATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCL
S
RRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749540.3.peg.758
Escherichia coli MS 146-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAA
M
AAL
T
ELL
A
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGH
K
PLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|562.375.peg.2374
Escherichia coli EC4100B (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSY
R
TADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|573235.3.peg.3090
Escherichia coli O26:H11 str. 11368 (32-952/952)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASELTRW
A
GRKCPSGRVMGLANKGW
V
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|656443.3.peg.2814
Escherichia coli TA271 (32-952/952)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLP
R
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|656408.3.peg.2357
Escherichia coli H591 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLP
R
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|679207.4.peg.3830
Escherichia coli MS 107-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLP
R
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|679206.4.peg.1556
Escherichia coli MS 119-7 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLP
R
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|679204.3.peg.144
Escherichia coli MS 145-7 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLP
R
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|481805.3.peg.1642
Escherichia coli ATCC 8739 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNS
S
STADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|481805.6.peg.1637
Escherichia coli ATCC 8739 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNS
S
STADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|749538.3.peg.1658
Escherichia coli MS 116-1 (32-952/952)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNS
S
STADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWG
R
NSTQEAQFSV
I
DAITASELINDIEALFE
fig|331112.3.peg.2107
Escherichia coli HS (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNS
S
STADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFA
A
YELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|331112.6.peg.2203
Escherichia coli HS (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLRE
N
ARSWL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
V
C
S
A
DSLA
G
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNS
S
STADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFA
A
YELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|656419.3.peg.2869
Escherichia coli M718 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHP
V
ALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQV
L
PWLSTPAVAVLKSCQ
-
QQLTQPSNHA
C
ADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SQH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
IS
EF
KV
FHSP
TGHY
W
H
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
S
I
T
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DAR
T
WL
LK
YPEHA
L
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSAL
Q
HLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAW
L
TAGAPSKESWAF
I
ALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDET
R
ANDAVNRYKLLKKDARTIAAQQVARLESAMCLRRRWS
L
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
G
T
PHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|511693.5.peg.2141
Escherichia coli BL21
M
ATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPD
N
ALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFT
T
LGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHP
M
VRHLTRRLIWGVYS
A
D
N
Q
L
Q
ACFRVAEDNSYSTADDDLFTLPEGDIS
L
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|413997.3.peg.2133
Escherichia coli B str. REL606 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPD
N
ALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFT
T
LGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHP
M
VRHLTRRLIWGVYS
A
D
N
Q
L
Q
ACFRVAEDNSYSTADDDLFTLPEGDIS
L
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|469008.4.peg.1592
Escherichia coli BL21(DE3) (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPD
N
ALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFT
T
LGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHP
M
VRHLTRRLIWGVYS
A
D
N
Q
L
Q
ACFRVAEDNSYSTADDDLFTLPEGDIS
L
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|749547.3.peg.3083
Escherichia coli MS 187-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPD
N
ALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFT
T
LGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHP
M
VRHLTRRLIWGVYS
A
D
N
Q
L
Q
ACFRVAEDNSYSTADDDLFTLPEGDIS
L
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|331111.12.peg.2685
Escherichia coli E24377A (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHA
C
ADLLPA
I
V
VSPPWL
S
KKKK
AT
IPVL
E
LAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STA
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AD
EAQDNARAALRML
S
ENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPS
R
ESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
Q
LL
T
CFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|331111.3.peg.146
Escherichia coli E24377A (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHA
C
ADLLPA
I
V
VSPPWL
S
KKKK
AT
IPVL
E
LAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STA
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AD
EAQDNARAALRML
S
ENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPS
R
ESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
Q
LL
T
CFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|358709.5.peg.558
Escherichia coli 101-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPD
N
ALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFT
T
LGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
W
A
D
E
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHP
M
VRHLTRRLIWGVYS
A
D
N
Q
L
Q
ACFRVAEDNSYSTADDDLFTLPEGDIS
L
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|749548.3.peg.2969
Escherichia coli MS 196-1 (344-1261/1261)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
Q
TKRCHDRMTKA
I
AAFPHAA
M
AAL
T
ELL
G
QKEENSWRIMLMTMLISQP
A
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPAV
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
CY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STT
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
LS
EF
KV
FHSP
TGHY
W
Q
L
GI
L
TT
LP
L
E
KA
V
K
A
WN
-
A
L
TLSP
HT
DTEYS
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPD
N
ALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFT
T
LGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHP
M
VRHLTRRLIWGVYS
A
D
N
Q
L
Q
ACFRVAEDNSYSTADDDLFTLPEGDIS
L
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
---
GWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
E
R
YGWGSNSTQEAQFSV
I
DAITASELINDIEALFE
fig|656417.3.peg.2778
Escherichia coli M605 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SS
IPVLDLAPL
G
IE
P
I
SY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STA
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
IS
EF
KV
FHSP
TGHY
W
H
L
GI
L
TT
LP
L
E
KA
V
R
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FG
F
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DAR
I
WL
LK
YPEHA
I
TGLLP
A
ALGK
TD
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
AN
N
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGR
I
MGLANKGW
I
K
G
T
PQD
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749531.3.peg.2033
Escherichia coli MS 69-1 (344-1264/1264)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
G
IE
P
I
SY
LT
EE
IS
NQLL-
--
----
-
--
---
-
--
--
----------
A
K
YI
W
Y
SKH
I
TVS
HE
--
E
STA
N
L
L
AR
M
GF
Q
R
R
-
--IAGT
-
-
YI
K
APE
AV
V
E
AWL
N
ED
Y
S
TL
IS
EF
KV
FHSP
TGHY
W
H
L
GI
L
TT
LP
L
E
KA
V
R
A
WN
-
A
L
TLSP
HT
DTEYA
M
LH
FGL
K
GLPG
L
V
N
SL
A
RYPQE
ALPITN
YFAASELAPAVAR
A
FNKLKTLR
Q
DAR
T
WL
LK
YPEHA
I
TGLL
S
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNA
M
LALDPLDNHPTKIPTLP
A
FYQPS
I
WTRP
V
LKANAQSLPDSAL
Q
HLGEMLRF
H
QEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAP
A
LGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
L
Q
ACFRVAEDNSYSTADDDLFTL
Q
EGDIS
V
GIPHVLEISPTDA
V
AFGQLFADYELLPPFRQL
E
RNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPA
D
LSA
EQ
V
LSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLD
D
ITASELINDIEALFE
fig|685038.3.peg.2169
Escherichia coli O83:H1 str. NRG 857C (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|585397.7.peg.2484
Escherichia coli ED1a (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
G
AW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
CFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|585397.9.peg.2482
Escherichia coli ED1a (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
G
AW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
CFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|753642.3.peg.2981
Escherichia coli NC101 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DARSWL
LK
YP
Q
HA
I
TGLLP
A
ALGK
TG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
L
Q
AN
N
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|362663.8.peg.2166
Escherichia coli 536 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
C
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|362663.9.peg.2171
Escherichia coli 536 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
C
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340197.3.peg.173
Escherichia coli F11 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
C
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|340197.5.peg.178
Escherichia coli F11 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
C
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749550.3.peg.1549
Escherichia coli MS 200-1 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KP
S
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
C
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|656440.3.peg.2096
Escherichia coli TA206 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
T
ALPR
-------
-
--------
FAPY
-
A
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLIS
H
P
T
Q
AEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
D
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|405955.13.peg.2329
Escherichia coli APEC O1 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|405955.9.peg.1898
Escherichia coli APEC O1 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|714962.3.peg.2388
Escherichia coli IHE3034 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|585035.6.peg.2237
Escherichia coli S88 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|869729.3.peg.1323
Escherichia coli UM146 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|364106.7.peg.2415
Escherichia coli UTI89 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|364106.8.peg.2418
Escherichia coli UTI89 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLT
H
RLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|525281.3.peg.3914
Escherichia coli 83972 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYG
R
GSNSTQEAQFSVLDAITASELINDIEALFE
fig|655817.3.peg.2555
Escherichia coli ABU 83972 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYG
R
GSNSTQEAQFSVLDAITASELINDIEALFE
fig|749528.3.peg.1567
Escherichia coli MS 45-1 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYG
R
GSNSTQEAQFSVLDAITASELINDIEALFE
fig|749546.3.peg.2890
Escherichia coli MS 185-1 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRAT
I
GLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYG
R
GSNSTQEAQFSVLDAITASELINDIEALFE
fig|199310.1.peg.2574
Escherichia coli CFT073 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEG
N
IS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYG
R
GSNSTQEAQFSVLDAITASELINDIEALFE
fig|199310.4.peg.2486
Escherichia coli CFT073 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
N
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YPEHA
I
TGLLP
A
ALGK
AS
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEG
N
IS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
T
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
A
GWIGWMI
N
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYG
R
GSNSTQEAQFSVLDAITASELINDIEALFE
fig|439855.10.peg.1085
Escherichia coli SMS-3-5 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
D
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DAR
T
WL
LK
YP
Q
HA
I
TGLLP
A
ALGK
AG
E
S
QDNARAALRML
I
ENG
Y
PS
LLQEIA
Q
RYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSAL
F
HLGEMLRFPQE
D
T
LYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAG
V
PSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDET
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
G
GWIGWMIK
P
LG
R
WSLIMEI
N
E
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|749527.3.peg.1470
Escherichia coli MS 21-1 (344-1262/1262)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVL
S
HINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
D
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DAR
T
WL
LK
YP
Q
HA
I
TGLLP
S
ALGK
AG
EAQDNARAALRML
I
EN
D
HQPLLQEIA
Q
RYN
L
PEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSAL
F
HLGEMLRFPQE
D
T
LYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAG
V
PSKESWAFTALGVLGNDDTA
C
KLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
G
T
PQD
G
GWIGWMIK
P
LG
R
WSLIMEI
N
E
---------
GFA
A
GMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|431946.3.peg.2084
Escherichia coli SE15 (332-1253/1253)
A
A
DE
Q
V
LA
E
L
G
KY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGKI
W
C
AT
I
L
K
EQGV
G
AL
A
R
-------
-
--------
FAPY
-
A
A
G
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTK
T
F
VR
FPHAALAALAELL
A
QK
DQ
K
R
WR
M
MLMTMLISQP
T
LAE
R
VIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASAD
M
LPAV
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DAR
I
WL
LK
YPEHA
I
TGLLP
A
ALGK
TG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
LKANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
Q
AN
N
AVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
LLACFRVAEDNSYSTADDDLFTLPEGDIS
I
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|216592.1.peg.2904
Escherichia coli 042 (333-1253/1253)
TD
A
Q
V
LA
D
LEKY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGNI
W
C
AT
A
LQEQGV
T
AL
A
R
-------
-
--------
FA
H
Y
-
A
TG
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTKA
F
VR
FPHAALAALAELL
A
QK
D
E
K
R
WR
M
MLMTMLISQP
I
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASAD
M
LPA
I
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALD
A
LDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALL
R
LGEMLRFPQEEALYPGL
L
QVK
A
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKD
T
RT
V
AAQQV
T
RLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
K
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|216592.3.peg.2451
Escherichia coli 042 (333-1253/1253)
TD
A
Q
V
LA
D
LEKY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGNI
W
C
AT
A
LQEQGV
T
AL
A
R
-------
-
--------
FA
H
Y
-
A
TG
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTKA
F
VR
FPHAALAALAELL
A
QK
D
E
K
R
WR
M
MLMTMLISQP
I
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASAD
M
LPA
I
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALD
A
LDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALL
R
LGEMLRFPQEEALYPGL
L
QVK
A
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRDASGSRLKDLPKPNKSDDE
S
R
A
D
E
AVNRYKLLKKD
T
RT
V
AAQQV
T
RLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
K
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
M
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGSNSTQEAQFSVLDAITASELINDIEALFE
fig|550677.3.peg.1660
Escherichia coli B354 (333-1249/1249)
E
S
S
L
L
K
A
LE
S
Y
CAY
-
---
-
D
L
F
N
D
---
YY
HGCI
W
N
V
TVLQEQG
I
A
G
I
A
R
-------
-
--------
F
T
PY
-
A
Y
A
D
L
C
GS
I
L
AE
INHP
Q
AL
M
LLIRV
S
G
K
TKRCH
E
RMTKA
C
AAFPH
T
ALAALAELL
A
QKEENSWRIMLMTMLISQP
A
LAEQV
T
PWLSTPAVAVLKSCQ
-
QQLTQPSNHASAD
M
LPAV
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
A
A
I
D
L
W
Q
R
GH
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DAR
T
WL
LK
YPEHA
I
TGLLP
S
ALGK
SG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLW
N
RP
V
LKANAQSLPDSAL
Q
HLGEMLRF
H
QEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAW
LA
AGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPE
D
DIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASELTRW
A
GRKCPSGRVMGLANKGW
I
RGEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
I
LSKLWLWEGK
A
ESY
D
W
R
SNSTQEAQ
L
SVLDAITASELINDIEALFE
fig|656444.3.peg.3094
Escherichia coli TA280 (333-1249/1249)
E
S
S
L
L
K
A
LE
S
Y
CAY
-
---
-
D
L
F
N
D
---
YY
HGCI
W
N
V
TVLQEQG
I
A
G
I
A
R
-------
-
--------
FAPY
-
A
Y
A
D
L
C
GS
I
L
AE
INHP
Q
AL
M
LLIRV
S
G
K
TKRCH
E
RMTKA
C
AAFPH
T
ALAALAELL
A
QKEENSWRIMLMTMLISQP
A
LAEQV
T
PWLSTPAVAVLK
A
C
L
-
QQLTQPSNHA
C
ADLLPAV
L
VSPPWL
S
KKKK
SV
IPVLDL
T
PL
S
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
AR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
AP
D
A
A
I
D
L
W
Q
R
GH
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
L
T
GL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DAR
T
WL
LK
YPEHA
I
TGLLP
S
ALGK
SG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALDPLDNHPTKIPTLP
A
FYQPSLW
N
RP
V
LKA
S
AQSLPDSAL
Q
HLGEMLRF
H
QEEALYPGL
L
QVKD
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAP
A
LGLDDNGSL
L
LDFGPRQFTVSFDETLKPFVRD
V
SGSRLKDLPKPNKSDDE
S
Q
ANDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
A
EN
Q
L
Q
ACFRVAEDNSYSTADDDLFTL
Q
EGDIS
I
GIPHVLEISPTDA
V
AFGQLFADYELLPPFRQL
E
RNSYALTEAERNASELTRW
A
GRKCPSGRVMGLANKGW
I
K
GEPQD
G
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPA
D
LSA
EQ
V
LSKLWLWE
A
K
A
ESYGWGSNSTQEAQFSVLD
D
ITASELINDIEALFE
fig|656380.3.peg.2204
Escherichia coli FVEC1412 (317-1237/1237)
TD
A
Q
V
LA
D
LEKY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGNI
W
C
AT
A
LQEQG
A
T
AL
A
R
-------
-
--------
FA
H
Y
-
A
TG
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTKA
F
VR
FPHAALAALAELL
A
QK
D
E
K
R
WR
M
MLMTMLISQP
I
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASAD
M
LPA
I
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALD
A
LDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALL
R
LGEMLRFPQEEALYPGL
L
QVK
A
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPF
M
RDASGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
P
L
D
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGS
H
STQEAQFSVLD
D
ITASELINDIEALFE
fig|749549.3.peg.4981
Escherichia coli MS 198-1 (333-1253/1253)
TD
A
Q
V
LA
D
LEKY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGNI
W
C
AT
A
LQEQG
A
T
AL
A
R
-------
-
--------
FA
H
Y
-
A
TG
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTKA
F
VR
FPHAALAALAELL
A
QK
D
E
K
R
WR
M
MLMTMLISQP
I
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASAD
M
LPA
I
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALD
A
LDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALL
R
LGEMLRFPQEEALYPGL
L
QVK
A
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPF
M
RDASGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
P
L
D
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGS
H
STQEAQFSVLD
D
ITASELINDIEALFE
fig|585056.7.peg.2629
Escherichia coli UMN026 (333-1253/1253)
TD
A
Q
V
LA
D
LEKY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGNI
W
C
AT
A
LQEQG
A
T
AL
A
R
-------
-
--------
FA
H
Y
-
A
TG
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTKA
F
VR
FPHAALAALAELL
A
QK
D
E
K
R
WR
M
MLMTMLISQP
I
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASAD
M
LPA
I
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALD
A
LDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALL
R
LGEMLRFPQEEALYPGL
L
QVK
A
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDETLKPF
M
RDASGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
P
L
D
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGS
H
STQEAQFSVLD
D
ITASELINDIEALFE
fig|656379.3.peg.2646
Escherichia coli FVEC1302 (333-1253/1253)
TD
A
Q
V
LA
D
LEKY
--
YPG
Q
P
GQ
V
FDD
---
YY
GGNI
W
C
AT
A
LQEQG
A
T
AL
A
R
-------
-
--------
FA
H
Y
-
A
TG
D
T
C
G
E
VL
M
HINHP
Q
ALTLLI
HA
S
E
Q
G
KRCHDRMTKA
F
VR
FPHAALAALAELL
A
QK
D
E
K
R
WR
M
MLMTMLISQP
I
LAEQVIPWLSTPAVAVLKSCQ
-
QQL
K
QPSNHASAD
M
LPA
I
L
VSPPWL
S
KKKK
SV
M
PVLDL
T
PL
P
L
E
S
C
CT
LT
ET
AE
KEIH-
--
----
-
--
---
-
--
--
----------
A
R
HR
W
H
AHQ
I
DIG
Q
K
--
E
DIQ
N
Y
L
TR
LGF
N
RW
-
--NNGQ
-
-
Y
M
K
A
S
D
AV
V
E
L
W
Q
R
G
D
Y
S
A
L
IS
EF
KT
F
WHS
YQRE
W
Q
L
YM
L
AA
LP
I
E
K
TA
Q
A
WN
-
V
L
SKEP
H
V
GVEFV
M
TH
L
Q
LAGL
Q
GF
I
H
S
F
S
RYPQE
ALPVAQ
YFAA
I
ELAP
L
I
AR
A
FNKLKTLR
Q
DARSWL
LK
YPEHA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDAVNALLALD
A
LDNHPTKIPTLP
A
FYQPSLWTRP
V
LKANAQSLPDSALL
R
LGEMLRFPQEEALYPGL
L
QVK
A
A
CT
A
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAI
S
SDIALMQLNGIAQKLKFKALQERAKEKIA
D
IAESRELTVAELEDRLAPDLGLDDNGSL
P
LDFGPRQFTVSFDETLKPF
M
RDASGSRLKDLPKPNKSDDE
S
R
S
NDAVNRYKLLKKDART
V
AAQQVARLESAMCLRRRWS
P
ENFQLFLVEHPLVRHLTRRLIWGVYS
T
EN
E
LLACFRVAEDNSYSTADDDLFTLPEGDIS
V
GIPHVLEISPTDAAAFGQLFADYELLPPFRQLDRNSYALTE
T
ERNASEL
I
RW
A
GRKCPSGRVMGLANKGW
I
K
G
T
P
L
D
A
GWIGWMIK
P
LG
R
WSLIMEIDE
---------
GFAVGMSPAELSA
EQ
LLSKLWLWEGK
A
ESYGWGS
H
STQEAQFSVLD
D
ITASELINDIEALFE
fig|562.376.peg.4030
Escherichia coli WV_060327 (344-1252/1252)
VATDE
H
ILASLEKY
----
HEPYAIFDD
---
YY
CGAI
WSATVLQEQGV
A
ALPR
-------
-
--------
FAPY
-
V
ASD
Y
CADVLRHINHPFALTLLIRVAG
H
TKRCHDRMTKA
C
AAFPHAALAALAELL
V
QKEENSWRIMLMTMLISQP
T
LAEQVIPWLSTPAVAVLKSCQ
-
QQLTQPSNHASADLLPA
I
V
VSPPWL
S
KKKK
SP
IPVLDLAPL
N
L
E
S
I
CT
I
T
DT
E
A
KEFQ-
--
----
-
--
---
-
--
--
----------
T
H
WD
WE
---
-
--P
H
KPG
E
GAK
D
F
L
YS
LG
Y
R
RW
-
--DFDT
Y
K
YI
G
A
S
D
SA
I
DAW
E
REDF
A
TL
IQ
M
F
KA
H
H
A
P
YQGE
W
H
L
NS
L
PF
LP
M
QKAIK
L
W
E
-
F
L
SKEP
HT
AIKPV
M
LY
LR
LAG
M
S
GF
L
H
S
F
S
RYPQE
GFAVAN
YFAA
T
ELAPAVAR
A
FNKLKTLR
Q
DA
S
SWL
LK
YP
Q
HA
I
TGLLP
A
ALGK
AG
EAQDNARAALRMLTENGHQPLLQEIARRYNQPEVTDA
M
NALLALDPLDNHPTKIPTLP
T
FYQPSLWTRP
L
S
KANAQSLPDSALLHLGEMLRFPQEEALYPGL
L
QVKD
A
CT
T
DSLA
E
FAWDLFTAWQTAGAPSKESWAFTALGVLGNDDTARKLTPLIRAWPGESQHKRATVGLDILAAIGSDIALMQLNGIAQKLKFKALQERAKEKIA
N
IAESRELTVAELEDRLAPDLGLDDNGSL
L
LDFGPRQFTVSFDE
SV
KPFVRDASGSRLKDLPKPNK
T
DDET
L
A
E
E
AVNRYKLLKKD
V
RT
V
AAQQ
I
S
RLE
A
AMC
Q
RRRW
T
A
E
Q
F
S
LFLVEHPLVRH
I
T
Q
RL
M
WG
I
Y
D
A
D
N
Q
L
T
S
CFRVAED
G
S
F
S
D
G
Q
D
TP
FTL
E
Q
G
N
I
-
-
GIPHVLEI
PTMQ
AA
E
F
A
QLF
S
DYELLPPFRQLDR
PWSE
L
S
D
S
E
KSS
G
D
L
Q
RW
A
GR
Q
AA
SGRV
A
GL
M
NKGW
Q
RG
D
VL
D
G
G
-------
-
-
G
Y
Y
S
FYKA
V
D
D
GYVELSVTP
GF
C
VG
L
PV
T
E
I
S
D
SQ
T
I
DH
I
H
L
YK
QT
S
RKSV
Y
-----
---
P
FSVLD
D
ITASELINDIE
S
LF
D
fig|562.376.peg.4029
Escherichia coli WV_060327 (313-1265/1265)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLIT
V
T
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
RP
Q
P
V
T
EECEY
A
TV
D
M
LP
E
L
F
V
A
PPW
V
I
N
KKK
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
Y
QA
SQ
QTLFTDLPPI
K
K
ER
WE
TTF
I
PLT
P
E
--
-
--Q
Q
I
L
WR
LGF
K
E
W
R
RTGEEQ
Y
E
KK
I
M
P
Q
S
V
VDA
L
LR
F
DF
P
A
L
KA
EF
AQ
Y
H
N
K
GSRH
W
Q
L
YA
L
CF
LP
T
Q
H
AI
S
F
L
D
Q
I
I
NEEQ
F
S
GEREI
L
AI
FG
D
A
A
I
P
A
F
M
K
C
L
Q
R
K
PQ
Q
LWIFTL
FLGV
SELA
LPM
A
Q
R
LQ
K
-
K
MS
A
EDAR
K
WL
AN
F
P
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIAR
G
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EE
HP
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
M
C
HLG
T
ML
S
FP
RDITA
Y
A
GL
E
II
KD
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIAQK
I
KF
V
ALQE
H
A
S
DR
I
N
M
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
I
NGSL
I
LDFGPR
K
FTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAFGQ
IY
ADYE
K
LPPFRQLDR
G
Y
Y
H
LT
DN
ER
D
TH
EL
I
RW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
TSP
G
E
R
Y
ST
Q
P
SL
T
----
FS
A
LDAITASELINDIE
S
LF
D
fig|550677.3.peg.1659
Escherichia coli B354 (313-1263/1263)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
N
A
L
I
KLIT
I
T
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PLT
P
E
--
-
--Q
Q
I
L
WR
LGF
K
E
W
R
RTGEEQ
Y
E
KK
I
M
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AQ
Y
H
N
K
GSRH
W
Q
L
YA
L
CF
LP
T
Q
H
AI
S
F
L
N
Q
I
I
NEEQ
F
S
GEREI
L
AI
FG
D
A
A
I
P
A
F
M
K
C
L
Q
R
K
PQ
Q
LWIFTL
FLGV
SELA
LPM
A
Q
R
LQ
K
-
K
MS
A
EDAR
K
WL
AN
F
P
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
Q
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
K
K
I
KF
V
ALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
V
DN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
S
I
YA
M
R
K
S
TP
Y
G
A
L
EL
E
TEP
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
A
P
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
D
ITASELINDIE
S
LF
D
fig|431946.3.peg.2083
Escherichia coli SE15 (313-1266/1266)
V
V
TDE
A
A
L
T
A
A
EKY
----
DF
P
-
PLYR
D
---
FR
A---
YL
A
ML
L
ANN
GV
S
GVS
R
IL
T
EF
V
E
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
W
H
KA
L
VKLIT
I
T
P
E
L
VCD
VIPW
V
S
AK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
T
AD
M
LP
E
L
F
VSPPW
M
T
K
E
KK
KN
S
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
A
KKLTR
TY
LVTR
F
QQ
IAQ
Q
QA
TK
Q
I
LFTDLPP
M
E
K
AS
WE
RHL
V
PLT
P
E
--
-
--Q
Q
I
L
WC
LGF
D
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
AQ
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGYI
L
AI
FG
S
A
A
I
P
A
F
M
T
C
L
Q
R
E
PQ
R
LWFFTF
FLGV
N
ELA
LPM
A
Q
R
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
QG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
E
PL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
N
Q
P
LPD
D
A
MR
HLG
I
ML
S
FP
RDIT
P
Y
A
GL
A
II
K
E
T
F
T
R
E
SLA
E
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIR
T
WPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIA
K
K
I
KF
V
ALQE
H
A
C
D
KI
N
M
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAFGQ
IY
T
DYE
Q
LPPFRQLDR
G
Y
Y
H
L
ADN
ER
DS
H
EL
I
RW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QL
PVESVKI
-
TSP
D
N
R
YG
KQ
S
SL
T
----
FS
A
LD
D
ITASELINDIE
S
LF
D
fig|550676.3.peg.2409
Escherichia coli B185 (313-1267/1267)
M
V
S
D
D
V
A
LA
IQ
EKY
----
G
F
P
-
PLY
N
D
---
FR
K---
YL
AT
L
L
ANN
G
M
R
GVS
R
ILL
KLPV
D
YPVKY
T
D
L
F
TH
I
H
A
N
A
E
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGV
N
G
K
K
KHLEYLS
KA
C
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
N
N
E
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
P
VAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VQ
Y
H
K
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
L
S
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
I
T
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QL
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|478005.5.peg.3918
Escherichia coli O157:H7 str. EC4486
M
I
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|562.371.peg.4320
Escherichia coli 1044A (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|562.372.peg.5741
Escherichia coli 1212A (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|562.374.peg.1830
Escherichia coli 536A (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|155864.1.peg.2930
Escherichia coli O157:H7 EDL933 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|155864.8.peg.2813
Escherichia coli O157:H7 EDL933 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|444454.5.peg.1830
Escherichia coli O157:H7 str. EC4024 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|444450.8.peg.3098
Escherichia coli O157:H7 str. EC4115 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|478004.5.peg.3105
Escherichia coli O157:H7 str. EC4401 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|478006.5.peg.2340
Escherichia coli O157:H7 str. EC4501 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|478008.5.peg.4192
Escherichia coli O157:H7 str. EC869 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|637388.3.peg.2113
Escherichia coli O157:H7 str. FRIK2000 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|570506.3.peg.3553
Escherichia coli O157:H7 str. FRIK966 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|386585.9.peg.3053
Escherichia coli O157:H7 str. Sakai (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|502346.5.peg.4462
Escherichia coli O157:H7 str. TW14588 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
R
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|562.373.peg.4568
Escherichia coli 1125A (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
W
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|478007.5.peg.4258
Escherichia coli O157:H7 str. EC508 (9-963/963)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
---
FR
NFRA
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
D
HSDNPTYL
F
ERI
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
S
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
TV
D
M
LP
E
L
L
VSPPW
M
T
K
E
KK
KN
T
PV
F
DL
PV
L
P
V
P
S
V
SD
V
T
PE
I
T
KKLTR
TY
LVTH
F
QQ
IAQ
Q
QA
TK
QTLFTDLPPI
K
K
AS
WE
KHL
I
PLT
P
E
--
-
--Q
Q
I
L
WH
LGF
E
K
W
W
ESGEKI
Y
E
K
I
P
AP
Q
SAVDA
L
LR
F
DF
P
A
L
NA
EF
VH
Y
H
N
N
AYKS
W
N
L
IA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
V
KEDN
YS
GEGNI
L
AI
FG
S
A
A
I
P
A
F
M
A
C
L
Q
R
D
P
RR
LCFFPF
FLGV
SELA
LPM
A
Q
Q
LQ
K
-
K
MSY
EDAR
N
WL
TD
YP
R
HA
A
A
GLLP
V
ALGK
KG
KDR
D
C
AR
Q
ALR
L
L
VNLNQRETIE
EIA
QG
YNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
L
S
D
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
D
IIRE
I
F
T
R
E
SLA
E
F
G
WDL
YP
AW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
I
LDFGPR
K
FTV
G
FDETLKP
V
V
C
DA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
TH
ELTRW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
T
TP
H
GD
L
EL
E
TEP
---------
-
F
SLIYGETGYGD
QH
PVESVKI
-
TSP
D
DR
YG
KQ
S
SL
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
fig|216592.1.peg.2903
Escherichia coli 042 (313-1265/1265)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
T
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
P
P
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
K
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIAQK
I
KF
V
ALQE
H
A
S
DR
I
N
M
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TEP
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
A
P
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
D
ITASELINDIE
S
LF
D
fig|216592.3.peg.2450
Escherichia coli 042 (313-1265/1265)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
T
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
P
P
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
K
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIAQK
I
KF
V
ALQE
H
A
S
DR
I
N
M
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPKPN
Q
SD
EK
T
Q
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
N
D
EN
A
L
IT
CFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TEP
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
A
P
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
D
ITASELINDIE
S
LF
D
fig|656380.3.peg.2203
Escherichia coli FVEC1412 (249-1201/1201)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
T
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
P
P
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
K
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
K
K
I
KF
V
ALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPK
Q
N
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQV
D
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
AS
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
N
ITASELINDIE
S
LF
D
fig|749549.3.peg.4982
Escherichia coli MS 198-1 (297-1249/1249)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
T
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
P
P
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
K
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
K
K
I
KF
V
ALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPK
Q
N
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQV
D
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
AS
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
N
ITASELINDIE
S
LF
D
fig|656379.3.peg.2645
Escherichia coli FVEC1302 (313-1265/1265)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
T
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
P
P
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
K
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
K
K
I
KF
V
ALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPK
Q
N
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQV
D
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
AS
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
N
ITASELINDIE
S
LF
D
fig|585056.7.peg.2628
Escherichia coli UMN026 (313-1265/1265)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
WR
KA
L
VKLITAT
P
E
L
V
T
D
VIP
R
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
F
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
T
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
ALR
L
L
V
K
LNQRETIE
EIARRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
P
P
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDITA
Y
A
GL
A
T
I
K
E
T
F
T
R
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
VS
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
K
K
I
KF
V
ALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
SS
GSL
T
LDFGPRQFTV
G
FDETLKP
V
VRDA
N
G
KV
LKDLPK
Q
N
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQV
D
RLE
Q
AMC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEI
P
P
ES
AAAF
R
Q
IY
V
DYELLPPF
Q
QL
E
R
G
SY
H
L
ADN
ERN
V
H
EL
S
RW
D
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
T
AS
Y
DR
YG
KQ
S
S
P
T
----
FSVLD
N
ITASELINDIE
S
LF
D
fig|585057.4.peg.954
Escherichia coli IAI39 (315-1268/1268)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
FRN
FR
A---
YL
A
ML
L
ANN
G
I
R
GVS
Q
ILLEFTE
E
HSDNPTYL
F
ER
N
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
W
H
KA
L
VKLIT
I
T
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
T
AD
M
LP
Q
L
F
V
A
PPW
V
I
N
KKK
NV
IPV
F
DL
PV
L
P
V
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
Y
QA
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
R
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
TL
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEEQ
YS
GEKEI
L
GI
FG
S
T
A
I
P
S
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
A
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
AL
L
L
L
V
K
LNQRETIE
EIA
QG
YNQP
D
I
LA
A
LAT
L
FDS
DPL
EE
HP
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
M
C
HLG
T
ML
S
FP
RDITA
Y
A
GL
E
II
KD
T
F
T
R
DSLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LG
D
DDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIAQK
I
KF
A
ALQE
H
A
S
D
KI
N
M
V
A
K
N
R
G
LT
M
AELEDRLAPDLGLD
I
NGSL
T
LDFGPRQFTV
G
FDETLKP
M
VRDA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
V
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAFGQ
IY
ADYE
K
LPPFRQLDR
G
Y
Y
H
LT
DN
ER
D
TH
EL
I
RW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
TSP
G
E
R
Y
ST
Q
P
SL
T
----
FS
A
LDAITASELINDIE
S
LF
D
fig|585057.6.peg.951
Escherichia coli IAI39 (315-1268/1268)
TD
D
T
A
L
ST
V
EKY
----
DF
P
-
PLYR
D
FRN
FR
A---
YL
A
ML
L
ANN
G
I
R
GVS
Q
ILLEFTE
E
HSDNPTYL
F
ER
N
-
S
ETE
N
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
DKE
W
H
KA
L
VKLIT
I
T
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEECEY
A
T
AD
M
LP
Q
L
F
V
A
PPW
V
I
N
KKK
NV
IPV
F
DL
PV
L
P
V
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
Y
QA
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
R
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
TL
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEEQ
YS
GEKEI
L
GI
FG
S
T
A
I
P
S
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
A
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
AL
L
L
L
V
K
LNQRETIE
EIA
QG
YNQP
D
I
LA
A
LAT
L
FDS
DPL
EE
HP
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
M
C
HLG
T
ML
S
FP
RDITA
Y
A
GL
E
II
KD
T
F
T
R
DSLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LG
D
DDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIAQK
I
KF
A
ALQE
H
A
S
D
KI
N
M
V
A
K
N
R
G
LT
M
AELEDRLAPDLGLD
I
NGSL
T
LDFGPRQFTV
G
FDETLKP
M
VRDA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
A
E
Q
V
R
LFLVEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
AAAFGQ
IY
ADYE
K
LPPFRQLDR
G
Y
Y
H
LT
DN
ER
D
TH
EL
I
RW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
TSP
G
E
R
Y
ST
Q
P
SL
T
----
FS
A
LDAITASELINDIE
S
LF
D
fig|749531.3.peg.2034
Escherichia coli MS 69-1 (313-1265/1265)
V
V
TD
D
A
A
L
T
T
A
EKY
----
DF
P
-
PLY
H
D
---
FR
A---
YL
A
ML
L
ANN
GV
R
GVS
R
ILLEFTE
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AALAA
Y
A
T
LL
A
IH
E
D
AQ
WR
KA
L
VKLITAT
P
D
L
V
T
D
VIPW
VNAK
A
AGI
L
SE
C
R
T
Q
L
VAEECEY
A
T
AD
M
LP
E
L
F
V
A
PPW
V
I
N
KK
T
NV
IPV
F
DL
PV
L
P
I
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
Y
Q
S
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
L
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
A
L
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEER
YS
GEKEI
L
AV
FG
S
T
A
I
P
A
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
I
N
WL
AN
N
P
R
HA
A
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
A
F
R
L
L
V
K
LNQRETIE
EI
G
RRYNQP
D
V
LA
A
LAT
L
FDS
DPL
EEY
P
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
FP
RDIT
S
Y
A
GL
A
T
I
K
E
T
F
T
H
E
SLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LGNDDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
D
IGSD
V
ALM
L
LNGIA
R
K
I
KFKALQE
H
A
R
EKI
N
I
V
AE
N
R
G
LT
M
AELEDRLAPDLGLD
N
S
GSL
I
LDFGPRQFTV
G
FDETLKP
V
V
H
DA
N
G
KV
LKDLPKPN
Q
SDD
Q
T
L
A
T
D
S
VN
LF
K
Q
LKKD
V
H
A
IA
S
QQ
ID
RLE
Q
D
MC
Q
RRRW
T
A
E
Q
F
R
LFLVEHPLVRHLTRR
V
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
A
Q
D
E
LFTLP
A
G
N
I
-
-
GIPHVLEISP
ES
A
T
AFGQ
IY
ADYE
K
LPPFRQLDR
G
Y
Y
H
LT
DN
ER
D
TH
EL
I
RW
Q
GR
L
C
QA
GR
IV
GL
ERR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
G
A
L
EL
E
TE
H
---------
-
F
SLIYGETGY
S
D
LL
PVESVKI
-
TSP
G
E
R
Y
ST
Q
P
SL
T
----
FS
A
LDAITASELINDIE
S
LF
D
fig|749527.3.peg.1471
Escherichia coli MS 21-1 (313-1265/1265)
V
V
TD
D
A
A
L
T
A
A
EKY
----
DF
P
-
PLYR
D
---
FR
A---
YL
A
ML
L
ANN
GV
S
GVS
R
IL
T
EF
V
E
E
HSDNPTYL
F
ERI
-
S
ETE
D
LVKW
L
WKT
NHP
D
A
IQI
LI
LGVI
G
K
K
KHLEYLS
KA
C
QKH
P
A
AA
I
AA
Y
A
T
LL
A
IH
E
DKE
W
H
KA
L
VKLIT
I
T
P
E
L
VCD
VIPW
VNAK
A
AGI
L
SE
C
RP
Q
SVAEE
Y
EY
A
T
AD
M
LP
Q
L
F
V
A
PPW
V
I
N
KKK
NV
IPV
F
DL
PV
L
P
V
P
A
V
TD
I
T
PG
I
T
ELISH
T
-
DISR
F
SE
IAQ
Y
QA
SQ
QTLFTDLP
L
I
E
K
ES
WE
TSF
I
PFT
P
E
--
-
--Q
Q
I
L
WQ
LGF
N
E
W
R
HCEDDL
H
E
KK
Y
I
P
Q
SAVDA
L
LR
F
DF
P
TL
KA
EF
AK
Y
H
N
N
ANKS
W
N
L
SA
L
CY
LP
G
Q
Q
AI
S
F
L
N
Q
I
I
IEEQ
YS
GEKEI
L
GI
FG
S
T
A
I
P
S
F
M
T
C
L
Q
R
D
H
Q
R
LWIFTL
F
I
G
ASELA
LPM
A
Q
R
LQ
K
-
K
M
A
Y
K
DA
V
N
WL
AN
N
P
R
HA
A
A
GLLP
L
ALGK
PC
Q
N
R
E
Y
AR
Q
AL
L
L
L
V
K
LNQRETIE
EIA
QG
YNQP
D
I
LA
A
LAT
L
FDS
DPL
E
D
HP
A
KI
AP
LP
G
FYQ
FT
LW
R
RP
R
LK
S
N
NLP
LPD
D
A
MR
HLG
T
ML
S
F
S
RDITA
Y
A
GL
E
II
KD
T
F
T
R
DSLA
D
F
G
WDL
Y
TAW
TE
AGAP
A
KE
N
WAFT
S
LG
I
LG
D
DDTARKLTPLIRAWPGESQHKRA
V
Y
GLD
V
LA
S
IGSDIALM
L
LNGIAQK
I
KF
A
ALQE
H
A
S
D
KI
N
M
V
A
K
N
R
G
LT
M
AELEDRLAPDLGLD
I
NGSL
T
LDFGPRQFTV
G
FDETLKP
M
VRDA
N
G
KV
LKDLPKPN
Q
SDD
K
T
L
A
T
DAVN
LF
K
Q
LKKD
V
R
A
IA
S
QQ
ID
RLE
Q
AMC
Q
RRRW
T
T
E
Q
F
R
LF
M
VEHPLVRHLTRRL
L
WGVY
T
E
EN
T
L
I
ACFRVAED
ST
YS
D
M
Q
D
E
LFTLP
A
G
N
I
-
-
GIPH
M
LEI
P
P
ES
AAAF
R
Q
IY
ADYELLPPF
Q
QLDR
G
SY
R
L
ADN
E
Q
N
TH
ELTRW
S
GR
L
C
QA
GR
IV
GL
V
RR
GW
Q
R
L
E
--
E
S
G
SVYA
M
R
K
S
TP
Y
GD
L
EL
E
TE
H
---------
-
F
SLIYGETGYGD
LL
PVESVKI
-
TSP
D
DR
YG
KQ
PL
L
T
----
FS
M
LD
D
ITASELINDIE
S
LF
D
Consen1
Primary consensus
VaTDe
iLaslEKY
----
hePyaifdD
---
yy
wsAtvLqeqGV
alpR
-------
--------
Fapy
-
asd
cadvLrhiNHPfAltlLIrvag
tKrchdrmtKA
aafPhAAlAAlaeLL
qkEensWRimLmtmlisqP
LaeqVIPWlstpAvavLksCq
-
QqltqpsnhAsaDlLPav
VsPPWl
kkKK
iPVlDLapL
ie
i
lT
is
--
-
---
--
----------
k
We
i
he
--
e
n
L
lGf
rw
-
-
yi
apesavdAwlreDf
tL
eF
fHsp
W
L
L
LP
qkAik
wn
-
l
ht
m
fglaglpgfv
sl
RyPqe
yfaasELApavAr
fnKlKtlredArsWL
yPeHA
tGLLP
ALGK
eaqDnARaALRmLtenghqpllqEIArrYNQPeVtdAvnaLlalDPLdnhPtKIptLP
FYQpsLWtRP
LKaNaqsLpDsAllHLGeMLrFPqeealYpGL
qvkd
ct
dSLA
FaWDLftAWqtAGAPsKEsWAFTaLGvLGNDDTARKLTPLIRAWPGESQHKRAtvGLDiLAaIGSDiALMqLNGIAqKlKFkALQErAkEKIa
iAEsReLTvAELEDRLAPDLGLDdnGSL
LDFGPRqFTVsFDETLKPfVrDasGsrLKDLPKPNkSDdet
AndAVNryKlLKKDaRtiAaQQvaRLEsAMClRRRWs
EnFqLFLVEHPLVRHLTRRLiWGVYs
EN
LlaCFRVAEDnsYStAdDdLFTLPeGdIs
GiPHVLEISPtdAAAFgQlfADYELLPPFrQLdRnSYaLteaERNasELtRW
GRkCpsGRvmGLankGW
rgepqd
GwigwMik
lg
wsLimEide
---------
gFavgmspaelsa
llsklwlwegk
esYGwgSnsTqeaqFSvLDaITASELINDIEaLFe
Consen2
Secondary consensus
v
d
a
v
df
-
plyr
fr
yl
ml
ann
gvs
illefte
hsdnptyl
eri
ete
lvkw
wkt
d
iqi
lgvi
k
khleyls
qkh
a
ytt
ih
dke
ka
vklitat
vcd
vnak
agi
se
rp
svaeecey
tv
m
e
a
ne
t
f
pv
p
v
e
ty
f
iaq
qa
qtlftdlppi
h
y
-
pkpg
-
q
m
y
rr
y
kk
is
avie
lenf
y
a
m
n
eq
vs
leq
ys
l
lrska
sal
cf
d
rr
flgvt
lpm
q
lq
-
msyqn
sn
n
r
a
kdr
c
q
l
vnlnqretie
qg
d
la
lat
fds
eey
a
ap
ft
r
s
nlp
s
d
mr
t
s
rdita
a
iire
fs
e
g
yp
te
a
n
s
i
vs
v
d
v
l
r
i
v
h
r
n
v
n
g
m
ss
k
g
v
c
vn
kv
q
eks
te
lf
q
v
av
s
id
q
q
t
q
r
l
it
st
d
q
e
a
n
-
t
es
r
iy
q
e
g
h
adn
th
i
l
qa
iv
err
klt
--
e
svya
rn
tp
gd
el
tep
-
sliygetgygd
pvesvki
-
tsp
dr
kq
sl
----
m
d
s
d
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character