fig|585055.6.peg.698
Escherichia coli 55989 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGL
N
YTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585055.8.peg.700
Escherichia coli 55989 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGL
N
YTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|1040638.4.peg.5049
Escherichia coli O104:H4 str. LB226692
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGL
N
YTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
D
GIFISIR
fig|585034.4.peg.672
Escherichia coli IAI1 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585034.5.peg.671
Escherichia coli IAI1 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|409438.11.peg.887
Escherichia coli SE11 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585055.6.peg.4446
Escherichia coli 55989 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585055.8.peg.4450
Escherichia coli 55989 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331111.12.peg.4332
Escherichia coli E24377A (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331111.3.peg.1735
Escherichia coli E24377A (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585055.6.peg.4084
Escherichia coli 55989 (1-1227/1321)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585055.8.peg.4087
Escherichia coli 55989 (1-1227/1321)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|409438.11.peg.4043
Escherichia coli SE11 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQG
D
LTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGD
I
PLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340184.3.peg.375
Escherichia coli B7A (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIA
Q
PGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340184.6.peg.395
Escherichia coli B7A (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIA
Q
PGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.371.peg.1871
Escherichia coli 1044A (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|83334.1.peg.4839
Escherichia coli O157:H7 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478006.5.peg.2164
Escherichia coli O157:H7 str. EC4501 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|386585.9.peg.5086
Escherichia coli O157:H7 str. Sakai (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|502346.5.peg.1734
Escherichia coli O157:H7 str. TW14588 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.373.peg.2622
Escherichia coli 1125A (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.372.peg.3653
Escherichia coli 1212A (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.374.peg.3612
Escherichia coli 536A (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444454.5.peg.3970
Escherichia coli O157:H7 str. EC4024 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444449.5.peg.3425
Escherichia coli O157:H7 str. EC4042 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444448.5.peg.2180
Escherichia coli O157:H7 str. EC4045 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444453.5.peg.2322
Escherichia coli O157:H7 str. EC4076 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444452.5.peg.1790
Escherichia coli O157:H7 str. EC4113 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444450.8.peg.5262
Escherichia coli O157:H7 str. EC4115 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444451.5.peg.2836
Escherichia coli O157:H7 str. EC4196 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444447.5.peg.2346
Escherichia coli O157:H7 str. EC4206 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478004.5.peg.2396
Escherichia coli O157:H7 str. EC4401 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478005.5.peg.2244
Escherichia coli O157:H7 str. EC4486 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478007.5.peg.2221
Escherichia coli O157:H7 str. EC508 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|544404.4.peg.5072
Escherichia coli O157:H7 str. TW14359 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331111.12.peg.1023
Escherichia coli E24377A (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331111.3.peg.3242
Escherichia coli E24377A (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478008.5.peg.2068
Escherichia coli O157:H7 str. EC869 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
S
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|566546.4.peg.3830
Escherichia coli W (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
N
GDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585396.4.peg.4438
Escherichia coli O111:H- str. 11128 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|701177.3.peg.4355
Escherichia coli O55:H7 str. CB9615 (1-1227/1425)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
G
LTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331112.3.peg.695
Escherichia coli HS (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331112.6.peg.726
Escherichia coli HS (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|344601.3.peg.1791
Escherichia coli B171 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|344601.5.peg.1871
Escherichia coli B171 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.371.peg.3206
Escherichia coli 1044A (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|83334.1.peg.4441
Escherichia coli O157:H7 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|478006.5.peg.299
Escherichia coli O157:H7 str. EC4501 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|386585.9.peg.4682
Escherichia coli O157:H7 str. Sakai (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|502346.5.peg.2160
Escherichia coli O157:H7 str. TW14588 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|679207.4.peg.188
Escherichia coli MS 107-1 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQM
K
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLL
SD
ENP
HH
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.373.peg.1621
Escherichia coli 1125A (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444454.5.peg.3568
Escherichia coli O157:H7 str. EC4024 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444449.5.peg.3022
Escherichia coli O157:H7 str. EC4042 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444448.5.peg.1776
Escherichia coli O157:H7 str. EC4045 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444453.5.peg.1307
Escherichia coli O157:H7 str. EC4076 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444452.5.peg.35
Escherichia coli O157:H7 str. EC4113 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444450.8.peg.4857
Escherichia coli O157:H7 str. EC4115 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444451.5.peg.209
Escherichia coli O157:H7 str. EC4196 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|444447.5.peg.1939
Escherichia coli O157:H7 str. EC4206 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|478004.5.peg.333
Escherichia coli O157:H7 str. EC4401 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|478005.5.peg.287
Escherichia coli O157:H7 str. EC4486 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|478007.5.peg.306
Escherichia coli O157:H7 str. EC508 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|544404.4.peg.4668
Escherichia coli O157:H7 str. TW14359 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|155864.1.peg.721
Escherichia coli O157:H7 EDL933 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRIT
X
TDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
X
QLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
L
X
WQHET
R
HAYN
A
QGLANR
CI
PDS
X
P
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|701177.3.peg.4755
Escherichia coli O55:H7 str. CB9615 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEEN
L
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331112.3.peg.3453
Escherichia coli HS (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331112.6.peg.3587
Escherichia coli HS (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478008.5.peg.828
Escherichia coli O157:H7 str. EC869 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
S
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|562.375.peg.421
Escherichia coli EC4100B (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLV
D
S
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|595496.3.peg.3464
Escherichia coli BW2952 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|536056.3.peg.245
Escherichia coli DH1 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316407.3.peg.3646
Escherichia coli W3110 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316385.5.peg.3603
Escherichia coli str. K-12 substr. DH10B (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316385.7.peg.3684
Escherichia coli str. K-12 substr. DH10B (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511145.12.peg.3581
Escherichia coli str. K-12 substr. MG1655 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511145.6.peg.3564
Escherichia coli str. K-12 substr. MG1655 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316401.4.peg.4223
Escherichia coli ETEC H10407 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGL
Q
LALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|83333.1.peg.3415
Escherichia coli K12 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQM
K
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|155864.8.peg.748
Escherichia coli O157:H7 EDL933 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRIT
X
TDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
X
QLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
L
X
WQHET
R
HAYN
A
QGLAN
X
CI
PDS
X
P
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPG
X
FTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|155864.1.peg.4481
Escherichia coli O157:H7 EDL933 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYIT
H
DPIGL
K
G
fig|155864.8.peg.4440
Escherichia coli O157:H7 EDL933 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
A
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYIT
H
DPIGL
K
G
fig|340185.3.peg.3890
Escherichia coli E22 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340185.4.peg.4100
Escherichia coli E22 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|413997.3.peg.3489
Escherichia coli B str. REL606 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511693.5.peg.3505
Escherichia coli BL21 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|469008.4.peg.275
Escherichia coli BL21(DE3) (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585395.4.peg.4862
Escherichia coli O103:H2 str. 12009 (1-1227/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYG
R
T
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|573235.3.peg.4681
Escherichia coli O26:H11 str. 11368 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|595496.3.peg.627
Escherichia coli BW2952 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|536056.3.peg.3096
Escherichia coli DH1 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|83333.1.peg.692
Escherichia coli K12 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316407.3.peg.675
Escherichia coli W3110 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316385.5.peg.764
Escherichia coli str. K-12 substr. DH10B (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316385.7.peg.776
Escherichia coli str. K-12 substr. DH10B (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511145.12.peg.731
Escherichia coli str. K-12 substr. MG1655 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511145.6.peg.722
Escherichia coli str. K-12 substr. MG1655 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.3.peg.3167
Escherichia coli ATCC 8739 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AS
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.6.peg.3152
Escherichia coli ATCC 8739 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AS
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|670888.3.peg.2766
Escherichia coli 1827-70 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
YA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PL
I
ESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|358709.5.peg.199
Escherichia coli 101-1 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAV
Y
YGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QG
Q
ANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|573235.3.peg.778
Escherichia coli O26:H11 str. 11368 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
AD
W
VSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585396.4.peg.4570
Escherichia coli O111:H- str. 11128 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
A
A
RVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.372.peg.1697
Escherichia coli 1212A (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.374.peg.5281
Escherichia coli 536A (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|83334.1.peg.800
Escherichia coli O157:H7 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444451.5.peg.4504
Escherichia coli O157:H7 str. EC4196 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|386585.9.peg.845
Escherichia coli O157:H7 str. Sakai (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|502346.5.peg.462
Escherichia coli O157:H7 str. TW14588 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316401.4.peg.4362
Escherichia coli ETEC H10407 (1-1227/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|344610.7.peg.990
Escherichia coli 53638 (1-1227/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444454.5.peg.5222
Escherichia coli O157:H7 str. EC4024 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444449.5.peg.5558
Escherichia coli O157:H7 str. EC4042 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444448.5.peg.3433
Escherichia coli O157:H7 str. EC4045 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444453.5.peg.775
Escherichia coli O157:H7 str. EC4076 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444452.5.peg.3560
Escherichia coli O157:H7 str. EC4113 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444450.8.peg.892
Escherichia coli O157:H7 str. EC4115 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|444447.5.peg.3613
Escherichia coli O157:H7 str. EC4206 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478004.5.peg.3837
Escherichia coli O157:H7 str. EC4401 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|478005.5.peg.1280
Escherichia coli O157:H7 str. EC4486 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|544404.4.peg.757
Escherichia coli O157:H7 str. TW14359 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|573235.3.peg.4775
Escherichia coli O26:H11 str. 11368 (1-1227/1394)
MSGKPAARQGDMTQYG
S
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLAN
L
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|413997.3.peg.3621
Escherichia coli B str. REL606 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|562.373.peg.3029
Escherichia coli 1125A (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
H
GRYITQDPIGL
K
G
fig|340184.3.peg.2297
Escherichia coli B7A (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
R
X
GRLTEKTDLIPEG
X
I
X
TDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVE
X
RYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340184.6.peg.2411
Escherichia coli B7A (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDS
A
L
S
MWPDNRIARDAHYLYRYD
R
X
GRLTEKTDLIPEG
X
I
X
TDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVE
X
RYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|679205.4.peg.1938
Escherichia coli MS 124-1 (1-1237/1387)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
PLVESRYLYD
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGN
Q
LNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
R
GRYITQDPIGL
K
G
fig|478008.5.peg.2122
Escherichia coli O157:H7 str. EC869 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLY
W
YD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|566546.4.peg.4232
Escherichia coli W (1-1223/1390)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.3.peg.246
Escherichia coli ATCC 8739 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AS
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.6.peg.256
Escherichia coli ATCC 8739 (1-1227/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AS
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|595496.3.peg.3592
Escherichia coli BW2952 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|536056.3.peg.117
Escherichia coli DH1 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|83333.1.peg.3527
Escherichia coli K12 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316407.3.peg.3535
Escherichia coli W3110 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316385.5.peg.3725
Escherichia coli str. K-12 substr. DH10B (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316385.7.peg.3810
Escherichia coli str. K-12 substr. DH10B (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511145.12.peg.3710
Escherichia coli str. K-12 substr. MG1655 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|511145.6.peg.3692
Escherichia coli str. K-12 substr. MG1655 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|701177.3.peg.856
Escherichia coli O55:H7 str. CB9615 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
G
LTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
D
S
GRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
T
E
W
C
AEYDEWGNLLNEEN
S
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340186.3.peg.1732
Escherichia coli E110019 (1-1223/1373)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
R
GRYITQDPIGL
K
G
fig|340186.5.peg.1792
Escherichia coli E110019 (1-1223/1373)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
R
GRYITQDPIGL
K
G
fig|585396.4.peg.751
Escherichia coli O111:H- str. 11128 (1-1223/1393)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585396.4.peg.4936
Escherichia coli O111:H- str. 11128 (1-1223/1390)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
A
A
RVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585395.4.peg.737
Escherichia coli O103:H2 str. 12009 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.3.peg.127
Escherichia coli ATCC 8739 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.6.peg.128
Escherichia coli ATCC 8739 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340186.3.peg.3871
Escherichia coli E110019 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
M
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
I
YGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340186.5.peg.4064
Escherichia coli E110019 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
M
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
I
YGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|316401.4.peg.829
Escherichia coli ETEC H10407 (1-1227/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMV
V
HR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSF
D
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPY
T
TDPAGNRLPDPELHPDS
P
L
S
MWPD
H
RIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLAL
V
S
T
EG
A
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|670888.3.peg.2635
Escherichia coli 1827-70 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
YA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
N
ELLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PL
I
ESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGN
Q
LNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
R
GRYITQDPIGL
K
G
fig|344610.7.peg.2616
Escherichia coli 53638 (1-1227/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
TRIQT
I
YQPGSFTPLIRVET
A
T
GE
L
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTV
A
QMQ
S
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
T
TAW
Y
AEYDEWGN
Q
LNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
R
GRYITQDPIGL
K
G
fig|573235.3.peg.5159
Escherichia coli O26:H11 str. 11368 (1-1223/1373)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
C
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWLT
D
ISH
I
SEGHRVAVHYGYD
S
KGRL
A
S
E
H
L
TVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLA
K
R
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEG
G
IRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGN
Q
LNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPL
R
GRYITQDPIGL
K
G
fig|409438.11.peg.4404
Escherichia coli SE11 (1-1207/1374)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLN
-
---
----
-
-----------
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
R
YGRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
L
A
R
T
QRRSLA
D
A
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
A
A
RVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|585395.4.peg.4804
Escherichia coli O103:H2 str. 12009 (1-1205/1389)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYG
R
T
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKDTQGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHYGYD
E
KGRLTGERQTVHHP
Q
T
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNS
LLS
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLT
S
VHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
A
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
CGLTVEQMQ
N
QM
D
PVYTPARKIHLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIR
----------------------
LQGRYITQDPIGL
K
G
fig|331111.12.peg.4692
Escherichia coli E24377A (1-1198/1365)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
C
--------
-
--------------------
GLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|331111.3.peg.2089
Escherichia coli E24377A (1-1198/1365)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEAR
Q
Q
AI
S
G
-
G
TE
P
---
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQMTAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKDTQGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWLT
D
ISH
I
SEGHRV
T
VHY
R
YD
E
KGRLTGERQTVHHPET
EA
LLWQHET
R
HAYN
A
QGLANR
CI
PDSLP
A
VEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNS
LQY
DRDY
T
WNDNGELIRIS
S
PRQTR
S
Y
S
YS
T
TGRLTGVHTTA
A
NLDIRIPYATDPAGNRLPDPELHPDSTL
S
MWPDNRIARDAHYLYRYD
RH
GRLTEKTDLIPEGVIRTDDERTH
R
YHYDSQHRLVHYTR
T
QY
E
E
----------
PLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
Q
VTWYGWDGDRLTTIQ
N
D
R
S
RIQT
I
YQPGSFTPLIRVET
A
T
GE
Q
AK
T
QRRSLA
D
T
LQQ
S
G
G
E
D
G
G
S
VVFP
PV
LV
Q
MLDRLE
S
EI
L
ADRVSEESR
R
WLA
S
C
--------
-
--------------------
GLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENP
HQ
L
Q
QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|481805.3.peg.1772
Escherichia coli ATCC 8739 (1-1229/1251)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSG
N
PVNPLLGAKVLPGETD
F
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DI
H
LQLRDN
E
LILNDNGGRSIHFEHLFPGE
DG
F
SRSE
L
F
WLVRGGVA
K
L
NE
S
H
R
LA
P
LWQALPEELRLSPH
I
YLATNS
P
QGPWWILGW
S
ERVP
G
V
DE
M
LPAPLPPYRVLTGLVDRFGRTLTF
R
REAAGE
F
T
GEITGVTDGAGR
Q
FRLVLTTQAQRAE
N
AR
Q
Q
AI
AA
GAKG
P
D--
-
--
-----
-
--I
PD
S
LP
D
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRY
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMVAHRYAGRPE
M
RYRYDD
T
GRVTEQ
F
NPAGLSYTYQYEK
N
RITITDSLNRREVLHTEGEAGLK
C
VVK
T
E
L
ADGS
I
T
R
S
K
FD
YM
GRL
Q
S
QTDAAGRTTEYSP
N
VVTG
L
V
T
C
ITTPDGR
KSE
FYYN
N
QN
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
TQ
ETAR
N
GD
V
TRY
S
YDNPHS
E
LP
SA
T
E
DATGSRK
Q
MTWSRYGQL
Q
T
FTDCSGY
E
T
H
YEYDRFGQM
M
AVHREEGIS
T
Y
N
T
Y
N
P
RGQL
V
S
W
KDTQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNGEPAEQW
R
Y
N
D
HGWLT
E
ISH
L
SEGHRVAVHYGY
N
R
KGRLTGERQTVHHPETGELLWQHET
K
H
T
YNEQGLANR
FQ
A
DSLP
P
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
R
LQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYA
S
DPAGNRLPDPELHPDSTLT
A
WPDNRI
T
K
DAHYLYRYDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
E
NGE
L
D
KAQRRSL
V
E
K
LQQ
E
G
S
E
D
G
H
GVVFP
V
ELV
R
MLDRLE
G
EIRADRVS
S
ESR
A
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RK
V
HLYHCDHRGLPLALIS
E
D
G
N
T
M
W
S
AEYDEWGNLLNEENP
HH
L
Y
Q
PY
RLPGQQYD
D
ESGL
C
YNR
N
RYYDPLQGRYITQDPIGL
S
G
fig|481805.6.peg.1765
Escherichia coli ATCC 8739 (1-1229/1251)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GGV
---
TSG
N
PVNPLLGAKVLPGETD
F
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DI
H
LQLRDN
E
LILNDNGGRSIHFEHLFPGE
DG
F
SRSE
L
F
WLVRGGVA
K
L
NE
S
H
R
LA
P
LWQALPEELRLSPH
I
YLATNS
P
QGPWWILGW
S
ERVP
G
V
DE
M
LPAPLPPYRVLTGLVDRFGRTLTF
R
REAAGE
F
T
GEITGVTDGAGR
Q
FRLVLTTQAQRAE
N
AR
Q
Q
AI
AA
GAKG
P
D--
-
--
-----
-
--I
PD
S
LP
D
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRY
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMVAHRYAGRPE
M
RYRYDD
T
GRVTEQ
F
NPAGLSYTYQYEK
N
RITITDSLNRREVLHTEGEAGLK
C
VVK
T
E
L
ADGS
I
T
R
S
K
FD
YM
GRL
Q
S
QTDAAGRTTEYSP
N
VVTG
L
V
T
C
ITTPDGR
KSE
FYYN
N
QN
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
TQ
ETAR
N
GD
V
TRY
S
YDNPHS
E
LP
SA
T
E
DATGSRK
Q
MTWSRYGQL
Q
T
FTDCSGY
E
T
H
YEYDRFGQM
M
AVHREEGIS
T
Y
N
T
Y
N
P
RGQL
V
S
W
KDTQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNGEPAEQW
R
Y
N
D
HGWLT
E
ISH
L
SEGHRVAVHYGY
N
R
KGRLTGERQTVHHPETGELLWQHET
K
H
T
YNEQGLANR
FQ
A
DSLP
P
VEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
R
LQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYA
S
DPAGNRLPDPELHPDSTLT
A
WPDNRI
T
K
DAHYLYRYDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
E
NGE
L
D
KAQRRSL
V
E
K
LQQ
E
G
S
E
D
G
H
GVVFP
V
ELV
R
MLDRLE
G
EIRADRVS
S
ESR
A
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RK
V
HLYHCDHRGLPLALIS
E
D
G
N
T
M
W
S
AEYDEWGNLLNEENP
HH
L
Y
Q
PY
RLPGQQYD
D
ESGL
C
YNR
N
RYYDPLQGRYITQDPIGL
S
G
fig|331111.12.peg.1921
Escherichia coli E24377A (1-1241/1405)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
V
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
G
VP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTL
AYR
C
EAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
P
HT
A
S
-
L
SSP
DSP
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|331111.3.peg.4081
Escherichia coli E24377A (1-1241/1405)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
V
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
G
VP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTL
AYR
C
EAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
P
HT
A
S
-
L
SSP
DSP
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|585055.6.peg.522
Escherichia coli 55989 (1-1241/1422)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDA
S
GR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|585055.8.peg.523
Escherichia coli 55989 (1-1241/1422)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDA
S
GR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|749537.3.peg.955
Escherichia coli MS 115-1 (1-1238/1406)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
V
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
T
V
W
S
AEYDEWGN
Q
LNEENP
YY
L
Y
Q
PY
RLPGQQYDEESGL
D
YNRHRYYDPLQGRYITQDPIGL
A
G
fig|679205.4.peg.4676
Escherichia coli MS 124-1 (1-1238/1406)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
V
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
T
V
W
S
AEYDEWGN
Q
LNEENP
YY
L
Y
Q
PY
RLPGQQYDEESGL
D
YNRHRYYDPLQGRYITQDPIGL
A
G
fig|749533.3.peg.773
Escherichia coli MS 84-1 (1-1238/1406)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
V
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
T
V
W
S
AEYDEWGN
Q
LNEENP
YY
L
Y
Q
PY
RLPGQQYDEESGL
D
YNRHRYYDPLQGRYITQDPIGL
A
G
fig|595496.3.peg.420
Escherichia coli BW2952 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|536056.3.peg.3290
Escherichia coli DH1 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|83333.1.peg.493
Escherichia coli K12 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|316407.3.peg.482
Escherichia coli W3110 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|316385.5.peg.453
Escherichia coli str. K-12 substr. DH10B (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|316385.7.peg.460
Escherichia coli str. K-12 substr. DH10B (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|511145.12.peg.518
Escherichia coli str. K-12 substr. MG1655 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|511145.6.peg.512
Escherichia coli str. K-12 substr. MG1655 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|344610.7.peg.5120
Escherichia coli 53638 (1-1238/1407)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHR
H
YDPLQGRYIT
P
DPIGL
R
G
fig|585396.4.peg.1924
Escherichia coli O111:H- str. 11128 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|679206.4.peg.2994
Escherichia coli MS 119-7 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
W
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
C
G
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|316401.4.peg.630
Escherichia coli ETEC H10407 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAV
Y
YGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|413997.3.peg.1488
Escherichia coli B str. REL606 (1-1238/1407)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHR
H
YDPLQGRYIT
P
DPIGL
R
G
fig|413997.3.peg.481
Escherichia coli B str. REL606 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|511693.5.peg.486
Escherichia coli BL21 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|469008.4.peg.3264
Escherichia coli BL21(DE3) (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|562.375.peg.3784
Escherichia coli EC4100B (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|679207.4.peg.1730
Escherichia coli MS 107-1 (1-1241/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
K
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|585034.4.peg.498
Escherichia coli IAI1 (1-1241/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
K
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|585034.5.peg.497
Escherichia coli IAI1 (1-1241/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
K
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|573235.3.peg.2092
Escherichia coli O26:H11 str. 11368 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
D
T
RLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|478008.5.peg.3679
Escherichia coli O157:H7 str. EC869 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|573235.3.peg.541
Escherichia coli O26:H11 str. 11368 (1-1241/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
P
HT
A
S
-
L
SSP
DSP
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
G
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
D
ESR
A
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|409438.11.peg.651
Escherichia coli SE11 (1-1241/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
K
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|409438.11.peg.1684
Escherichia coli SE11 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVC
Q
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|316401.4.peg.1755
Escherichia coli ETEC H10407 (1-1238/1407)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLW
H
HET
G
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPL
L
E
F
TRDRLHRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHR
H
YDPLQGRYIT
P
DPIGL
R
G
fig|749547.3.peg.1423
Escherichia coli MS 187-1 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLW
H
HET
G
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPL
L
E
F
TRDRLHRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|331111.12.peg.843
Escherichia coli E24377A (1-1241/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
K
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|331111.3.peg.3071
Escherichia coli E24377A (1-1241/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
RT
A
S
-
L
SSP
DTP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
K
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDS
L
HRLVHY
I
RIQY
E
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|358709.5.peg.2056
Escherichia coli 101-1 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILG
G
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGE
C
QTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
W
S
DNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
V
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|585034.4.peg.1442
Escherichia coli IAI1 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QG
L
WWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|585034.5.peg.1439
Escherichia coli IAI1 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QG
L
WWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|562.371.peg.1754
Escherichia coli 1044A (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|562.373.peg.5099
Escherichia coli 1125A (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|562.372.peg.1238
Escherichia coli 1212A (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|562.374.peg.2346
Escherichia coli 536A (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|83334.1.peg.2091
Escherichia coli O157:H7 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|444454.5.peg.966
Escherichia coli O157:H7 str. EC4024 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|444449.5.peg.292
Escherichia coli O157:H7 str. EC4042 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|444448.5.peg.4648
Escherichia coli O157:H7 str. EC4045 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|444452.5.peg.1972
Escherichia coli O157:H7 str. EC4113 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|444450.8.peg.2110
Escherichia coli O157:H7 str. EC4115 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|444447.5.peg.5560
Escherichia coli O157:H7 str. EC4206 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|478004.5.peg.2835
Escherichia coli O157:H7 str. EC4401 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|478005.5.peg.2985
Escherichia coli O157:H7 str. EC4486 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|478006.5.peg.1955
Escherichia coli O157:H7 str. EC4501 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|478007.5.peg.2158
Escherichia coli O157:H7 str. EC508 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|386585.9.peg.2162
Escherichia coli O157:H7 str. Sakai (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|544404.4.peg.1972
Escherichia coli O157:H7 str. TW14359 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DPP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|502346.5.peg.5309
Escherichia coli O157:H7 str. TW14588 (1-1236/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|585395.4.peg.1664
Escherichia coli O103:H2 str. 12009 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DR
A
Y
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|566546.4.peg.1575
Escherichia coli W (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|562.371.peg.2767
Escherichia coli 1044A (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|562.373.peg.3227
Escherichia coli 1125A (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|562.372.peg.1499
Escherichia coli 1212A (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|562.374.peg.5477
Escherichia coli 536A (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|155864.1.peg.554
Escherichia coli O157:H7 EDL933 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|155864.8.peg.568
Escherichia coli O157:H7 EDL933 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444454.5.peg.5043
Escherichia coli O157:H7 str. EC4024 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444450.8.peg.716
Escherichia coli O157:H7 str. EC4115 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|478004.5.peg.1424
Escherichia coli O157:H7 str. EC4401 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|478006.5.peg.892
Escherichia coli O157:H7 str. EC4501 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|478008.5.peg.1656
Escherichia coli O157:H7 str. EC869 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|502346.5.peg.643
Escherichia coli O157:H7 str. TW14588 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444449.5.peg.5380
Escherichia coli O157:H7 str. EC4042 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444448.5.peg.3255
Escherichia coli O157:H7 str. EC4045 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444453.5.peg.596
Escherichia coli O157:H7 str. EC4076 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444452.5.peg.931
Escherichia coli O157:H7 str. EC4113 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444451.5.peg.2216
Escherichia coli O157:H7 str. EC4196 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|444447.5.peg.3428
Escherichia coli O157:H7 str. EC4206 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|544404.4.peg.580
Escherichia coli O157:H7 str. TW14359 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|344610.3.peg.1505
Escherichia coli 53638 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
P
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQW
R
YD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDRLHRET
V
RSFG
N
GT
GSN
A
A
YELT
ST
YTPAG
R
LQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
E
GV
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRI
T
K
DAHYLYRYDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|344610.7.peg.1175
Escherichia coli 53638 (1-1238/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
P
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQW
R
YD
G
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDRLHRET
V
RSFG
N
GT
GSN
A
A
YELT
ST
YTPAG
R
LQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
E
GV
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRI
T
K
DAHYLYRYDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
V
Y
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GL
K
G
fig|83334.1.peg.635
Escherichia coli O157:H7 (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|386585.9.peg.667
Escherichia coli O157:H7 str. Sakai (1-1236/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQT
W
EY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|344601.3.peg.228
Escherichia coli B171 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HD
X
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DR
A
Y
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRL
D
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|344601.5.peg.225
Escherichia coli B171 (1-1238/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HD
X
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DR
A
Y
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRL
D
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|670888.3.peg.2126
Escherichia coli 1827-70 (1-1238/1335)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
L
LT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
V
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
T
V
W
S
AEYDEWGN
Q
LNEENP
YY
L
Y
Q
PY
RLPGQQYDEESGL
D
YNRHRYYDPLQGRYITQDPIGL
A
G
fig|585396.4.peg.552
Escherichia coli O111:H- str. 11128 (1-1241/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
P
HT
A
S
-
L
SSP
DSP
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYP
D
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
G
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|478007.5.peg.914
Escherichia coli O157:H7 str. EC508 (1-1254/1416)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
Q
HT
A
S
-
L
SSP
DTP
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKD
A
QGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
TTQGGLTRSMEYDLAGRI
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWLT
E
ISH
L
SEGH
Q
VAVHYGYDDKGRL
A
GERQTVH
N
PETGELLWQHET
E
HAYNEQGLANR
VT
PDSLP
R
VEWLTYGSGYLAGMKLG
G
TPLVE
F
TRDRLHRET
V
RSFG
-
----
N
N
A
YELT
ST
YTPAG
H
LQSQ
R
LNS
QVY
DRDY
D
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
S
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRR
G
RDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|340186.3.peg.841
Escherichia coli E110019 (1-1241/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILS
L
TYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
S
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
P
HT
A
S
-
L
SSP
DSP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|340186.5.peg.878
Escherichia coli E110019 (1-1241/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILS
L
TYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
S
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
K
P
HT
A
S
-
L
SSP
DSP
R
PL
S
----
A
SAFPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|340184.3.peg.2563
Escherichia coli B7A (1-1238/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
W
LPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|340184.6.peg.2683
Escherichia coli B7A (1-1238/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
T
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDR
M
HRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
W
LPGQQYD
K
ESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|481805.3.peg.2362
Escherichia coli ATCC 8739 (1-1238/1268)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGL
D
YNRHRYYDPLQGRYITQDPIGL
A
G
fig|481805.6.peg.2353
Escherichia coli ATCC 8739 (1-1238/1268)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QTD
T
TRIQTVY
E
PGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
T
LQQ
E
G
S
E
N
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGL
D
YNRHRYYDPLQGRYITQDPIGL
A
G
fig|749531.3.peg.1549
Escherichia coli MS 69-1 (1-1238/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
G
R
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPL
S
FILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PG
G
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
T
ED
VLPAPLPPYRVLTGL
A
DRFG
Q
TLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DRS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
N
LP
G
APLVRY
TY
T
EA
GEL
L
AVYDRSGTQVR
A
FTYD
P
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
R
LT
AVV
Y
PDGLE
S
RR
A
YDE
R
D
RL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDR
I
THRTVNGEPAEQW
R
YD
G
HGWL
R
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
Q
AYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EY
A
RDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
A
S
V
R
T
L
A
P
D
LDIRIPYATDPAGNRL
Q
DPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRL
E
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPEVTWYGWDGD
H
LTT
V
QTD
S
TRIQTVY
E
PGSFTPLIR
I
ET
D
NGE
R
E
K
T
QRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
G
EIRADRVS
S
ESR
Q
WLAQCGLTVE
R
LA
T
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
G
EYDEWGNLLNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYY
N
PL
L
GRYITQDPIGL
A
G
fig|585395.4.peg.499
Escherichia coli O103:H2 str. 12009 (1-1238/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
-
-
-T
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYITQDPIGL
E
G
fig|340185.3.peg.2516
Escherichia coli E22 (1-1238/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
-
-
-T
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
X
T
L
A
P
D
LDIRIPYATDPAGNRLPDPE
X
HPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
X
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYITQDPIGL
E
G
fig|340185.4.peg.2656
Escherichia coli E22 (1-1238/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEAR
-
-
-T
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
X
T
L
A
P
D
LDIRIPYATDPAGNRLPDPE
X
HPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
X
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYITQDPIGL
E
G
fig|585055.6.peg.1619
Escherichia coli 55989 (1-1234/1413)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|585055.8.peg.1622
Escherichia coli 55989 (1-1234/1413)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCP
---
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|478008.5.peg.4739
Escherichia coli O157:H7 str. EC869 (1-1238/1407)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
T
LWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
V
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRLTEKTD
R
IPEGVIR
MH
DERTHHYHYD
N
QHRLV
F
YTRIQY
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QT
G
T
TRIQTVY
R
PGSFTPLIR
I
ET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
MLDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
L
E
K
Q
V
EP
E
YTPAR
T
L
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|637388.3.peg.780
Escherichia coli O157:H7 str. FRIK2000 (1-1238/1407)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
T
LWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
V
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRLTEKTD
R
IPEGVIR
MH
DERTHHYHYD
N
QHRLV
F
YTRIQY
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QT
G
T
TRIQTVY
R
PGSFTPLIR
I
ET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
MLDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
L
E
K
Q
V
EP
E
YTPAR
T
L
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|570506.3.peg.1715
Escherichia coli O157:H7 str. FRIK966 (1-1238/1407)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
T
LWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
V
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRLTEKTD
R
IPEGVIR
MH
DERTHHYHYD
N
QHRLV
F
YTRIQY
G
E
----------
PLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTT
V
QT
G
T
TRIQTVY
R
PGSFTPLIR
I
ET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
MLDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
L
E
K
Q
V
EP
E
YTPAR
T
L
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|585396.4.peg.245
Escherichia coli O111:H- str. 11128 (1-1240/1409)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
-
--
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
G
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|585034.4.peg.245
Escherichia coli IAI1 (1-1246/1415)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|585034.5.peg.244
Escherichia coli IAI1 (1-1246/1415)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|344601.3.peg.1933
Escherichia coli B171 (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|344601.5.peg.2014
Escherichia coli B171 (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|340185.3.peg.1517
Escherichia coli E22 (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|340185.4.peg.1604
Escherichia coli E22 (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|585395.4.peg.240
Escherichia coli O103:H2 str. 12009 (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSFTPLIRVET
E
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|331111.12.peg.571
Escherichia coli E24377A (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALP
C
PLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DA
M
GS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KD
A
QG
R
ET
P
YEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
R
M
PGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|331111.3.peg.2807
Escherichia coli E24377A (1-1246/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALP
C
PLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DA
M
GS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KD
A
QG
R
ET
P
YEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
T
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
R
M
PGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|595495.4.peg.4416
Escherichia coli KO11 (1-1238/1349)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAG
G
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
SS
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
I
IAPDGSRS
E
TQYDAWGKA
I
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
DA
Y
I
P
E
R
RL
HLYHCDHRGLPLALIS
P
EG
E
TAW
C
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|566546.3.peg.4451
Escherichia coli W (1-1238/1349)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAG
G
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
SS
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
I
IAPDGSRS
E
TQYDAWGKA
I
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
DA
Y
I
P
E
R
RL
HLYHCDHRGLPLALIS
P
EG
E
TAW
C
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|566546.4.peg.234
Escherichia coli W (1-1238/1349)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAG
G
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
SS
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
I
IAPDGSRS
E
TQYDAWGKA
I
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTD
R
IP
A
GVIRTDDERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
DA
Y
I
P
E
R
RL
HLYHCDHRGLPLALIS
P
EG
E
TAW
C
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|670888.3.peg.824
Escherichia coli 1827-70 (1-1238/1409)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
R
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
M
RY
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWLT
T
ISH
T
S
D
GHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YT
LT
GQLQS
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTDLIPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
R
V
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|701177.3.peg.242
Escherichia coli O55:H7 str. CB9615 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|562.371.peg.801
Escherichia coli 1044A (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|155864.1.peg.237
Escherichia coli O157:H7 EDL933 (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|155864.8.peg.239
Escherichia coli O157:H7 EDL933 (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|478006.5.peg.3846
Escherichia coli O157:H7 str. EC4501 (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|502346.5.peg.1024
Escherichia coli O157:H7 str. TW14588 (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|562.373.peg.2700
Escherichia coli 1125A (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444454.5.peg.4700
Escherichia coli O157:H7 str. EC4024 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444449.5.peg.4155
Escherichia coli O157:H7 str. EC4042 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444448.5.peg.2910
Escherichia coli O157:H7 str. EC4045 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444453.5.peg.4272
Escherichia coli O157:H7 str. EC4076 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444452.5.peg.3166
Escherichia coli O157:H7 str. EC4113 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444450.8.peg.379
Escherichia coli O157:H7 str. EC4115 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444451.5.peg.3677
Escherichia coli O157:H7 str. EC4196 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|444447.5.peg.3086
Escherichia coli O157:H7 str. EC4206 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|478004.5.peg.3911
Escherichia coli O157:H7 str. EC4401 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|478005.5.peg.3843
Escherichia coli O157:H7 str. EC4486 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|478007.5.peg.3043
Escherichia coli O157:H7 str. EC508 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|544404.4.peg.241
Escherichia coli O157:H7 str. TW14359 (1-1233/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
I
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|344610.3.peg.2570
Escherichia coli 53638 (1-1238/1411)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWILGW
S
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
R
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
M
RY
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWLT
T
ISH
T
S
D
GHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YT
LT
GQLQS
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLTVEQM
E
S
QME
AE
Y
I
P
E
R
RL
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|344610.7.peg.1444
Escherichia coli 53638 (1-1238/1411)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWILGW
S
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
R
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
M
RY
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWLT
T
ISH
T
S
D
GHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YT
LT
GQLQS
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLTVEQM
E
S
QME
AE
Y
I
P
E
R
RL
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
R
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|340186.3.peg.756
Escherichia coli E110019 (1-1237/1419)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
DRDY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|340186.5.peg.783
Escherichia coli E110019 (1-1237/1419)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
DRDY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
D
I
K
G
fig|83334.1.peg.324
Escherichia coli O157:H7 (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
G
A
Q
L
E
A
G
Y
I
P
E
RK
L
HLYHCD
Q
RGLPL
G
LIS
P
GR
E
TA
L
T
AEYDEWGNLL
S
E
T
SA
QP
L
Q
Q
S
L
R
F
PGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|386585.9.peg.339
Escherichia coli O157:H7 str. Sakai (1-1233/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
FH
TRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
G
A
Q
L
E
A
G
Y
I
P
E
RK
L
HLYHCD
Q
RGLPL
G
LIS
P
GR
E
TA
L
T
AEYDEWGNLL
S
E
T
SA
QP
L
Q
Q
S
L
R
F
PGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|409438.11.peg.355
Escherichia coli SE11 (4-1242/1411)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSP
---
N
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGR
S
L
I
FHREAAGE
L
A
GEITGVTDGAGR
R
F
H
L
A
L
S
TQAQRAE
A
F
R
K
Q
RV
T
S
-
L
SSP
AGP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
E
I
S
R
N
G
N
ITR
W
F
YD
S
SR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQL
Q
A
FTDCSGY
A
TRYEYDR
Y
GQ
QI
A
I
HREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
T
LTNENGS
Q
S
T
F
L
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
ISH
T
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
I
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
G
-----
E
A
C
EL
A
TA
W
N
TS
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQ
C
RSLAE
K
I
QQ
E
G
S
E
D
G
H
GVVFPAELV
G
L
LDRLE
G
EIRA
N
C
VS
S
ESR
Q
WLAQCGLTVE
R
LA
A
Q
I
EPVY
L
P
E
RKIHLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENP
HH
L
H
Q
PY
RLPGQQYD
K
ESGLYYNRHRYYDPLQGRYIT
P
DPIGL
R
G
fig|331112.3.peg.235
Escherichia coli HS (1-1238/1417)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
C
EGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|331112.6.peg.240
Escherichia coli HS (1-1238/1417)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
C
EGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|679206.4.peg.2787
Escherichia coli MS 119-7 (1-1241/1420)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSP
---
N
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
C
EGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|585034.4.peg.239
Escherichia coli IAI1 (1-1237/1408)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|585034.5.peg.239
Escherichia coli IAI1 (1-1237/1408)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
ASP
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
E
G
fig|585034.4.peg.1446
Escherichia coli IAI1 (1-1241/1420)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSP
---
N
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
C
EGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|585034.5.peg.1442
Escherichia coli IAI1 (1-1241/1420)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSP
---
N
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
C
EGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
L
PQL
D
C
DY
T
WNDNG
Q
L
V
RISGP
QE
C
REY
R
YS
G
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|409438.11.peg.1688
Escherichia coli SE11 (1-1242/1421)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSP
---
N
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
D
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWLT
T
ISH
T
S
D
GHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YT
LT
GQLQS
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYGWDGDRLTT
V
QT
Q
Q
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLPLALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
EE
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
R
GE
fig|340184.3.peg.108
Escherichia coli B7A (1-1203/1367)
MSGKPAARQGDMTQYGG
FGRCKNWR
A
HR
R
G
VL
GV
-------------S
---
G
RD
---
DF
G
Q
P
-----
G
KSAAG
GE
GAARRD
GP
C------------AAR
PA
A
V
H
S
L
PHLQ
Q
L
P
D
-------
ED
A
C----T
GGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
-
-
-
RT
SS
-
L
SS
S
DSS
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQMTAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWLT
D
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGELLWQHET
K
HAYNEQGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNS
LVY
DRDY
G
WNDNG
D
L
V
RISGPRQTREY
G
YS
A
TGRL
ES
V
R
T
L
A
P
D
LDIRIPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
Y
H
YDEYGRLTEKTDLIP
A
GVIRTDDERTH
--
HYDSQHRLV
F
YTRIQ
H
G
E
----------
PLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKPE
M
TWYGWDGDRLTT
V
QTD
T
TRIQTVYQPGSF
A
PLIR
I
ET
D
NGE
R
E
KAQRRSLAE
K
LQQ
E
G
S
E
D
G
H
GVVFPAELV
R
L
LDRLE
E
EIRADRVS
S
ESR
A
WLAQCGLTVEQ
LA
R
Q
V
EP
E
YTPARK
V
H
F
YHCDHRGLPLALIS
E
D
G
N
TAW
R
G
EYDEWGN
Q
LNEENP
YY
L
H
Q
PY
RLPGQQ
H
DEESGLYYNR
N
RYYDPLQGRYITQDPIGL
A
G
fig|550676.3.peg.751
Escherichia coli B185 (1-1241/1423)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
---
GG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TG
G
TDGAGR
R
F
H
LVLTTQAQRAE
VF
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
SSF
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWLT
T
L
SH
T
SEGHRV
S
VHYGYDDKGRLT
D
ERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YT
LT
GQLQSQHLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
W
S
DNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPE
E
TWYG
G
DGDRLTT
V
QT
G
T
TRIQTVYQPGSFTPLIR
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
S
V
ML
G
RLE
R
E
L
R
QGS
VSEES
Q
Q
WLAQCGLT
A
EQM
A
A
Q
L
E
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
PY
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPI
D
I
K
G
fig|340186.3.peg.183
Escherichia coli E110019 (1-1235/1323)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSP
---
N
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
-
P
PE
LP
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSG
M
QVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
TAVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
D
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
E
T
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYG
R
GYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQT
G
T
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
C
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
K
G
fig|340186.5.peg.196
Escherichia coli E110019 (1-1235/1323)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSP
---
N
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
-
P
PE
LP
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
F
R
K
Q
RA
T
S
-
L
SSP
AGP
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSG
M
QVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
TAVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
D
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
RS
T
I
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
E
T
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYG
R
GYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YRYDEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQT
G
T
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
AE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
C
G
EYDEWGNLL
G
E
T
SA
QH
L
Q
Q
S
L
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPIGL
K
G
fig|749548.3.peg.5106
Escherichia coli MS 196-1 (4-1237/1410)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSP
---
N
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
S
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVL
S
TQAQRAE
A
F
R
K
Q
RE
SS
-
L
SSP
AGP
R
SA
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
ET
S
R
N
G
N
ITR
W
F
YD
FSR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQLL
A
FTDCSGY
T
TRYEYD
Q
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
I
S
R
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
A
T
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQT
G
T
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
DA
Y
I
P
E
R
RL
HLYHCDHRGLP
Q
ALI
T
P
EG
E
TAW
C
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|749538.3.peg.1684
Escherichia coli MS 116-1 (4-1237/1410)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSP
---
N
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
S
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVL
S
TQAQRAE
A
F
R
K
Q
RE
SS
-
L
SSP
AGP
R
SA
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
ET
S
R
N
G
N
ITR
W
F
YD
FSR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQLL
A
FTDCSGY
T
TRYEYD
Q
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
I
S
R
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
A
T
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRV
S
VHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
T
R
QE
PD
G
L
Q
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQT
G
T
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
DA
Y
I
P
E
R
RL
HLYHCDHRGLP
Q
ALI
T
P
EG
E
TAW
C
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
fig|749544.3.peg.3348
Escherichia coli MS 175-1 (4-1237/1410)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSP
---
N
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
S
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVL
S
TQAQRAE
A
F
R
K
Q
RE
SS
-
L
SSP
AGP
R
SA
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
ET
S
R
N
G
N
ITR
W
F
YD
FSR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQLL
A
FTDCSGY
T
TRYEYD
Q
Y
GQ
QI
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
I
S
R
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
A
T
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWLT
E
ISH
L
SEGHRVAVHYGYDDKGRLTGERQTV
EN
PETGE
M
LW
E
HET
G
HAY
S
EQGLA
P
R
QE
PD
G
L
Q
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
W
N
TS
GQL
R
S
R
HLN
L
PQL
DRDY
D
WNDNG
Q
LIRISGP
QES
REY
R
YS
D
TGRLTGVHTTA
A
NLDI
D
IPYATDPAGNRLPDPELHPDSTLT
A
WPDNRIA
E
DAHY
V
YR
H
DEYGRL
A
EKTD
R
IPEGVIR
MH
DERTHHYHYDSQHRLV
F
YTRIQ
H
G
E
----------
P
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKPEVTWYGWDGDRLTTIQT
G
T
TRIQTVYQPGSFTPL
L
R
I
ET
E
NGE
Q
AKA
RH
RSLAE
V
LQ
E
D
T
-
-
-
-
-
GV
TL
PAEL
A
V
ML
G
RLE
R
E
L
RA
G
A
VS
A
ES
E
A
WLAQCGLT
A
EQM
A
A
QME
DA
Y
I
P
E
R
RL
HLYHCDHRGLP
Q
ALI
T
P
EG
E
TAW
C
G
EYDEWGN
Q
LNEENP
HH
L
Y
Q
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGL
K
G
Consen1
Primary consensus
MsGKPAARQGDMTqyGg
--------
IVQGSAGVrIGAPTGVACSVCP
---
GGv
---
TsghPVNPLLGAKVLPGETDiALPgPLPFILSRtYSSYRTkTPAPVG
fGPGWKaP
DIRLQlRDn
LiLnDNGGRSihFEhLfPGE
YSRSES
WLvRGGvA
l
gh
LaaLWqaLPeelRLSPH
YLATNS
QGPWWiLgW
ERVPeAdeVLPaplPpYRVLTGlvDrFGRTltfhReAaGe
sGeiTGVTDGAGR
FrLVLTTQAQRAEear
q
ss
-
gssp
-
-----
saFPDTLPg
TEYG
DnGIRLsAVWLtHDPeYPe
LP
APLvRYgwT
GEL
aVYDRSgtQVR
FTYDdky
GRMVaHryaGRPE
RYRYDd
GRVtEQlNPaGLsYtyqYekDriTiTDSLnRREVLhTeGeaGLKRVVKKEhADGSvT
S
fDaaGRL
AQTDAAGRtTEYspdvvtG
iT
iTtPDGR
fyYNh
QlTsat
PDGLe
rReYDE
GRL
ETardGditRYrYDnphSdLP
t
DATGSrktMtWSRYGQLLsFTDCSGY
TRYeyDRfGQmtAVHREEGiS
Yr
Yd
RGqL
svKDtQGhETrYEYnaAGDLTaviaPDGsRsgtQYDAWGKAv
------------------
TTQGGLTRSMeYDaAGRvi
LTnENGS
t
F
YD
lDRL
qq
GFDGRTQRYHyDLTGKLtqSEDEGLvTlWhYDesDRlTHRTVnGepAEqWQYDehGWLT
iSH
SEGHrVaVHYGYDdKGRLtGErQTVhhPeTgelLWqHET
HAYneQGLAnR
PDsLP
VEWLTYGSGyLaGMKLGdTPLVeyTRDRLHReT
RsFG
-----
-
YELttaYTpaGqLQSqhLNs
DRDY
WNDNGeLiRISgPrqtReY
YS
tGRLtgVhTtA
nLDIrIPYATDPAGNRLPDPELHPDSTLtmWPDNRIArDAHYlYryDeyGRLtEKTDlIPeGVIRtdDERTHhYHYDSQHRLVhyTRiQy
E
----------
PlVESRYLYDPLGRR
aKRVWRRERDLTGWMSLSRKPevTWYGWDGDRLTTiQtd
tRIQTvYqPGSFtPLiRvET
nGE
aKaqrRSLAe
LQq
g
e
g
gVvfPaeLv
mLdRLE
EiradrVSeESr
WLAqCGLTveQmq
QmepvYtPaRKiHlYHCDHRGLPLALIS
eG
TaW
aEYDEWGNlLnEEnp
L
QliRLPGQQyDeESGLYYNRhRYYDPLQGRYITQDPIGL
GE
Consen2
Secondary consensus
g
rk
l
l
yan
l
a
a
r
l
m
i
v
s
ly
p
l
a
k
q
sq
sr
gv
pd
l
s
g
ed
pep
a
a
g
qayr
a
e
d
a
av
h
vf
-
-
g
ltes
r
ss
lv
a
r
e
m
a
d
a
ty
v
nk
a
h
g
hht
s
v
v
e
d
rfe
gq
hv
v
d
y
q
gg
l
i
y
ev
r
gl
mas
v
v
g
yg
v
avv
r
s
a
sps
etv
s
daa
e
i
trq
a
dh
y
qi
l
s
n
r
aq
a
r
q
si
tivt
n
nei
i
g
l
it
s
s
v
ee
h
ir
i
h
y
aa
i
k
dt
r
dr
l
q
e
a
h
en
q
eam
e
sa
t
g
w
s
g
df
k
r
magsn
a
ast
ts
h
rr
l
v
s
qes
s
s
es
r
l
d
d
sa
e
v
hh
rh
a
r
a
mh
r
fh
t
h
q
g
qe
v
nq
s
i
e
a
l
i
t
e
trh
d
e
t
-
-
s
tl
pv
a
l
g
llqgs
s
q
s
aa
la
dae
i
e
f
d
e
g
q
g
sa
py
h
k
n
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative difference
Other character