fig|585055.6.peg.4084
Escherichia coli 55989 (38-1321/1321)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMT
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|585055.8.peg.4087
Escherichia coli 55989 (38-1321/1321)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMT
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|1040638.4.peg.1677
Escherichia coli O104:H4 str. LB226692
M
CPGGVTSGHPVN
X
LLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMT
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|409438.11.peg.4043
Escherichia coli SE11 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQG
D
LTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGD
I
PLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|344601.3.peg.1791
Escherichia coli B171 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|344601.5.peg.1871
Escherichia coli B171 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|331111.12.peg.4332
Escherichia coli E24377A (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
T
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|331111.3.peg.1735
Escherichia coli E24377A (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
T
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|340184.3.peg.375
Escherichia coli B7A (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIA
Q
PGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHT
S
RPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|340184.6.peg.395
Escherichia coli B7A (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIA
Q
PGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHT
S
RPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|566546.4.peg.3830
Escherichia coli W (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAP
N
GDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
T
AVKDTQGHETRYEYNAAGDLTAVIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|595496.3.peg.3464
Escherichia coli BW2952 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|536056.3.peg.245
Escherichia coli DH1 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|316407.3.peg.3646
Escherichia coli W3110 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|316385.5.peg.3603
Escherichia coli str. K-12 substr. DH10B (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|316385.7.peg.3684
Escherichia coli str. K-12 substr. DH10B (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|511145.12.peg.3581
Escherichia coli str. K-12 substr. MG1655 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|511145.6.peg.3564
Escherichia coli str. K-12 substr. MG1655 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|83333.1.peg.3415
Escherichia coli K12 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQM
K
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|562.375.peg.421
Escherichia coli EC4100B (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLV
D
S
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|331112.3.peg.3453
Escherichia coli HS (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|331112.6.peg.3587
Escherichia coli HS (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|413997.3.peg.3489
Escherichia coli B str. REL606 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHP
E
TEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|511693.5.peg.3505
Escherichia coli BL21 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHP
E
TEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|469008.4.peg.275
Escherichia coli BL21(DE3) (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHP
E
TEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YEL
S
TAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|670888.3.peg.2766
Escherichia coli 1827-70 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
YA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAEQWQYDERGWLTDISH
I
SEGHRV
A
VHYGYD
S
KGRL
AS
E
HL
TVHHPQT
N
E
LLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPL
I
ESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|316401.4.peg.4223
Escherichia coli ETEC H10407 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
C
YRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHY
R
YDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGL
Q
LALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|585396.4.peg.4438
Escherichia coli O111:H- str. 11128 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
C
YRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
T
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHW
Y
YDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|481805.3.peg.246
Escherichia coli ATCC 8739 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
SG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHW
Y
YDEADRLTHRTV
K
GETAEQWQYDERGWLTDISH
I
SEGHRV
A
VHYGYD
S
KGRL
AS
E
HL
TVHHPQT
N
E
LLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|481805.6.peg.256
Escherichia coli ATCC 8739 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
SG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHW
Y
YDEADRLTHRTV
K
GETAEQWQYDERGWLTDISH
I
SEGHRV
A
VHYGYD
S
KGRL
AS
E
HL
TVHHPQT
N
E
LLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTH
Q
YHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|573235.3.peg.4681
Escherichia coli O26:H11 str. 11368 (38-1321/1411)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHT
S
RPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|585395.4.peg.4804
Escherichia coli O103:H2 str. 12009 (38-1299/1389)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYG
R
T
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIR
----------------------
LQGRYITQDPIGLKGGWNLY
G
Y
Q
LNP
I
SD
IDPLG
-
L
SMWE
DA
K
S
GA
C
TNGL
CG
TLSAMI
GP
D
K
FDSIDSTAY
DAL
NKINS
QSI
CED
KE
F
A
GLICKD
NS
GRYF
ST
APNRGE
fig|585055.6.peg.4446
Escherichia coli 55989 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
Y
L
PM
N
P
fig|585055.8.peg.4450
Escherichia coli 55989 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
Y
L
PM
N
P
fig|585395.4.peg.4862
Escherichia coli O103:H2 str. 12009 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYG
R
T
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
Y
L
PM
N
P
fig|562.371.peg.1871
Escherichia coli 1044A (38-1293/1394)
VCPGGVT
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|83334.1.peg.4839
Escherichia coli O157:H7 (38-1293/1394)
VCPGGVT
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|478006.5.peg.2164
Escherichia coli O157:H7 str. EC4501 (38-1293/1394)
VCPGGVT
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|386585.9.peg.5086
Escherichia coli O157:H7 str. Sakai (38-1293/1394)
VCPGGVT
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|502346.5.peg.1734
Escherichia coli O157:H7 str. TW14588 (38-1293/1394)
VCPGGVT
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|562.373.peg.2622
Escherichia coli 1125A (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|562.372.peg.3653
Escherichia coli 1212A (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|562.374.peg.3612
Escherichia coli 536A (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444454.5.peg.3970
Escherichia coli O157:H7 str. EC4024 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444449.5.peg.3425
Escherichia coli O157:H7 str. EC4042 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444448.5.peg.2180
Escherichia coli O157:H7 str. EC4045 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444453.5.peg.2322
Escherichia coli O157:H7 str. EC4076 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444452.5.peg.1790
Escherichia coli O157:H7 str. EC4113 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444450.8.peg.5262
Escherichia coli O157:H7 str. EC4115 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444451.5.peg.2836
Escherichia coli O157:H7 str. EC4196 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|444447.5.peg.2346
Escherichia coli O157:H7 str. EC4206 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|478004.5.peg.2396
Escherichia coli O157:H7 str. EC4401 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|478005.5.peg.2244
Escherichia coli O157:H7 str. EC4486 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|478007.5.peg.2221
Escherichia coli O157:H7 str. EC508 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|544404.4.peg.5072
Escherichia coli O157:H7 str. TW14359 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|478008.5.peg.2068
Escherichia coli O157:H7 str. EC869 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
S
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLY
W
YDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
TRIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|701177.3.peg.4755
Escherichia coli O55:H7 str. CB9615 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGE
G
GLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
LAGMKLGDTPLV
DF
TRDRLHR
K
T
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
D
S
GRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTV
A
QMQ
S
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
A
T
E
W
C
AEYDEWGNLLNEEN
L
HQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NG
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|566546.4.peg.4232
Escherichia coli W (38-1289/1390)
VCPGGVTSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
C
YRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
T
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKA
I
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
LS
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
Y
L
PM
N
P
fig|573235.3.peg.4775
Escherichia coli O26:H11 str. 11368 (38-1293/1394)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHT
S
RPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRYDHDRFGQ
M
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
I
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLAN
L
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQSQHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|409438.11.peg.4404
Escherichia coli SE11 (38-1273/1374)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKD
H
ITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
YR
A
YD
S
RGQL
I
AVKDTQGHETRYEYNAAGDLT
T
VIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAEQWQYDERGWLTDISH
I
SEGHRVTVHYGYDEKGRLTGERQTVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSGYLAGMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQSQHLN
--
--
----
-
-----------
SPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILA
A
RVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|585396.4.peg.4936
Escherichia coli O111:H- str. 11128 (38-1289/1390)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWWLLGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
C
YRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSLNRREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRYDHDRFGQ
V
TAVHREEGLS
Q
Y
H
A
YD
S
RGQL
T
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKA
I
C
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHW
Y
YDEADRLTHRTV
K
GETAE
R
WQYDERGWLTDISH
I
SEGHRV
A
VHYGYD
S
KGRL
AS
E
HL
TVHHPQTEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWLTYGSG
W
L
S
GMKLGDTPLVEYTRDRLHRET
L
R
R
FG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLT
S
VHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDRYGRLTEKTDLIPEG
G
IRTDDERTHRYHYDSQHRLVHYTRTQY
A
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
L
A
R
TQRRSLAD
A
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILA
A
RVSEESR
R
WLASCGLTVEQMQ
N
QMDPVYTPARK
I
HLYHCDHRGLPLALIS
K
EG
T
T
E
W
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|331111.12.peg.4692
Escherichia coli E24377A (38-1264/1365)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
T
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHY
R
YDEKGRLTGERQTVHHP
E
TEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASC
--------
-
-----------
-
--------
GLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|331111.3.peg.2089
Escherichia coli E24377A (38-1264/1365)
VCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVGSLGPGWKMP
A
DIRLQLRDN
T
LILSDNGGRSLYFEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
AISG
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LPAAPLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHRHTGRPE
I
RYRYDSDGRVTEQLNPAGLSYTY
Q
YEKDRITITDSL
D
RREVLHTQGEAGLKRVVKKEHADGSVTQS
Q
FDAVGRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYN
HHN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETAPDGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRYDHDRFGQ
M
TAVHREEGLS
Q
YR
A
YD
S
RGQL
T
AVKDTQGHETRYEYN
I
AGDLTAVIAPDGSRNGTQYDAWGKAV
R
TTQGGLTRSMEYDAAGRVI
R
LTSENGSHTTFRYD
V
LDRL
I
QE
T
GFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTV
K
GETAE
R
W
R
YDERGWLTDISH
I
SEGHRVTVHY
R
YDEKGRLTGERQTVHHP
E
TEALLWQHET
R
HAYNAQGLANR
CI
PDSLPAVEWL
A
YGSGYLAGMKLGDTPLV
DF
TRDRLHRET
L
RSFG
R
-----
-
-
YELTTAYTPAGQLQ
R
QHLNSL
QY
DRDY
T
WNDNGELIRISSPRQTRSY
S
YS
T
TGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLS
M
WPDNRIARDAHYLYRYDR
H
GRLTEKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQY
E
EPLVESRYLYDPLGRR
V
AKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
R
S
RIQTIYQPGSFTPLIRVETATGE
Q
AKTQRRSLAD
T
LQQ
S
GGE
D
G
GS
VVFPPVLV
Q
MLDRLE
S
EILADRVSEESR
R
WLASC
--------
-
-----------
-
--------
GLPLALIS
T
EG
A
TAW
C
AEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNLY
T
Y
P
L
S
PV
NS
M
DPLG
-
L
YEF-
--
-
-
--
-
----
--
------
--
-
K
SKNIDDIGI
F
AL
AMCNG
E
SI
NEN
KEY
G
GLICK
-
KQ
G
E
YF
PM
N
P
fig|585055.6.peg.522
Escherichia coli 55989 (38-1330/1422)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LPAAPLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HR
YA
GRPE
M
RYRYD
D
A
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDA
S
GR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YRYD
E
YGRL
A
EKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|585055.8.peg.523
Escherichia coli 55989 (38-1330/1422)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LPAAPLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HR
YA
GRPE
M
RYRYD
D
A
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDA
S
GR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YRYD
E
YGRL
A
EKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|595496.3.peg.420
Escherichia coli BW2952 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|536056.3.peg.3290
Escherichia coli DH1 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|83333.1.peg.493
Escherichia coli K12 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|316407.3.peg.482
Escherichia coli W3110 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|316385.5.peg.453
Escherichia coli str. K-12 substr. DH10B (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|316385.7.peg.460
Escherichia coli str. K-12 substr. DH10B (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|511145.12.peg.518
Escherichia coli str. K-12 substr. MG1655 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|511145.6.peg.512
Escherichia coli str. K-12 substr. MG1655 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|679207.4.peg.1730
Escherichia coli MS 107-1 (38-1330/1429)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
DISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
K
DAHY
V
Y
H
YD
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|585034.4.peg.498
Escherichia coli IAI1 (38-1330/1429)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
K
DAHY
V
Y
H
YD
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
M
TWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|585034.5.peg.497
Escherichia coli IAI1 (38-1330/1429)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
K
DAHY
V
Y
H
YD
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
M
TWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|409438.11.peg.651
Escherichia coli SE11 (38-1330/1429)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
K
DAHY
V
Y
H
YD
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|316401.4.peg.630
Escherichia coli ETEC H10407 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
V
Y
YGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|413997.3.peg.481
Escherichia coli B str. REL606 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YD
AS
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|511693.5.peg.486
Escherichia coli BL21 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YD
AS
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|469008.4.peg.3264
Escherichia coli BL21(DE3) (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YD
AS
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|749547.3.peg.1423
Escherichia coli MS 187-1 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
S
D
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGL
I
T
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLW
H
HET
G
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPL
L
E
F
TRDRLHRET
V
RSFG
S
MAGSN
A
A
Y
K
LT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
H
L
H
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|331111.12.peg.843
Escherichia coli E24377A (38-1330/1429)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
K
DAHY
V
Y
H
YD
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|331111.3.peg.3071
Escherichia coli E24377A (38-1330/1429)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
F
GPGWK
A
P
S
DIRLQ
I
RD
D
A
L
V
L
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
SL
S
S
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
R
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYD
GH
GWL
R
E
ISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
R
AGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
K
DAHY
V
Y
H
YD
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDS
L
HRLVHY
I
R
I
QY
E
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|358709.5.peg.2056
Escherichia coli 101-1 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LG
G
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRT
L
T
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LPAAPLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHR
YA
GRPE
M
RYRYD
DT
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ETRYEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGSH
SV
F
S
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLV
IL
W
Y
YD
AS
DR
I
THRTVNGE
P
AEQWQYD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGE
C
QTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YTPAGQLQSQHLNSL
VY
DRDY
G
W
S
DNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
ES
V
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
V
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRLTEKTD
R
IP
A
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
M
AKRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
Y
E
PGSFTPLIRVET
EN
GE
R
E
K
A
QRRSLA
E
T
LQQ
E
G
S
E
N
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|344610.3.peg.1505
Escherichia coli 53638 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRT
L
T
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
P
Q
H
P
GRMV
G
HR
YA
GRPE
M
RYRYD
D
A
GRV
V
EQLNPAGLSY
R
Y
Q
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGS
R
S
E
F
T
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQW
R
YD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
T
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDRLHRET
V
RSFG
N
GT
GSN
A
A
YELT
ST
YTPAG
R
LQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
E
GV
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRI
T
K
DAHYLYRYD
E
YGRLTEKTD
R
IP
T
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|344610.7.peg.1175
Escherichia coli 53638 (38-1327/1426)
VCPGG
M
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
VF
GPGWK
A
P
S
DIRLQLRD
D
G
LIL
N
DNGGRS
IH
FE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWW
I
LGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRT
L
T
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
SL
S
S
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
P
Q
H
P
GRMV
G
HR
YA
GRPE
M
RYRYD
D
A
GRV
V
EQLNPAGLSY
R
Y
Q
YE
Q
DRIT
V
TDSLNRREVLHT
E
G
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DA
A
GRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
DGN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
SRS
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRY
EY
DRFGQ
M
TAVHREEG
I
S
L
YR
H
YD
N
RG
R
L
T
S
VKD
A
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
R
SE
TQYDAWGKAV
S
TTQGGLTRSMEYDAAGRVI
S
LT
N
ENGS
R
S
E
F
T
YD
A
LDRL
V
Q
Q
G
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
W
Y
YDE
S
DR
I
THRTVNGE
P
AEQW
R
YD
GH
GWLTDISH
L
SEGHRV
A
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
T
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLGDTPL
L
EYTRDRLHRET
V
RSFG
N
GT
GSN
A
A
YELT
ST
YTPAG
R
LQSQHLNSL
VY
DRDY
G
WNDNG
D
L
V
RIS
G
PRQTR
E
Y
G
YS
A
TGRL
E
GV
R
T
L
A
PD
LDIRIPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRI
T
K
DAHYLYRYD
E
YGRLTEKTD
R
IP
T
GVIRTDDERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EPLVESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
E
VTWYGWDGDRLTT
V
Q
T
D
T
TRIQT
V
YQPGSF
A
PLIR
I
ET
D
N
GE
R
E
K
A
QRRSLA
E
K
LQQ
E
G
S
E
D
G
HG
VVFP
AE
LV
R
L
LDRLE
E
EI
R
ADRVS
S
ESR
A
WLA
Q
CGLTVEQ
LA
R
Q
V
E
P
E
YTPARK
A
HLYHCDHRGLPLALIS
E
D
G
N
TAW
S
AEYDEWGN
Q
LNEENPH
HVY
Q
PY
RLPGQQ
H
DEESGLYYNRHRYYDPLQGRYITQDP
M
GLKGGWNLY
Q
Y
P
LNP
L
QQ
IDP
M
G
L
L
QTWD
DA
R
S
GA
C
TGGV
CG
VLSRII
GP
S
K
FDSTADAAL
DAL
KETQN
R
S
L
CND
M
EY
S
G
I
V
CKD
TN
G
K
YF
AS
fig|701177.3.peg.242
Escherichia coli O55:H7 str. CB9615 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTEQ
V
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGL
I
T
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
V
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|562.373.peg.2700
Escherichia coli 1125A (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444454.5.peg.4700
Escherichia coli O157:H7 str. EC4024 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444449.5.peg.4155
Escherichia coli O157:H7 str. EC4042 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444448.5.peg.2910
Escherichia coli O157:H7 str. EC4045 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444453.5.peg.4272
Escherichia coli O157:H7 str. EC4076 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444452.5.peg.3166
Escherichia coli O157:H7 str. EC4113 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444450.8.peg.379
Escherichia coli O157:H7 str. EC4115 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444451.5.peg.3677
Escherichia coli O157:H7 str. EC4196 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|444447.5.peg.3086
Escherichia coli O157:H7 str. EC4206 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|478004.5.peg.3911
Escherichia coli O157:H7 str. EC4401 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|478005.5.peg.3843
Escherichia coli O157:H7 str. EC4486 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|478007.5.peg.3043
Escherichia coli O157:H7 str. EC508 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|544404.4.peg.241
Escherichia coli O157:H7 str. TW14359 (38-1305/1410)
VCPGG
I
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWW
I
L
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LPAAPL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTE
LV
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
IV
APDGSR
SEI
QYDAWGKAV
S
TTQGGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGLVT
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEY
M
RDRLHRET
A
RSFG
G
-----
E
A
YEL
A
TA
WNTS
GQL
R
S
R
HLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
I
YRYD
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
FH
TR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|340186.3.peg.756
Escherichia coli E110019 (38-1309/1419)
VCPGG
I
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWW
I
L
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRT
L
TFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
SL
S
S
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTEQ
V
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQ
M
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
R
S
T
I
A
YD
KR
G
R
P
V
S
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLT
R
KL
TQ
SEDEGL
I
T
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
LP
QL
DRDY
T
WNDNG
Q
L
V
RIS
G
P
QE
C
R
E
Y
R
YS
G
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|340186.5.peg.783
Escherichia coli E110019 (38-1309/1419)
VCPGG
I
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWW
I
L
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRT
L
TFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
SL
S
S
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTEQ
V
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQ
M
V
SQ
KD
A
QG
R
ETRYEY
S
AAGDLTA
T
V
S
PDG
K
R
S
T
I
A
YD
KR
G
R
P
V
S
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLT
R
KL
TQ
SEDEGL
I
T
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLTGERQTV
EN
P
E
T
GE
LLWQHET
K
HAYN
E
QGLANR
VT
PDSLP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
A
RSFG
G
-
AGS
T
A
G
YE
Q
A
TAYT
LT
GQLQS
R
HLN
LP
QL
DRDY
T
WNDNG
Q
L
V
RIS
G
P
QE
C
R
E
Y
R
YS
G
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
WPDNRIA
E
DAHY
V
YR
H
D
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYGWDGDRLTT
V
Q
TQ
Q
TRIQT
V
YQPGSFTPL
L
R
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
A
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
fig|550676.3.peg.751
Escherichia coli B185 (38-1313/1423)
VCPGG
I
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
VF
GPGWK
A
P
F
DIRLQ
I
RD
E
G
LIL
N
DNGGRS
IH
FE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWW
I
L
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRT
LA
FHR
A
A
E
G
D
V
A
G
AV
TG
G
TDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
SL
S
S
PAG
P
R
SA
SS
SSF
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
HYA
GRPE
S
RYRYD
DT
GRVTEQ
V
NP
E
GL
D
Y
RF
E
Y
GQ
DR
V
TITDSLNRREVL
Y
T
E
GE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
EA
GRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
SQR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
SRS
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRY
EY
DR
Y
GQ
Q
I
AVHREEG
I
S
T
Y
S
S
Y
N
P
RGQL
V
SQ
KD
A
QG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
R
S
T
I
E
YD
KR
G
R
P
V
S
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LT
N
ENGS
QS
TFRYD
P
V
DRL
T
EQ
R
GFDGRTQRYH
Y
DLTGKL
TQ
SEDEGL
I
T
L
WHYD
AS
DR
I
THRTVNG
DP
AEQWQYDE
H
GWLT
TL
SH
T
SEGHRV
S
VHYGYD
D
KGRLT
D
ERQTV
EN
P
E
T
GEM
LW
E
HET
G
HAY
SE
QGLA
T
R
QE
PD
G
LP
P
VEWLTYGSGYLAGMKLG
G
TPLVEYTRDRLHRET
V
RSFG
S
MAGSN
A
A
YELT
ST
YT
LT
GQLQSQHLN
LP
QL
DRDY
D
WNDNG
Q
LIRIS
G
P
QES
R
E
Y
R
YS
D
TGRLTGVHTTAANLDI
D
IPYATDPAGNRLPDPELHPDSTL
T
A
W
S
DNRIA
E
DAHY
V
YR
H
D
E
YGRL
A
EKTD
R
IPEGVIR
MH
DERTH
H
YHYDSQHRLV
F
YTR
I
Q
H
G
EP
Q
VESRYLYDPLGRR
T
G
KRVWRRERDLTGWMSLSRKP
EE
TWYG
G
DGDRLTT
V
Q
T
G
T
TRIQT
V
YQPGSFTPLIR
I
ET
EN
GE
Q
AK
ARH
RSLA
E
V
LQ
E
D
T
G
-
-
-
--
V
TL
P
AE
L
S
V
ML
G
RLE
R
E
LRQGS
VSEES
Q
Q
WLA
Q
CGLT
A
EQM
A
A
Q
L
EAE
Y
I
P
E
RK
L
HLYHCDHRGLP
Q
ALIS
P
EG
E
TAW
Q
G
EYDEWGNLL
G
E
TSAQH
LQQ
PY
RLPGQQYDEESGLYYNR
N
RYYDPLQGRYITQDPI
DI
KGGWNLY
S
Y
A
LNPV
SW
IDPLG
-
L
TQCD
--
-
S
EG
C
NNDI
--
LFTGGS
GP
D
N
KILNELGPR
D
GI
DGLGS
Q
N
M
---
K
M
Y
S
GL
L
GG
D
Consen1
Primary consensus
VCPGGvTsghPVNPlLGAKVLPGETDiALPgPLPFILSRtYSSYRTkTPAPVGslGPGWKmP
DIRLQlRDn
LiLsDNGGRSlyFEhLfPGE
YSRSES
WLvRGGvA
l
gh
LaaLWqaLPeelRLSPH
YLATNS
QGPWWlLgW
ERVPeAdeVLPaplPpYRVLTGlvDrFGRTqtfhReAaGe
sGeiTGVTDGAGR
FrLVLTTQAQRAEea
---
R
aiSg
p
-
-----
saFPDTLPg
TEYG
DnGIRLsAVWLtHDPeYPe
LPaAPLvRYgwT
GEL
aVYDRSgtQVR
FtYDdky
GRMVAHrhtGRPE
RYRYDsdGRVtEqlNPaGLsYty
YekDriTiTDSLnRREVLhTqGeaGLKRVVKKEhADGSvTqS
fDavGRL
AQTDAAGRtTEYspdvvtG
iT
iTtPDGR
fyYN
QlTsat
PDGLe
rREYDE
GRL
ETapdGditRYrYDnphSdLP
t
DATGSrktMtWSRYGQLLsFTDCSGY
TRYdhDRfGQ
tAVHREEGlS
Yr
Yd
RGqL
avKDtQGhETRYEYnaAGDLTaviaPDGsRngtQYDAWGKAv
TTQGGLTRSMeYDAAGRvi
LTsENGShttFrYD
lDRL
qe
GFDGRTQRYHhDLTGKLirSEDEGLVthWhYDeaDRlTHRTVnGetAEqWqYDerGWLTdiSH
SEGHRVtVHYgYDeKGRLTGErQTVhhPqTealLWqHET
HAYnaQGLAnR
PDsLPaVEWLTYGSGyLaGMKLGdTPLVeytRDRLHReT
RsFG
-----
-
YELttaytpaGQLqSqHLNsl
DRDY
WnDNGeLiRISsPrqtRsY
YS
tGRLtgVhTtAanLDIrIPYATDPAGNRLPDPELHPDSTLs
WPDNRIArDAHYlYryDryGRLtEKTDlIPeGvIRtdDERTHrYHYDSQHRLVhyTRtQy
EPlVESRYLYDPLGRR
aKRVWRRERDLTGWMSLSRKPqvTWYGWDGDRLTTiQnd
tRIQTiYqPGSFTPLiRvETatGE
aKtqrRSLAd
LQq
gge
g
VvfPpvLv
mLdRLE
EiladrVSeESr
WLAsCGLTveQmq
QmdpvYtPaRK
HLYHCDHRGLPlALIS
eG
TaW
aEYDEWGNlLnEenphqlqQliRLPGQQyDEESGLYYNRHRYYDPLQGRYITQDPiglKGGWNLY
Y
LnPv
iDPlG
-
L
da
s
c
cg
gp
k
dal
qsi
key
Glickd
GrYF
aPNRGE
Consen2
Secondary consensus
yan
v
l
a
a
r
vf
a
i
v
n
ih
p
l
a
k
q
sq
sr
gv
pd
i
n
g
ed
pep
a
va
g
layr
a
e
d
a
av
h
vfrkq
sl
s
sr
ss
lv
a
r
e
m
a
d
g
a
ty
v
nk
a
a
h
hya
dt
v
lv
e
d
rf
gq
hv
v
d
y
e
gg
l
i
r
y
ea
r
gl
mas
v
v
g
yg
v
avv
r
s
srs
etv
s
daa
e
i
trq
a
ey
y
i
i
s
n
r
sq
a
r
si
tivt
n
sei
i
g
it
n
qsv
s
v
eq
y
tq
il
y
as
i
k
dp
r
r
gh
tl
r
d
c
en
e
gem
e
se
t
g
p
w
s
g
dfm
k
r
magsn
a
astwnts
r
r
lp
s
v
g
qes
e
s
es
r
l
pd
d
t
e
wh
eh
a
r
a
g
mh
h
fh
i
h
q
g
ee
v
tq
s
v
e
l
i
en
e
arh
e
e
ts
-
-
tl
ae
a
l
g
lrqgs
s
q
q
aa
la
eae
i
e
q
d
e
g
q
g
tsaqhvy
py
h
mdi
s
m
m
l
--
-
-
--
--
n
fgi
n
mmf
i
gg
-
n
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character