fig|585055.6.peg.1619
Escherichia coli 55989 (1-730/1413)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585055.8.peg.1622
Escherichia coli 55989 (1-730/1413)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|1040638.4.peg.4201
Escherichia coli O104:H4 str. LB226692
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGW
F
NHPQPYQ
fig|679207.4.peg.1730
Escherichia coli MS 107-1 (1-733/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|585395.4.peg.1664
Escherichia coli O103:H2 str. 12009 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|316401.4.peg.1755
Escherichia coli ETEC H10407 (1-730/1407)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|344601.3.peg.228
Escherichia coli B171 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HD
X
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|344601.5.peg.225
Escherichia coli B171 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HD
X
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|585034.4.peg.1442
Escherichia coli IAI1 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QG
L
WWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|585034.5.peg.1439
Escherichia coli IAI1 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QG
L
WWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|749537.3.peg.955
Escherichia coli MS 115-1 (1-730/1406)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|679205.4.peg.4676
Escherichia coli MS 124-1 (1-730/1406)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|749533.3.peg.773
Escherichia coli MS 84-1 (1-730/1406)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|566546.4.peg.1575
Escherichia coli W (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|331111.12.peg.843
Escherichia coli E24377A (1-733/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|331111.3.peg.3071
Escherichia coli E24377A (1-733/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|585034.4.peg.498
Escherichia coli IAI1 (1-733/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|585034.5.peg.497
Escherichia coli IAI1 (1-733/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|409438.11.peg.651
Escherichia coli SE11 (1-733/1429)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|749547.3.peg.1423
Escherichia coli MS 187-1 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|562.375.peg.3784
Escherichia coli EC4100B (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|585396.4.peg.1924
Escherichia coli O111:H- str. 11128 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|573235.3.peg.2092
Escherichia coli O26:H11 str. 11368 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
D
T
RLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|340184.3.peg.2563
Escherichia coli B7A (1-730/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|340184.6.peg.2683
Escherichia coli B7A (1-730/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|413997.3.peg.1488
Escherichia coli B str. REL606 (1-730/1407)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|340185.3.peg.2516
Escherichia coli E22 (1-730/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|340185.4.peg.2656
Escherichia coli E22 (1-730/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585395.4.peg.499
Escherichia coli O103:H2 str. 12009 (1-730/1253)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|595496.3.peg.420
Escherichia coli BW2952 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|536056.3.peg.3290
Escherichia coli DH1 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|83333.1.peg.493
Escherichia coli K12 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|316407.3.peg.482
Escherichia coli W3110 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|316385.5.peg.453
Escherichia coli str. K-12 substr. DH10B (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|316385.7.peg.460
Escherichia coli str. K-12 substr. DH10B (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|511145.12.peg.518
Escherichia coli str. K-12 substr. MG1655 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|511145.6.peg.512
Escherichia coli str. K-12 substr. MG1655 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|413997.3.peg.481
Escherichia coli B str. REL606 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|511693.5.peg.486
Escherichia coli BL21 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|469008.4.peg.3264
Escherichia coli BL21(DE3) (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|344610.7.peg.5120
Escherichia coli 53638 (1-730/1407)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|481805.3.peg.2362
Escherichia coli ATCC 8739 (1-730/1268)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|481805.6.peg.2353
Escherichia coli ATCC 8739 (1-730/1268)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|316401.4.peg.630
Escherichia coli ETEC H10407 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|344610.3.peg.1505
Escherichia coli 53638 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
P
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQW
R
YD
G
HGWL
fig|344610.7.peg.1175
Escherichia coli 53638 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
P
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQW
R
YD
G
HGWL
fig|358709.5.peg.2056
Escherichia coli 101-1 (1-730/1426)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILG
G
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YD
A
SDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|566546.3.peg.1
Escherichia coli W (1-730/1034)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|679206.4.peg.2994
Escherichia coli MS 119-7 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
W
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
S
P
FPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
C
G
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|562.371.peg.2767
Escherichia coli 1044A (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.373.peg.3227
Escherichia coli 1125A (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.372.peg.1499
Escherichia coli 1212A (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.374.peg.5477
Escherichia coli 536A (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|83334.1.peg.635
Escherichia coli O157:H7 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|155864.1.peg.554
Escherichia coli O157:H7 EDL933 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|155864.8.peg.568
Escherichia coli O157:H7 EDL933 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444454.5.peg.5043
Escherichia coli O157:H7 str. EC4024 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444450.8.peg.716
Escherichia coli O157:H7 str. EC4115 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478004.5.peg.1424
Escherichia coli O157:H7 str. EC4401 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478006.5.peg.892
Escherichia coli O157:H7 str. EC4501 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478008.5.peg.1656
Escherichia coli O157:H7 str. EC869 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|386585.9.peg.667
Escherichia coli O157:H7 str. Sakai (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|502346.5.peg.643
Escherichia coli O157:H7 str. TW14588 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|566546.3.peg.4671
Escherichia coli W (1-733/1037)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|444449.5.peg.5380
Escherichia coli O157:H7 str. EC4042 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444448.5.peg.3255
Escherichia coli O157:H7 str. EC4045 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444453.5.peg.596
Escherichia coli O157:H7 str. EC4076 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444452.5.peg.931
Escherichia coli O157:H7 str. EC4113 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444451.5.peg.2216
Escherichia coli O157:H7 str. EC4196 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444447.5.peg.3428
Escherichia coli O157:H7 str. EC4206 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|544404.4.peg.580
Escherichia coli O157:H7 str. TW14359 (1-733/1398)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
V
T
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.371.peg.1754
Escherichia coli 1044A (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.373.peg.5099
Escherichia coli 1125A (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.372.peg.1238
Escherichia coli 1212A (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|562.374.peg.2346
Escherichia coli 536A (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478006.5.peg.1955
Escherichia coli O157:H7 str. EC4501 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478007.5.peg.2158
Escherichia coli O157:H7 str. EC508 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|502346.5.peg.5309
Escherichia coli O157:H7 str. TW14588 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|83334.1.peg.2091
Escherichia coli O157:H7 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|386585.9.peg.2162
Escherichia coli O157:H7 str. Sakai (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444454.5.peg.966
Escherichia coli O157:H7 str. EC4024 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444449.5.peg.292
Escherichia coli O157:H7 str. EC4042 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444448.5.peg.4648
Escherichia coli O157:H7 str. EC4045 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444452.5.peg.1972
Escherichia coli O157:H7 str. EC4113 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444450.8.peg.2110
Escherichia coli O157:H7 str. EC4115 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|444447.5.peg.5560
Escherichia coli O157:H7 str. EC4206 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478004.5.peg.2835
Escherichia coli O157:H7 str. EC4401 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478005.5.peg.2985
Escherichia coli O157:H7 str. EC4486 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|544404.4.peg.1972
Escherichia coli O157:H7 str. TW14359 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDP
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
G
APL
A
RY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|478008.5.peg.3679
Escherichia coli O157:H7 str. EC869 (1-733/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
C
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|409438.11.peg.1684
Escherichia coli SE11 (1-730/1402)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVC
Q
GG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLV
I
LW
Y
YDESDR
I
THRTVNGEPAEQWQYD
G
HGWL
fig|670888.3.peg.2126
Escherichia coli 1827-70 (1-730/1335)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
V
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
T
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
L
LT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|478007.5.peg.914
Escherichia coli O157:H7 str. EC508 (1-751/1416)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
TTQGGLTRSMEYDLAGRI
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNGEPAEQWQYDEHGWL
fig|749531.3.peg.1549
Escherichia coli MS 69-1 (1-730/1400)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPG
R
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPL
S
FILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PG
G
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
T
ED
VLPAPLPPYRVLTGL
A
DRFG
Q
TLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDR
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
N
LP
G
APLVRY
TY
T
EA
GEL
L
AVYDRSGTQVR
A
FTYD
P
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
R
LT
AVV
Y
PDGLE
S
RR
A
YDE
R
D
RL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
R
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDR
I
THRTVNGEPAEQW
R
YD
G
HGWL
fig|585396.4.peg.552
Escherichia coli O111:H- str. 11128 (1-733/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYP
D
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|585055.6.peg.522
Escherichia coli 55989 (1-733/1422)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDA
S
GR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|585055.8.peg.523
Escherichia coli 55989 (1-733/1422)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
L
WLVRGG
K
A
T
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
L
VP
G
A
ED
VLPAPLPPYRVLTGL
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQ
R
TA
S
LSS
PDT
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
Y
L
YE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
R
S
G
Y
DAAGRL
T
AQTDA
S
GR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
T
SVKDAQG
R
ETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
V
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|573235.3.peg.541
Escherichia coli O26:H11 str. 11368 (1-733/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|331111.12.peg.1921
Escherichia coli E24377A (1-733/1405)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
V
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
G
VP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTL
AYR
C
EAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|331111.3.peg.4081
Escherichia coli E24377A (1-733/1405)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
V
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
V
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
G
VP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTL
AYR
C
EAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|340186.3.peg.841
Escherichia coli E110019 (1-733/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILS
L
TYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
S
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|340186.5.peg.878
Escherichia coli E110019 (1-733/1256)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILS
L
TYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQ
I
RD
D
A
L
V
LNDNGGRSIHFE
S
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
L
LPAPLPPYRVLTG
M
A
DRFGRTL
AYR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
SAFPDTLPG
-
TEYG
A
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
Y
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYD
G
HGWL
fig|749532.3.peg.4672
Escherichia coli MS 78-1 (1-733/819)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKV
Q
PGETD
L
ALP
D
PLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
G
LILNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
V
Q
PD
GH
T
LA
R
LW
A
S
LP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
E
G
VP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTL
AYR
C
EAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RK
P
H
TA
S
LSS
PDS
P
R
PL
S
----
A
P
S
FPDTLPG
-
TEYG
T
D
S
GIRLSAVWL
M
HDPEYPE
N
LP
A
APLV
C
Y
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
GD
V
I
RY
A
YDNPHS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|478005.5.peg.2451
Escherichia coli O157:H7 str. EC4486 (1-720/732)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGG
M
---
TSG
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRT
R
TPAPVG
I
FGPGWKAP
S
DIRLQLRD
D
A
L
V
LNDNGGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYRVLTG
M
A
DRFGRTLT
YR
REAAG
D
L
A
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
RKQH
TA
S
LSS
PDT
P
R
PL
S
----
D
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWLTHDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMV
G
HRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
H
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RR
A
YDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
Y
S
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
R
YD
N
RG
R
L
I
SVKDAQGHETRYEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYD
L
AGR
IT
T
LTNENGS
R
S
E
F
T
YD
A
LDRL
V
QQ
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRLTHRTVNG
D
fig|340184.3.peg.108
Escherichia coli B7A (1-697/1367)
MSGKPAARQGDMTQYGG
FGRCKNWR
A
HR
R
G
VL
GV
-------------S
G
RD
---
DF
G
Q
P
-----
G
KSAAG
GE
GAARRD
GP
C------------AAR
PA
A
V
H
S
L
PHLQ
Q
L
P
D
-------
ED
A
C----T
GGRSIHFE
P
L
L
PGE
AV
YSRSES
M
WLVRGG
K
A
A
Q
PD
GH
T
LA
R
LW
G
ALP
PD
I
RLSPH
L
YLATNS
A
QGPWWILGW
S
ERVP
G
A
ED
VLPAPLPPYR
E
LTGL
A
DRFGRTLT
YR
REAAG
D
L
T
GEITGVTDGAGR
E
FRLVLTTQAQRAEEA
---
R
TS
S
LSS
SDS
SR
PL
S
----
A
SAFPDTLPG
-
TEYG
P
D
R
GIRLSAVWL
M
HDP
A
YPE
S
LP
A
APLVRY
TY
T
EA
GEL
L
AVYDRS
N
TQVR
A
FTYD
A
Q
H
P
GRMVAHRYAGRPE
M
RYRYDD
A
GRV
V
EQLNPAGLSY
R
YQYE
Q
DRIT
V
TDSLNRREVLHTEG
G
AGLKRVVKKE
L
ADGSVT
H
S
G
Y
DAAGRL
T
AQTDAAGR
R
TEY
GL
N
VV
S
G
D
IT
D
ITTPDGR
ETK
FYYN
D
GN
QLT
AVV
S
PDGLE
S
RREYDE
P
GRL
VS
ET
S
R
S
G
ETV
RYRYD
DA
HS
E
LP
AT
T
T
DATGS
TRQ
MTWSRYGQLL
A
FTDCSGY
Q
TRYEYDRFGQ
M
TAVHREEGIS
L
YR
H
YD
N
RG
R
L
T
SVKDAQG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
TQYDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
S
D
F
S
YD
A
LDRL
V
QQ
G
GFDGRTQRYHYDLTGKLTQSEDEGLVTLW
Y
YDESDR
I
THRTVNGEPAEQWQYD
D
HGWL
fig|481805.3.peg.1772
Escherichia coli ATCC 8739 (1-726/1251)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSG
N
PVNPLLGAKVLPGETD
F
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DI
H
LQLRDN
E
LILNDNGGRSIHFEHLFPGE
DG
F
SRSE
L
F
WLVRGGVA
K
L
NE
S
H
R
LA
P
LWQALPEELRLSPH
I
YLATNS
P
QGPWWILGW
S
ERVP
G
V
DE
M
LPAPLPPYRVLTGLVDRFGRTLTF
R
REAAGE
F
T
GEITGVTDGAGR
Q
FRLVLTTQAQRAE
N
A
---
R
QQ
A
I
AA
GAK
G
-
--
-----
-
PDI
PD
S
LP
D
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRY
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMVAHRYAGRPE
M
RYRYDD
T
GRVTEQ
F
NPAGLSYTYQYEK
N
RITITDSLNRREVLHTEGEAGLK
C
VVK
T
E
L
ADGS
I
T
R
S
K
FD
YM
GRL
Q
S
QTDAAGRTTEYSP
N
VVTG
L
V
T
C
ITTPDGR
KSE
FYYN
N
QN
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
TQ
ETAR
N
GD
V
TRY
S
YDNPHS
E
LP
SA
T
E
DATGSRK
Q
MTWSRYGQL
Q
T
FTDCSGY
E
T
H
YEYDRFGQ
M
M
AVHREEGIS
T
Y
N
T
Y
N
P
RGQL
V
S
W
KD
T
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNGEPAEQW
R
Y
N
D
HGWL
fig|481805.6.peg.1765
Escherichia coli ATCC 8739 (1-726/1251)
MSGKPAARQGDMTQYGG
--------
P
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSG
N
PVNPLLGAKVLPGETD
F
ALPGPLPFILSRTYSSYRTKTPAPVG
I
FGPGWKAP
S
DI
H
LQLRDN
E
LILNDNGGRSIHFEHLFPGE
DG
F
SRSE
L
F
WLVRGGVA
K
L
NE
S
H
R
LA
P
LWQALPEELRLSPH
I
YLATNS
P
QGPWWILGW
S
ERVP
G
V
DE
M
LPAPLPPYRVLTGLVDRFGRTLTF
R
REAAGE
F
T
GEITGVTDGAGR
Q
FRLVLTTQAQRAE
N
A
---
R
QQ
A
I
AA
GAK
G
-
--
-----
-
PDI
PD
S
LP
D
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRY
D
WT
PR
GEL
A
AVYDRSGTQ
M
R
H
FTYDDKY
R
GRMVAHRYAGRPE
M
RYRYDD
T
GRVTEQ
F
NPAGLSYTYQYEK
N
RITITDSLNRREVLHTEGEAGLK
C
VVK
T
E
L
ADGS
I
T
R
S
K
FD
YM
GRL
Q
S
QTDAAGRTTEYSP
N
VVTG
L
V
T
C
ITTPDGR
KSE
FYYN
N
QN
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
TQ
ETAR
N
GD
V
TRY
S
YDNPHS
E
LP
SA
T
E
DATGSRK
Q
MTWSRYGQL
Q
T
FTDCSGY
E
T
H
YEYDRFGQ
M
M
AVHREEGIS
T
Y
N
T
Y
N
P
RGQL
V
S
W
KD
T
QG
R
ET
Q
YEYNAAGDLTAVI
T
PDG
N
RS
E
T
L
YDAWGKAV
S
------------------
TTQGGLTRSMEYDAAGRVI
S
LTNENGS
H
T
E
F
S
W
D
V
LDRL
I
QQ
R
GFDGRTQRY
R
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNGEPAEQW
R
Y
N
D
HGWL
fig|409438.11.peg.887
Escherichia coli SE11 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|331111.12.peg.4332
Escherichia coli E24377A (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|331111.3.peg.1735
Escherichia coli E24377A (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|585055.6.peg.698
Escherichia coli 55989 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGL
N
YTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|585055.8.peg.700
Escherichia coli 55989 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGL
N
YTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|331111.12.peg.1023
Escherichia coli E24377A (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|331111.3.peg.3242
Escherichia coli E24377A (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|595496.3.peg.627
Escherichia coli BW2952 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|536056.3.peg.3096
Escherichia coli DH1 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|83333.1.peg.692
Escherichia coli K12 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|316407.3.peg.675
Escherichia coli W3110 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|316385.5.peg.764
Escherichia coli str. K-12 substr. DH10B (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|316385.7.peg.776
Escherichia coli str. K-12 substr. DH10B (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|511145.12.peg.731
Escherichia coli str. K-12 substr. MG1655 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|511145.6.peg.722
Escherichia coli str. K-12 substr. MG1655 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|670888.3.peg.2766
Escherichia coli 1827-70 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
YA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|316401.4.peg.4223
Escherichia coli ETEC H10407 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|331111.12.peg.4692
Escherichia coli E24377A (1-726/1365)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|331111.3.peg.2089
Escherichia coli E24377A (1-726/1365)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLV
C
GGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|316401.4.peg.4362
Escherichia coli ETEC H10407 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|585055.6.peg.4446
Escherichia coli 55989 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|585055.8.peg.4450
Escherichia coli 55989 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|358709.5.peg.199
Escherichia coli 101-1 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|679205.4.peg.1938
Escherichia coli MS 124-1 (1-726/1387)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|478008.5.peg.2068
Escherichia coli O157:H7 str. EC869 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
S
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.373.peg.2622
Escherichia coli 1125A (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.372.peg.3653
Escherichia coli 1212A (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.374.peg.3612
Escherichia coli 536A (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444454.5.peg.3970
Escherichia coli O157:H7 str. EC4024 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444449.5.peg.3425
Escherichia coli O157:H7 str. EC4042 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444448.5.peg.2180
Escherichia coli O157:H7 str. EC4045 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444453.5.peg.2322
Escherichia coli O157:H7 str. EC4076 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444452.5.peg.1790
Escherichia coli O157:H7 str. EC4113 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444450.8.peg.5262
Escherichia coli O157:H7 str. EC4115 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444451.5.peg.2836
Escherichia coli O157:H7 str. EC4196 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444447.5.peg.2346
Escherichia coli O157:H7 str. EC4206 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478004.5.peg.2396
Escherichia coli O157:H7 str. EC4401 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478005.5.peg.2244
Escherichia coli O157:H7 str. EC4486 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478007.5.peg.2221
Escherichia coli O157:H7 str. EC508 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|544404.4.peg.5072
Escherichia coli O157:H7 str. TW14359 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|566546.4.peg.3830
Escherichia coli W (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
N
GDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|585034.4.peg.672
Escherichia coli IAI1 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|585034.5.peg.671
Escherichia coli IAI1 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|155864.1.peg.4481
Escherichia coli O157:H7 EDL933 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|155864.8.peg.4440
Escherichia coli O157:H7 EDL933 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|331112.3.peg.695
Escherichia coli HS (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|331112.6.peg.726
Escherichia coli HS (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|478008.5.peg.828
Escherichia coli O157:H7 str. EC869 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
S
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|413997.3.peg.3489
Escherichia coli B str. REL606 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|511693.5.peg.3505
Escherichia coli BL21 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|469008.4.peg.275
Escherichia coli BL21(DE3) (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|83333.1.peg.3415
Escherichia coli K12 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|340184.3.peg.375
Escherichia coli B7A (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIA
Q
PGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|340184.6.peg.395
Escherichia coli B7A (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIA
Q
PGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|562.373.peg.1621
Escherichia coli 1125A (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444454.5.peg.3568
Escherichia coli O157:H7 str. EC4024 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444449.5.peg.3022
Escherichia coli O157:H7 str. EC4042 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444448.5.peg.1776
Escherichia coli O157:H7 str. EC4045 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444453.5.peg.1307
Escherichia coli O157:H7 str. EC4076 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444452.5.peg.35
Escherichia coli O157:H7 str. EC4113 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444450.8.peg.4857
Escherichia coli O157:H7 str. EC4115 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444451.5.peg.209
Escherichia coli O157:H7 str. EC4196 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444447.5.peg.1939
Escherichia coli O157:H7 str. EC4206 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478004.5.peg.333
Escherichia coli O157:H7 str. EC4401 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478005.5.peg.287
Escherichia coli O157:H7 str. EC4486 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478007.5.peg.306
Escherichia coli O157:H7 str. EC508 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|544404.4.peg.4668
Escherichia coli O157:H7 str. TW14359 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|585396.4.peg.4438
Escherichia coli O111:H- str. 11128 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|595496.3.peg.3464
Escherichia coli BW2952 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|536056.3.peg.245
Escherichia coli DH1 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316407.3.peg.3646
Escherichia coli W3110 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316385.5.peg.3603
Escherichia coli str. K-12 substr. DH10B (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316385.7.peg.3684
Escherichia coli str. K-12 substr. DH10B (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|511145.12.peg.3581
Escherichia coli str. K-12 substr. MG1655 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|511145.6.peg.3564
Escherichia coli str. K-12 substr. MG1655 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.371.peg.1871
Escherichia coli 1044A (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|83334.1.peg.4839
Escherichia coli O157:H7 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478006.5.peg.2164
Escherichia coli O157:H7 str. EC4501 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|386585.9.peg.5086
Escherichia coli O157:H7 str. Sakai (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|502346.5.peg.1734
Escherichia coli O157:H7 str. TW14588 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|331112.3.peg.3453
Escherichia coli HS (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|331112.6.peg.3587
Escherichia coli HS (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|481805.3.peg.3167
Escherichia coli ATCC 8739 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|481805.6.peg.3152
Escherichia coli ATCC 8739 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|585055.6.peg.4084
Escherichia coli 55989 (1-726/1321)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|585055.8.peg.4087
Escherichia coli 55989 (1-726/1321)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|481805.3.peg.246
Escherichia coli ATCC 8739 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|481805.6.peg.256
Escherichia coli ATCC 8739 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|566546.4.peg.4232
Escherichia coli W (1-722/1390)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|409438.11.peg.4043
Escherichia coli SE11 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQG
D
LTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|679207.4.peg.188
Escherichia coli MS 107-1 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|155864.8.peg.748
Escherichia coli O157:H7 EDL933 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRIT
X
TDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
X
QLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.371.peg.3206
Escherichia coli 1044A (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|83334.1.peg.4441
Escherichia coli O157:H7 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478006.5.peg.299
Escherichia coli O157:H7 str. EC4501 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|386585.9.peg.4682
Escherichia coli O157:H7 str. Sakai (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|502346.5.peg.2160
Escherichia coli O157:H7 str. TW14588 (1-726/1409)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
T
F
GHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|409438.11.peg.4404
Escherichia coli SE11 (1-726/1374)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|155864.1.peg.721
Escherichia coli O157:H7 EDL933 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRIT
X
TDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
X
QLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|413997.3.peg.3621
Escherichia coli B str. REL606 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
NE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|1040638.4.peg.5049
Escherichia coli O104:H4 str. LB226692 (1-726/1235)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGL
N
YTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|585396.4.peg.4570
Escherichia coli O111:H- str. 11128 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|670888.3.peg.2635
Escherichia coli 1827-70 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTD
T
AGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
YA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|344610.7.peg.990
Escherichia coli 53638 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|595496.3.peg.3592
Escherichia coli BW2952 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|536056.3.peg.117
Escherichia coli DH1 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|83333.1.peg.3527
Escherichia coli K12 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316407.3.peg.3535
Escherichia coli W3110 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316385.5.peg.3725
Escherichia coli str. K-12 substr. DH10B (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316385.7.peg.3810
Escherichia coli str. K-12 substr. DH10B (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|511145.12.peg.3710
Escherichia coli str. K-12 substr. MG1655 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|511145.6.peg.3692
Escherichia coli str. K-12 substr. MG1655 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|701177.3.peg.4755
Escherichia coli O55:H7 str. CB9615 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|481805.3.peg.127
Escherichia coli ATCC 8739 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|481805.6.peg.128
Escherichia coli ATCC 8739 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGA
W
R
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AEQWQYDE
R
GWL
fig|340186.3.peg.1732
Escherichia coli E110019 (1-722/1373)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|340186.5.peg.1792
Escherichia coli E110019 (1-722/1373)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|340184.3.peg.2297
Escherichia coli B7A (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|340184.6.peg.2411
Escherichia coli B7A (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|585396.4.peg.4936
Escherichia coli O111:H- str. 11128 (1-722/1390)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|340186.3.peg.3871
Escherichia coli E110019 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
M
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|340186.5.peg.4064
Escherichia coli E110019 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
M
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|316401.4.peg.829
Escherichia coli ETEC H10407 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMV
V
HR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|562.375.peg.421
Escherichia coli EC4100B (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|670888.3.peg.1261
Escherichia coli 1827-70 (1-726/1110)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
T
LPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|573235.3.peg.4681
Escherichia coli O26:H11 str. 11368 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|340185.3.peg.3890
Escherichia coli E22 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|340185.4.peg.4100
Escherichia coli E22 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|656419.3.peg.4623
Escherichia coli M718 (1-726/1202)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
S
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTY
C
YE
Q
N
RITITDSL
D
RREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
M
QGHE
M
RYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AE
R
WQYDE
R
GWL
fig|573235.3.peg.778
Escherichia coli O26:H11 str. 11368 (1-726/1397)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
V
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFH
H
EAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|573235.3.peg.4775
Escherichia coli O26:H11 str. 11368 (1-726/1394)
MSGKPAARQGDMTQYG
S
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
S
RPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRY
A
QLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|566546.4.peg.753
Escherichia coli W (1-726/1110)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRV
I
EQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYNAAGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|344610.7.peg.2616
Escherichia coli 53638 (1-726/1377)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETD
L
ALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
T
QTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
L
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|344601.3.peg.1791
Escherichia coli B171 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|344601.5.peg.1871
Escherichia coli B171 (1-726/1411)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|585396.4.peg.751
Escherichia coli O111:H- str. 11128 (1-722/1393)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|585395.4.peg.4804
Escherichia coli O103:H2 str. 12009 (1-726/1389)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYG
R
T
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|562.373.peg.3029
Escherichia coli 1125A (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.372.peg.1697
Escherichia coli 1212A (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.374.peg.5281
Escherichia coli 536A (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|83334.1.peg.800
Escherichia coli O157:H7 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444451.5.peg.4504
Escherichia coli O157:H7 str. EC4196 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|386585.9.peg.845
Escherichia coli O157:H7 str. Sakai (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|502346.5.peg.462
Escherichia coli O157:H7 str. TW14588 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|701177.3.peg.4355
Escherichia coli O55:H7 str. CB9615 (1-726/1425)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
G
LTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|585395.4.peg.4862
Escherichia coli O103:H2 str. 12009 (1-726/1394)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYG
R
T
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|478008.5.peg.2122
Escherichia coli O157:H7 str. EC869 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|701177.3.peg.242
Escherichia coli O55:H7 str. CB9615 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444454.5.peg.5222
Escherichia coli O157:H7 str. EC4024 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444449.5.peg.5558
Escherichia coli O157:H7 str. EC4042 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444448.5.peg.3433
Escherichia coli O157:H7 str. EC4045 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444453.5.peg.775
Escherichia coli O157:H7 str. EC4076 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444452.5.peg.3560
Escherichia coli O157:H7 str. EC4113 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444450.8.peg.892
Escherichia coli O157:H7 str. EC4115 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|444447.5.peg.3613
Escherichia coli O157:H7 str. EC4206 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478004.5.peg.3837
Escherichia coli O157:H7 str. EC4401 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|478005.5.peg.1280
Escherichia coli O157:H7 str. EC4486 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|544404.4.peg.757
Escherichia coli O157:H7 str. TW14359 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVH
C
EEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|1040638.4.peg.1354
Escherichia coli O104:H4 str. LB226692 (1-726/1110)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|478008.5.peg.4739
Escherichia coli O157:H7 str. EC869 (1-735/1407)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
T
LWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|637388.3.peg.780
Escherichia coli O157:H7 str. FRIK2000 (1-735/1407)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
T
LWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|570506.3.peg.1715
Escherichia coli O157:H7 str. FRIK966 (1-735/1407)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
T
LWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|573235.3.peg.5159
Escherichia coli O26:H11 str. 11368 (1-722/1373)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
C
FTYDDKY
R
GRMVAHR
HT
GRPE
I
C
YRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
--
-
-
TTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
W
Y
YDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|562.373.peg.2700
Escherichia coli 1125A (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444454.5.peg.4700
Escherichia coli O157:H7 str. EC4024 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444449.5.peg.4155
Escherichia coli O157:H7 str. EC4042 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444448.5.peg.2910
Escherichia coli O157:H7 str. EC4045 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444453.5.peg.4272
Escherichia coli O157:H7 str. EC4076 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444452.5.peg.3166
Escherichia coli O157:H7 str. EC4113 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444450.8.peg.379
Escherichia coli O157:H7 str. EC4115 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444451.5.peg.3677
Escherichia coli O157:H7 str. EC4196 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|444447.5.peg.3086
Escherichia coli O157:H7 str. EC4206 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|478004.5.peg.3911
Escherichia coli O157:H7 str. EC4401 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|478005.5.peg.3843
Escherichia coli O157:H7 str. EC4486 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|478007.5.peg.3043
Escherichia coli O157:H7 str. EC508 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|544404.4.peg.241
Escherichia coli O157:H7 str. TW14359 (1-735/1410)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585034.4.peg.4037
Escherichia coli IAI1 (1-726/737)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|585034.5.peg.4033
Escherichia coli IAI1 (1-726/737)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HN
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|562.371.peg.801
Escherichia coli 1044A (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|155864.1.peg.237
Escherichia coli O157:H7 EDL933 (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|155864.8.peg.239
Escherichia coli O157:H7 EDL933 (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|478006.5.peg.3846
Escherichia coli O157:H7 str. EC4501 (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|502346.5.peg.1024
Escherichia coli O157:H7 str. TW14588 (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|701177.3.peg.856
Escherichia coli O55:H7 str. CB9615 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTL
S
G
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHT
Q
GE
G
GLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
I
RREYDE
W
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
G
LTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
WQYDE
R
GWL
fig|83334.1.peg.324
Escherichia coli O157:H7 (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|386585.9.peg.339
Escherichia coli O157:H7 str. Sakai (1-735/1404)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585395.4.peg.737
Escherichia coli O103:H2 str. 12009 (1-726/1399)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLL
T
FTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|585034.4.peg.3680
Escherichia coli IAI1 (1-726/737)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|585034.5.peg.3677
Escherichia coli IAI1 (1-726/737)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGV
L
R
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYNAAGDLT
T
VIAPDGSR
N
GTQYDAWGKA
I
C
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTVNGE
T
AEQWQYDE
R
GWL
fig|679207.4.peg.2757
Escherichia coli MS 107-1 (1-726/967)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACS
G
CPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQ
Q
AEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
AVYDRS
N
TQVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKDRITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
R
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
L
RREYDE
S
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
D
DATGSRKTMTWSRYGQLLSFTDCSGY
Q
TRY
DH
DRFGQ
M
TAVHREEG
L
S
Q
YR
A
YD
S
RGQL
I
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKAV
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|670888.3.peg.824
Escherichia coli 1827-70 (1-735/1409)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
R
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
M
RY
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWL
fig|340185.3.peg.2269
Escherichia coli E22 (1-726/1140)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|340185.4.peg.2403
Escherichia coli E22 (1-726/1140)
MSGKPAARQGDMTQYGG
--------
S
IVQGSAGVRIGAPTGVACSVCPGGV
---
TSGHPVNPLLGAKVLPGETDIALP
A
PLPFILSRTYSSYRTKTPAPVG
S
L
GPGWK
M
P
A
DIRLQLRDN
T
LIL
S
DNGGRS
LY
FEHLFPGE
DG
YSRSES
L
WLVRGGVA
K
L
DE
GH
R
LAALWQALPEELRLSPH
R
YLATNS
P
QGPWW
L
LGW
C
ERVPEADEVLPAPLPPYRVLTGLVDRFGRT
Q
TFHREAAGE
F
SGEITGVTDGAGR
H
FRLVLTTQAQRAEEA
---
R
QQ
A
I
S
G
GTE
P
-
--
-----
-
SAFPDTLPG
Y
TEYG
R
DNGIRLSAVWLTHDPEYPE
N
LP
A
APLVRYGWT
PR
GEL
A
V
VYDRSG
K
QVR
S
FTYDDKY
R
GRMVAHR
HT
GRPE
I
RYRYD
S
D
GRVTEQLNPAGLSYTYQYEKD
H
ITITDSL
D
RREVLHT
Q
GEAGLKRVVKKEHADGSVT
Q
S
Q
FDA
V
GRL
K
AQTDAAGRTTEYSPDVVTG
L
IT
R
ITTPDGR
ASA
FYYNH
HS
QLTSAT
G
PDGLE
M
RR
K
YDE
Y
GRL
IQ
ETA
P
DGDITRYRYDNPHSDLP
CA
T
E
DATGSRKTMTWSRYGQLL
T
FTDCSGY
V
TRY
DH
DRFGQ
V
TAVHREEG
L
S
Q
Y
H
A
YD
S
RGQL
T
A
VKD
T
QGHETRYEYN
I
AGDLTAVIAPDGSR
N
GTQYDAWGKA
I
R
------------------
TTQGGLTRSMEYDAAGRVI
R
LT
S
ENGS
H
T
T
F
R
YD
V
LDRL
I
Q
E
T
GFDGRTQRYH
H
DLTGKL
IR
SEDEGLVT
H
WHYDE
A
DRLTHRTV
K
GE
T
AE
R
W
R
YDE
R
GWL
fig|562.372.peg.2869
Escherichia coli 1212A (1-735/1137)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|562.374.peg.1331
Escherichia coli 536A (1-735/1137)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNP
V
LGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
M
YLATNS
L
QGPWWIL
N
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
A
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTE
L
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
T
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|344610.3.peg.2570
Escherichia coli 53638 (1-735/1411)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWILGW
S
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
R
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
M
RY
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWL
fig|344610.7.peg.1444
Escherichia coli 53638 (1-735/1411)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWILGW
S
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
R
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
M
RY
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
V
APDGSRS
EI
QYDAWGKAV
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWL
fig|340186.3.peg.756
Escherichia coli E110019 (1-735/1419)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|340186.5.peg.783
Escherichia coli E110019 (1-735/1419)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|595495.4.peg.4416
Escherichia coli KO11 (1-735/1349)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAG
G
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AS
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
IAPDGSRS
E
TQYDAWGKA
I
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|566546.3.peg.4451
Escherichia coli W (1-735/1349)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAG
G
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AS
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
IAPDGSRS
E
TQYDAWGKA
I
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|566546.4.peg.234
Escherichia coli W (1-735/1349)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAG
G
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AS
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
I
IAPDGSRS
E
TQYDAWGKA
I
S
------------------
TTQGGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|585396.4.peg.245
Escherichia coli O111:H- str. 11128 (1-732/1409)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
-
--
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|331112.3.peg.235
Escherichia coli HS (1-735/1417)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|331112.6.peg.240
Escherichia coli HS (1-735/1417)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|679206.4.peg.2787
Escherichia coli MS 119-7 (1-738/1420)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSPN
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585034.4.peg.245
Escherichia coli IAI1 (1-738/1415)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585034.5.peg.244
Escherichia coli IAI1 (1-738/1415)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585034.4.peg.1446
Escherichia coli IAI1 (1-738/1420)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSPN
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585034.5.peg.1442
Escherichia coli IAI1 (1-738/1420)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSPN
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|550676.3.peg.751
Escherichia coli B185 (1-738/1423)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TG
G
TDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
SSF
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
F
A
YD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
E
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585034.4.peg.239
Escherichia coli IAI1 (1-735/1408)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|585034.5.peg.239
Escherichia coli IAI1 (1-735/1408)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
VS
YSRSES
F
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
V
DNGIRL
E
AVWLTHDP
A
YP
D
E
LP
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
GQ
DR
V
TITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
AA
ET
S
R
S
G
ET
T
S
Y
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|344601.3.peg.1933
Escherichia coli B171 (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|344601.5.peg.2014
Escherichia coli B171 (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|340185.3.peg.1517
Escherichia coli E22 (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|340185.4.peg.1604
Escherichia coli E22 (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|585395.4.peg.240
Escherichia coli O103:H2 str. 12009 (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|409438.11.peg.1688
Escherichia coli SE11 (1-738/1421)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSPN
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
DAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNG
D
PAEQWQYDEHGWL
fig|331111.12.peg.571
Escherichia coli E24377A (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALP
C
PLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DA
M
GS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KDAQG
R
ET
P
YEY
S
AAGDLTA
TV
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|331111.3.peg.2807
Escherichia coli E24377A (1-738/1415)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
NSPIEEQK
G
N
PVNPLLGAKVLPGETD
L
ALP
C
PLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DA
M
GS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQ
M
V
S
Q
KDAQG
R
ET
P
YEY
S
AAGDLTA
TV
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRYHYDLT
R
KLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|679206.4.peg.3200
Escherichia coli MS 119-7 (1-735/830)
M
G
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCPGG
I
---
T
YAN
PVNPLLGAKVLPGETD
L
ALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILNDNGGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
E
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
PEP
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
LVLTTQAQRAE
VFRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
LV
FPDTLP
A
G
T
G
YG
T
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEYS
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
K
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
E
I
RYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
I
A
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|340186.3.peg.183
Escherichia coli E110019 (1-737/1323)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSPN
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
-
P
PE
LP
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSG
M
QVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
TAVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
DAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|340186.5.peg.196
Escherichia coli E110019 (1-737/1323)
MSGKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
KKKDSPN
Y
G
N
PVNP
V
LGAKVLPGETDIALPGPLPFILSR
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
-
P
PE
LP
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVLTTQAQRAE
A
FRKQ
R
AT
S
LSS
PAG
P
R
SA
SS
---
S
SAFPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSG
M
QVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
TAVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
DAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|409438.11.peg.355
Escherichia coli SE11 (4-739/1411)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGR
S
L
I
FHREAAGE
L
A
GEITGVTDGAGR
R
F
H
L
A
L
S
TQAQRAE
A
FRKQ
R
VT
S
LSS
PAG
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
E
I
S
R
N
G
N
ITR
W
F
YD
S
SR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQL
Q
A
FTDCSGY
A
TRYEYDR
Y
GQ
Q
I
A
I
HREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
T
LTNENGS
Q
S
T
F
L
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
THRTVNG
D
PAEQWQYDEHGWL
fig|679207.4.peg.4764
Escherichia coli MS 107-1 (4-739/834)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
L
WL
A
RGGVA
A
Q
HS
SQ
P
L
S
ALWQ
V
LPE
D
V
RLSPH
V
YLATNS
L
QGPWWIL
S
W
P
ERVP
G
ADEVLP
P
P
P
P
A
YRVLTG
V
VD
G
FGRTL
A
FHR
A
A
K
G
D
V
A
G
AV
TGVTDGAGR
R
F
H
L
A
LTTQAQRAE
A
FRKQ
R
AS
S
LSS
PAS
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AG
GEL
R
AVYDRSG
M
QVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
C
YRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
A
V
T
A
V
T
G
PDGR
TVR
YG
YN
S
QR
Q
V
TS
V
T
Y
PDGL
R
S
S
REYDE
R
GRL
TA
ET
S
R
S
G
ET
TRY
S
YD
D
P
A
S
E
LP
TG
I
Q
DATGS
T
K
Q
M
A
WSRYGQLL
A
FTDCSGY
T
TRYEYDR
Y
GQ
Q
TAVHREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
R
DAQG
R
ETRYEY
S
AAGDLTA
TV
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|1040638.4.peg.5525
Escherichia coli O104:H4 str. LB226692 (4-739/834)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGR
S
L
I
FHREAAGE
L
A
GEITGVTDGAGR
R
F
H
L
A
L
S
TQAQRAE
A
FRKQ
R
VT
S
LSS
PAG
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
E
I
S
R
N
G
N
ITR
W
F
YD
S
SR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQL
Q
A
FTDCSGY
A
TRYEYDR
Y
GQ
Q
I
A
I
HREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
T
LTNENGS
Q
S
T
F
L
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|585055.6.peg.240
Escherichia coli 55989 (4-739/834)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGR
S
L
I
FHREAAGE
L
A
GEITGVTDGAGR
R
F
H
L
A
L
S
TQAQRAE
A
FRKQ
R
VT
S
LSS
PAG
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
E
I
S
R
N
G
N
ITR
W
F
YD
S
SR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQL
Q
A
FTDCSGY
A
TRYEYDR
Y
GQ
Q
I
A
I
HREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
T
LTNENGS
Q
S
T
F
L
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|585055.8.peg.240
Escherichia coli 55989 (4-739/834)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
I
RD
E
G
LILND
S
GGRSIHFE
P
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGR
S
L
I
FHREAAGE
L
A
GEITGVTDGAGR
R
F
H
L
A
L
S
TQAQRAE
A
FRKQ
R
VT
S
LSS
PAG
P
R
SV
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
E
I
S
R
N
G
N
ITR
W
F
YD
S
SR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQL
Q
A
FTDCSGY
A
TRYEYDR
Y
GQ
Q
I
A
I
HREEGIS
T
Y
S
S
Y
N
P
RGQL
V
S
Q
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
T
IE
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
T
LTNENGS
Q
S
T
F
L
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|749538.3.peg.1684
Escherichia coli MS 116-1 (4-739/1410)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
S
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVL
S
TQAQRAE
A
FRKQ
R
ES
S
LSS
PAG
P
R
SA
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
ET
S
R
N
G
N
ITR
W
F
YD
FSR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQLL
A
FTDCSGY
T
TRYEYD
Q
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
I
S
R
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
A
T
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|749548.3.peg.5106
Escherichia coli MS 196-1 (4-739/1410)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
S
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVL
S
TQAQRAE
A
FRKQ
R
ES
S
LSS
PAG
P
R
SA
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
ET
S
R
N
G
N
ITR
W
F
YD
FSR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQLL
A
FTDCSGY
T
TRYEYD
Q
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
I
S
R
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
A
T
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
fig|749544.3.peg.3348
Escherichia coli MS 175-1 (4-739/1410)
GKPAARQGDMT
RK
G
L
--------
D
IVQGSAGV
L
IGAPTGVACSVCP
TKKDSPN
Y
G
S
PVNPLLGAKVLP
V
ETD
L
ALPGPLPFIL
F
R
A
YSSYRT
R
TPAPVG
V
FGPGWKAP
F
DIRLQ
V
H
E
R
E
LILND
S
GGRSIHFE
S
LFPGE
IS
YSRSES
F
WL
A
RGGV
L
K
Q
HK
GH
P
LA
R
LW
R
ALPE
A
V
RLSPH
T
Y
M
MAV
S
T
T
G
Q
W
L
ILGW
P
ERVPEADEV
P
P
PEP
P
A
YRVLTG
V
VD
G
FGRTLTFHR
A
A
E
G
D
V
A
G
AV
TGVTDGAGR
C
F
H
LVL
S
TQAQRAE
A
FRKQ
R
ES
S
LSS
PAG
P
R
SA
SS
---
S
Q
V
FPDTLP
A
G
TEYG
A
DNGIRL
E
AVWLTHDP
A
YP
D
E
Q
P
T
APL
A
RY
TY
T
AS
GEL
R
AVYDRSGTQVR
G
FTYD
A
E
H
A
GRMVAH
H
YAGRPE
S
RYRYDD
T
GRVTEQ
V
NP
E
GL
D
Y
RFE
Y
G
E
S
R
V
I
ITDSLNRREVL
Y
TEGE
G
GLKRVVKKEHADGS
I
T
R
S
E
Y
D
E
AGRL
K
AQTDAAGR
R
TEY
R
L
H
MAS
G
K
L
T
S
V
IL
PDGR
TVR
YG
YN
S
QL
QLTS
V
T
Y
PDGL
R
S
S
R
K
YD
R
Q
GRL
AE
ET
S
R
N
G
N
ITR
W
F
YD
FSR
S
G
LP
CA
V
E
D
G
TG
V
R
R
R
I
T
R
N
RYGQLL
A
FTDCSGY
T
TRYEYD
Q
Y
GQ
Q
I
AVHREEGIS
T
Y
S
S
Y
N
P
RGQL
I
S
R
KDAQG
R
ETRYEY
S
AAGDLTA
T
I
S
PDG
K
RS
A
T
E
YD
KR
G
R
P
V
S
------------------
V
T
E
GGLTRSM
G
YDAAGR
IT
V
LTNENGS
Q
S
T
F
R
YD
P
V
DRL
T
E
Q
R
GFDGRTQRY
Q
YDLTGKLTQSEDEGL
I
TLWHYD
A
SDR
I
T
R
RTVNGEPAEQWQYD
D
HGWL
Consen1
Primary consensus
MsGKPAARQGDMTqyGg
--------
IVQGSAGVrIGAPTGVACSVCPGGv
---
TsghPVNPLLGAKVLPGETDiALPgPLPFILSRtYSSYRTkTPAPVG
fGPGWKaP
DIRLQlRDn
LiLnDNGGRSihFEhLfPGE
YSRSES
WLvRGGvA
l
gh
LaaLWqaLPeelRLSPH
YLATNS
QGPWWiLgW
ERVPeAdeVLPaplPpYRVLTGlvDrFGRTltfhReAaGe
sGeiTGVTDGAGR
FrLVLTTQAQRAEea
---
r
alSs
p
-
-----
saFPDTLPg
TEYG
DnGIRLsAVWLtHDPeYPe
lP
APLvRYgwT
GEL
aVYDRSgtQVR
FTYDdky
GRMVaHryaGRPE
RYRYDd
GRVtEQlNPaGLsYtyqYekDriTiTDSLnRREVLhTeGeaGLKRVVKKEhADGSvT
S
fDaaGRL
AQTDAAGRtTEYspdvvtG
iT
iTtPDGR
fyYNh
QlTsat
PDGLe
rReYDE
GRL
ETardGditRYrYDnphSdLP
t
DATGSrktMtWSRYGQLLsFTDCSGY
TRYeyDRfGQ
tAVHREEGiS
Yr
Yd
RGqL
svKDaQGhETrYEYnaAGDLTaviaPDGsRsgtqYDAWGKAv
------------------
TTQGGLTRSMeYDaAGRvi
LTnENGS
t
F
YD
lDRL
qq
GFDGRTQRYHyDLTGKLtqSEDEGLvTlWhYDesDRlTHRTVnGepAEqWqYDehGWL
Consen2
Secondary consensus
g
rk
l
l
yan
l
a
a
r
l
m
i
v
s
ly
p
l
a
k
q
sq
sr
gv
pd
l
s
g
ed
pep
a
a
g
qayr
a
e
d
a
av
h
vfrkqh
si
g
sr
ss
lv
a
r
e
m
a
d
q
a
ty
v
nk
a
h
g
hht
s
v
v
e
d
rfe
gq
hv
v
d
y
q
gg
l
i
y
ev
r
gl
mas
v
v
g
yg
v
avv
r
s
a
sps
etv
s
daa
e
i
trq
a
dh
y
i
l
s
n
r
aq
t
r
q
si
ttvt
n
neie
i
g
l
it
s
s
v
ee
h
ir
i
h
y
aa
i
k
dt
r
r
dr
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character