fig|1040638.4.peg.2193
Escherichia coli O104:H4 str. LB226692
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|6666666.5357.peg.3663
Escherichia coli TY-2482
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|585055.6.peg.4033
Escherichia coli 55989
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|585055.8.peg.4036
Escherichia coli 55989
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|595495.4.peg.3877
Escherichia coli KO11
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|679207.4.peg.242
Escherichia coli MS 107-1
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|566546.3.peg.4195
Escherichia coli W
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|566546.4.peg.3777
Escherichia coli W
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|585034.4.peg.3629
Escherichia coli IAI1
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|585034.5.peg.3626
Escherichia coli IAI1
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|585395.4.peg.4914
Escherichia coli O103:H2 str. 12009
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|585396.4.peg.4519
Escherichia coli O111:H- str. 11128
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|573235.3.peg.5210
Escherichia coli O26:H11 str. 11368
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|340185.3.peg.49
Escherichia coli E22
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVH
A
V
CEK
fig|340185.4.peg.52
Escherichia coli E22
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVH
A
V
CEK
fig|550672.3.peg.3605
Escherichia coli B088
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
S
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDCN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|562.375.peg.476
Escherichia coli EC4100B
MH
W
M
H
LPLYHYR-AHFSFSL
---
LAL---TIASSLPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
I
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
S
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAKRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKMS
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGT
G
S
SS
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRL
L
N
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSDKPH
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKE
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
V
A
YTWTRSPE
-----
WEED
D
RLWSFS
VSIP
-----
--
-------
-
LGGAWSSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
SY
N
V
QQ
G
YTSNGVGYSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GA
V
VRSR
F
D
T
RV
G
YRVLMSLKQAN
G
NAV
PFG
ATAALI
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
NEQAQR
C
RVPFRLPENKDNAA
I
VMVN
A
V
CEK
fig|685038.3.peg.3604
Escherichia coli O83:H1 str. NRG 857C
MH
W
M
H
LPLDHYR-AHFSFSL
---
LAL---TIASALPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
V
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
F
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAQRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RIDSF
-------
PALKIL
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
LT
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGS
G
S
SP
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
TL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
S
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSGKPR
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKD
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
I
A
YTWTRSPE
-----
WDED
D
RLWSFS
L
SIP
-----
--
-------
-
LGGAWGSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
N
Y
N
V
QQ
G
YTSNGVGNSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GAI
VRSR
F
D
T
RV
G
YRVLMSLKRAN
G
NAV
PFG
ATAALS
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
HEQAQR
C
RVPFRLPEKKDNSG
I
VMVN
A
V
C
D
K
fig|656440.3.peg.3823
Escherichia coli TA206
MH
W
M
H
LPLDHYR-AHFSFSL
---
LAL---TIASALPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
V
DLS
VYDSPV
G
QQI
PG
K
Y
R
V
F
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAQRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RIDSF
-------
PALKIL
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
LT
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGS
G
S
SP
D
---------------
-STSNSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
S
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSGKPR
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKD
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
I
A
YTWTRSPE
-----
WDED
D
RLWSFS
L
SIP
-----
--
-------
-
LGGAWGSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
N
Y
N
V
QQ
G
YTSNGVGNSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GAI
VRSR
F
D
T
RV
G
YRVLMSLKRAN
G
NAV
PFG
ATAALS
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
HEQAQR
C
RVPFRLPEKKDNSG
I
VMVN
A
V
C
D
K
fig|585397.7.peg.4224
Escherichia coli ED1a
MH
W
M
H
LPLDHYR-AHFSFSL
---
LAL---TIASALPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
V
DLS
VYDFPV
G
QQI
PG
K
Y
R
V
F
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAQRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKIL
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
LT
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGS
G
S
SP
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
S
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSGKPR
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKD
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
I
A
YTWTRSPE
-----
WDED
D
RLWSFS
L
SIP
-----
--
-------
-
LGGAWGSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
N
Y
N
V
QQ
G
YTSNGVGNSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GAI
VRSR
F
D
T
RV
G
YRVLMSLKRAN
G
NAV
PFG
ATAALS
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
HEQAQR
C
RVPFRLPEKKDNSG
I
VMVN
A
V
C
D
K
fig|585397.9.peg.4221
Escherichia coli ED1a
MH
W
M
H
LPLDHYR-AHFSFSL
---
LAL---TIASALPAYGGK
FN
PKF
L
-ENVQGI
D
Q
H
V
DLS
VYDFPV
G
QQI
PG
K
Y
R
V
F
V
FV
N
EEKMA
--
SRT
L
DFS
TAS
EAQRKASGE
S
LMP
C
L
S
RVQ
L
EEM
G
V
RVDSF
-------
PALKIL
--
PPEA
C
V
-
AFDEIIPQATSRF
D
FNTQT
L
H
LT
F
-
PQA
A
M
MMTAR
G
T
V
D
P
SR
WD
-----
E
GI
PALLL
D
Y
S
FSGSNGRNEGS
G
S
SP
D
---------------
-STSDSYYLNLR
SG
L
N
V
G
P
WRLRN
NSIWN------RTD
--------
-GKNQWDNVGTSLN
R
AIIP
L
KSQ
I
T
LGD
TA
T
P
G
E
IFDS
VQMR
G
AL
L
A
SD
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YATSGS
GDL
T
V
I
I
K
E
S
DG
SEQR
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
G
EYRA-GNYDSGKPR
F
GQFTAMY
G
LPWGM
T
A
YGG
A
-
LLSAD
Y
N
A
LAL
G
L
G
KNFGTI
GA
V
S
V
D
V
T
Q
A
K
S
Q
L
R
-----
NNEKD
-
E
G
Q
S
Y
R
FL
Y
S
K
SF-
-
EGG
T
DLR
L
LG
Y
K
YS
TSG
Y
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYR-----------RYHKR-
-
SQIQG
N
IT
Q
Q
L
GD-YG
S
VYF
N
MTQ
Q
D
YW
NV
D
GKEN
-
SLSA
G
YH
---
GHIGR
V
NY
S
I
A
YTWTRSPE
-----
WDED
D
RLWSFS
L
SIP
-----
--
-------
-
LGGAWGSYRMTTDQNGKTSQQASVS
G
TLLEDRN
-
L
N
Y
N
V
QQ
G
YTSNGVGNSGSVN---MG
Y
MGGS
G
NIDVG
Y
N
YS
--
KD
-
-NQQVNYGV
R
GG
VIVHSE
G
I
TL
--
SQPLG
E
SLA
IV
S
A
P
-
G
ARGGH
V
V
-
NSSGVE
V
D
WM
G
NA
V
V
P
YL
T
P
YR
E
T
I
V
E
L
R
SDTLGQNVE
L
QE
A
FQKVVPTR
GAI
VRSR
F
D
T
RV
G
YRVLMSLKRAN
G
NAV
PFG
ATAALS
--
D
ESKPAS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGE
-
LQ
V
S
WG
HEQAQR
C
RVPFRLPEKKDNSG
I
VMVN
A
V
C
D
K
fig|656419.3.peg.4577
Escherichia coli M718 (11-857/857)
SSFSISV
---
VAV---AVASTFSAHAGK
FN
PKF
L
-EDVQGV
G
Q
H
V
DL
T
MFEKGQ
E
QQL
PG
I
Y
R
V
S
V
YV
N
EQRME
--
TRT
L
EFK
EA
T
EAQRKAMGE
S
LVP
C
L
S
RTQ
L
AEM
G
V
RVESF
-------
PALNLV
--
PAEA
C
V
-
PFDEIIPQASSHF
D
FSEQK
L
V
L
S
F
-
PQA
A
M
HQVAR
G
T
V
P
E
SL
WD
-----
E
GI
PALLL
D
Y
S
FSGSNSEYDST
G
S
SS
SYVDDNGTVHHDDGKD
TLKSDSYYLNLR
SG
L
N
L
G
A
WRLRN
YSTWS------HSG
--------
-GKAQWDNIGTSLS
R
AIIP
F
KAQ
L
T
M
GD
TA
T
A
GDIFDS
VQMR
G
AM
L
A
SD
EE
MLP
DSQR
GFAP
I
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTY
V
Q
PG
A
F
E
I
N
DL
YPTANS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
SS
V
P
I
F
Q
R
E
G
HL
K
Y
SF
A
A
G
EYQA-GNYDSASPR
F
GQLDLIY
G
LPWGM
T
A
YGG
V
-
LISNN
Y
N
A
FAL
G
I
G
KNFGYI
GA
I
S
I
D
V
T
Q
A
K
S
E
L
N
-----
NDRDS
-
Q
G
Q
S
Y
R
FL
Y
S
K
SF-
-
ESG
T
DFR
L
AG
YRYS
TSG
F
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYN-----------RYHKR-
-
SEIQG
N
LT
Q
Q
L
GA-YG
S
VYL
N
LTQ
Q
D
YW
ND
A
GKQN
-
TVSA
G
YN
---
GRIGK
V
SY
S
I
A
YSWNKSPE
-----
WDES
D
RLWSFN
I
S
V
P
-----
--
-------
-
LGRAWSNYRVTTDQDGRTNQQVGVS
G
TLLEDRN
-
L
SY
S
V
QE
G
YASNGVGNSGNAN---VG
Y
QGGS
G
NVNVG
YS
YG
--
KD
-
-YRQLNYSV
R
GG
VIVHSE
G
V
TL
--
SQPLG
E
TMT
LI
S
V
P
-
G
ARNAR
V
V
-
NNGGVQ
V
D
WM
G
NA
I
V
P
YA
M
P
YR
E
N
E
I
S
L
R
SDSLGDDVD
V
EN
A
FQKVVPTR
GAI
VRAR
F
D
T
RV
G
YRVLMTLLRSA
G
SPV
PFG
ATATLI
-
TD
KQNEVS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGR
-
VL
I
K
WG
NDASQQ
C
VAPYKLSLELKQGG
I
VPVS
A
N
C
Q
fig|701177.3.peg.4309
Escherichia coli O55:H7 str. CB9615 (11-857/857)
SSFSISV
---
VAV---AVASTFSAHAGK
FN
PKF
L
-EDVQGV
G
Q
H
V
DL
T
MFEKGQ
E
QQL
PG
I
Y
R
V
S
V
YV
N
EQRME
--
TRT
L
EFK
EA
T
EAQRKAMGE
S
LVP
C
L
S
RTQ
L
AEM
G
V
RVESF
-------
PALNLV
--
SAEA
C
V
-
PFDEIIPLASSHF
D
FSEQK
L
V
L
S
F
-
PQA
A
M
HQVAR
G
T
V
P
E
SL
WD
-----
E
GI
PALLL
D
Y
S
FSGSNSEYDST
G
S
SS
SYVDDNGTVHHDDGKD
TLKSDSYYLNLR
SG
L
N
L
G
A
WRLRN
YSTWS------HSG
--------
-GKAQWDNIGTSLS
R
AIIP
F
KAQ
L
T
M
GD
TA
T
A
GDIFDS
VQMR
G
AM
L
A
SD
EE
MLP
DSQR
GFAP
I
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTY
V
Q
PG
A
F
E
I
N
DL
YPTANS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IQ
P
F
SS
V
P
I
F
Q
R
E
G
HL
K
Y
SF
A
A
G
EYQA-GNYDSASPR
F
GQLDLIY
G
LPWGM
T
A
YGG
V
-
LISNN
Y
N
A
FAL
G
I
G
KNFGYI
GA
I
S
I
D
V
T
Q
A
K
S
E
L
N
-----
NDRDS
-
Q
G
Q
S
Y
R
FL
Y
S
K
SF-
-
ESG
T
DFR
L
AG
YRYS
TSG
F
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYN-----------RYHKR-
-
SEIQG
N
LT
Q
Q
L
GA-YG
S
VYL
N
LTQ
Q
D
YW
ND
A
GKQN
-
TVSA
G
YN
---
GRIGK
V
SY
S
I
A
YSWNKSPE
-----
WDES
D
RLWSFN
I
S
V
P
-----
--
-------
-
LGRAWSNYRVTTDQDGRTNQQVGVS
G
TLLEDRN
-
L
SY
S
V
QE
G
YASNGVGNSGNAN---VG
Y
QGGS
G
NVNVG
YS
YG
--
KD
-
-YRQLNYSV
R
GG
VIVHSE
G
V
TL
--
SQPLG
E
TMT
LI
S
V
P
-
G
ARNAR
V
V
-
NNGGVQ
V
D
WM
G
NA
I
V
P
YA
M
P
YR
E
N
E
I
S
L
R
SDSLGDDVD
V
EN
A
FQKVVPTR
GAI
VRAR
F
D
T
RV
G
YRVLMTLLRSA
G
SPV
PFG
ATATLI
-
TD
KQNEVS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGR
-
VL
I
K
WG
NDASQQ
C
VAPYKLSLELKQGG
I
IPVS
A
N
C
Q
fig|155864.1.peg.4437
Escherichia coli O157:H7 EDL933 (11-857/857)
SSFSISV
---
VAV---AVASTFSAHAGK
FN
PKF
L
-EDVQGV
G
Q
H
V
DL
T
MFEKGQ
E
QQL
PG
I
Y
R
V
S
V
YV
N
EQRME
--
TRT
L
EFK
EA
T
EAQRKAMGE
S
LVP
C
L
S
RTQ
L
AEM
G
V
RVESF
-------
PALNLV
--
SAEA
C
V
-
PFDEIIPLASSHF
D
FSEQK
L
V
L
S
F
-
PQA
A
M
HQVAR
G
T
V
P
E
SL
WD
-----
E
GI
PALLL
D
Y
S
FSGSNSEYDST
G
S
SS
SYVDDNGTVHHDDGKD
TLKSDSYYLNLR
SG
L
N
L
G
A
WRLRN
YSTWS------HSG
--------
-GKAQWDNIGTSLS
R
AIIP
F
KAQ
L
T
M
GD
TA
T
A
GDIFDS
VQMR
G
AM
L
A
SD
EE
MLP
DSQR
GFAP
I
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RTY
V
Q
PG
A
F
E
I
N
DL
YPTANS
GDL
T
V
I
I
K
E
A
DG
SEQR
F
IX
P
F
SS
V
P
I
F
Q
R
E
G
HL
K
Y
SF
A
A
G
EYQA-GNYDSASPR
F
GQLDLIY
G
LPWGM
T
A
YGG
V
-
LISNN
Y
N
A
FTL
G
I
G
KNFGYI
GA
I
S
I
D
V
T
Q
A
K
S
E
L
N
-----
NDRDS
-
Q
G
Q
S
Y
R
FL
Y
S
K
SF-
-
ESG
T
DFR
L
AG
YRYS
TSG
F
Y
T
FQ
E
AT---
-
-
--
DVRSDADSDYN-----------RYHKR-
-
SEIQG
N
LT
Q
Q
L
GA-YG
S
VYL
N
LTQ
Q
D
YW
ND
A
GKQN
-
TVSA
G
YN
---
GRIGK
V
SY
S
I
A
YSWNKSPE
-----
WDES
D
RLWSFN
I
S
V
P
-----
--
-------
-
LGRAWSNYRVTTDQDGRTNQQVGVS
G
TLLEDRN
-
L
SY
S
V
QE
G
YASNGVGNSGNAN---VG
Y
QGGS
G
NVNVG
YS
YG
--
KD
-
-YRQLNYSV
R
GG
VIVHSE
G
V
TL
--
SQPLG
E
TMT
LI
S
V
P
-
G
ARNAR
V
V
-
NNGGVQ
V
D
WM
G
NA
I
V
P
YA
M
P
YR
E
N
E
I
S
L
R
SDSLGDDVD
V
EN
A
FQKVVPTR
GAI
VRAR
F
D
T
RV
G
YRVLMTLLRSA
G
SPV
PFG
ATATLI
-
TD
KQNEVS
S
IV
G
E
E
G
QL
Y
I
S
G
MPEEGR
-
VL
I
K
WG
NDASQQ
C
VAPYKLSLELKQGG
I
IPVS
A
N
C
Q
fig|216593.1.peg.4284
Escherichia coli E2348/69
MS
W
M
V
VSRTYTSFFPFSLSV
---
LAL---TVAGSFSATAGK
FN
PRF
L
-EDTAGI
N
Q
H
V
DLS
MYETDH
G
AQL
PG
T
Y
R
V
S
L
IV
N
EQKME
--
TRT
L
EFK
AA
T
ESQRKEMGE
F
LIP
C
L
S
RTQ
L
ADM
G
V
RVDSF
-------
SALNLI
--
PAEA
C
V
-
AFNEIIPQATSHF
D
FSEQK
L
V
M
S
F
-
PQA
A
M
QQVAR
G
T
V
P
E
SR
WD
-----
D
G
V
PALLL
D
Y
S
FSGSNSSHD-T
K
S
YN
RYIDENGNHHQDKNET
SQTNDSYYLSMR
SG
L
N
L
G
A
WRLRN
YSNWS------YSN
--------
-GEKQWDNIGTYVT
R
AIVP
L
KAQ
L
T
LGD
TA
T
P
S
DIFDS
VQMR
G
AL
L
A
SD
EE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RAF
V
Q
PG
A
F
E
I
N
DL
YPTANS
GDL
T
V
I
V
K
E
A
DG
SEQR
F
IQ
P
F
SS
V
A
V
F
Q
R
E
G
HL
K
Y
SL
S
A
G
EYRA-GNYDSANPR
F
GQFNAMY
G
LPFGM
T
T
YGG
A
-
LISKD
Y
N
A
FAL
G
L
G
KNFGSI
GA
I
S
V
D
I
T
Q
A
K
S
T
L
N
-----
NNATD
-
Q
G
Q
S
Y
R
FL
Y
S
K
SF-
-
ASG
T
DFR
L
LG
Y
K
YS
TSG
F
Y
T
FQ
E
AT---
-
-
--
DVRSGADSDYG-----------RYHKR-
-
SEIQG
N
LT
Q
Q
L
GT-YG
S
VYF
N
MTQ
Q
D
YW
ND
D
GKRL
-
SLAT
G
YN
---
GRIGR
V
NY
S
I
A
YSWNKSPE
-----
WDEN
D
QLWSFN
I
SIP
-----
--
-------
-
FGRAWSNYRVTTDQDGRTIQQLGVN
G
TLLEDRN
-
L
SY
N
V
QE
G
YSSNGVGNSGNAS---LA
Y
QGGA
G
NISVG
YS
YG
--
KD
-
-YQQTNYSL
R
GG
IVAHSE
G
I
S
L
--
SQPLG
E
TIA
IV
S
A
P
-
G
ARGAK
V
L
-
NNSGVS
V
D
WQ
G
NA
V
V
P
YL
S
I
YR
E
N
D
V
S
I
R
SETLNDSVD
M
NS
A
FQTIVPTR
GA
V
VRAH
F
D
T
RV
G
YRVLMTLIRQN
G
VSV
PFG
ATATLV
-
SD
TTEQIS
GIV
G
E
D
G
QL
Y
I
S
G
MPKTGN
-
VK
I
V
WG
KDTSQQ
C
VAKYELPVEEKNSG
I
ISVT
A
N
C
Q
fig|574521.7.peg.3883
Escherichia coli O127:H6 str. E2348/69
MS
W
M
V
VSRTYTSFFPFSLSV
---
LAL---TVAGSFSATAGK
FN
PRF
L
-EDTAGI
N
Q
H
V
DLS
MYETDH
G
AQL
PG
T
Y
R
V
S
L
IV
N
EQKME
--
TRT
L
EFK
AA
T
ESQRKEMGE
F
LIP
C
L
S
RTQ
L
ADM
G
V
RVDSF
-------
SALNLI
--
PAEA
C
V
-
AFNEIIPQATSHF
D
FSEQK
L
V
M
S
F
-
PQA
A
M
QQVAR
G
T
V
P
E
SR
WD
-----
D
G
V
PALLL
D
Y
S
FSGSNSSHD-T
K
S
YN
RYIDENGNHHQDKNET
SQTNDSYYLSMR
SG
L
N
L
G
A
WRLRN
YSNWS------YSN
--------
-GEKQWDNIGTYVT
R
AIVP
L
KAQ
L
T
LGD
TA
T
P
S
DIFDS
VQMR
G
AL
L
A
SD
EE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RAF
V
Q
PG
A
F
E
I
N
DL
YPTANS
GDL
T
V
I
V
K
E
A
DG
SEQR
F
IQ
P
F
SS
V
A
V
F
Q
R
E
G
HL
K
Y
SL
S
A
G
EYRA-GNYDSANPR
F
GQFNAMY
G
LPFGM
T
T
YGG
A
-
LISKD
Y
N
A
FAL
G
L
G
KNFGSI
GA
I
S
V
D
I
T
Q
A
K
S
T
L
N
-----
NNATD
-
Q
G
Q
S
Y
R
FL
Y
S
K
SF-
-
ASG
T
DFR
L
LG
Y
K
YS
TSG
F
Y
T
FQ
E
AT---
-
-
--
DVRSGADSDYG-----------RYHKR-
-
SEIQG
N
LT
Q
Q
L
GT-YG
S
VYF
N
MTQ
Q
D
YW
ND
D
GKRL
-
SLAT
G
YN
---
GRIGR
V
NY
S
I
A
YSWNKSPE
-----
WDEN
D
QLWSFN
I
SIP
-----
--
-------
-
FGRAWSNYRVTTDQDGRTIQQLGVN
G
TLLEDRN
-
L
SY
N
V
QE
G
YSSNGVGNSGNAS---LA
Y
QGGA
G
NISVG
YS
YG
--
KD
-
-YQQTNYSL
R
GG
IVAHSE
G
I
S
L
--
SQPLG
E
TIA
IV
S
A
P
-
G
ARGAK
V
L
-
NNSGVS
V
D
WQ
G
NA
V
V
P
YL
S
I
YR
E
N
D
V
S
I
R
SETLNDSVD
M
NS
A
FQTIVPTR
GA
V
VRAH
F
D
T
RV
G
YRVLMTLIRQN
G
VSV
PFG
ATATLV
-
SD
TTEQIS
GIV
G
E
D
G
QL
Y
I
S
G
MPKTGN
-
VK
I
V
WG
KDTSQQ
C
VAKYELPVEEKNSG
I
ISVT
A
N
C
Q
fig|670897.3.peg.3790
Escherichia coli 2362-75 (1-843/853)
MS
W
M
V
VSRTYTSFFPFSLSV
---
LAL---TVAGSFSATAGK
FN
PRF
L
-EDTAGI
N
Q
H
V
DLS
MYETDH
G
AQL
PG
T
Y
R
V
S
L
IV
N
EQKME
--
TRT
L
EFK
AA
T
ESQRKEMGE
F
LIP
C
L
S
RTQ
L
ADM
G
V
RVDSF
-------
SALNLI
--
PAEA
C
V
-
AFNEIIPQATSHF
D
FSEQK
L
V
M
S
F
-
PQA
A
M
QQVAR
G
T
V
P
E
SR
WD
-----
D
G
V
PALLL
D
Y
S
FSGSNSSHD-T
K
S
YN
RYIDENGNHHQDKNET
SQTNDSYYLSMR
SG
L
N
L
G
A
WRLRN
YSNWS------YSN
--------
-GEKQWDNIGTYVT
R
AIVP
L
KAQ
L
T
LGD
TA
T
P
S
DIFDS
VQMR
G
AL
L
A
SD
EE
MLP
DSQR
GFAP
V
V
R
GIA
KS
N
-
A
E
V
S
I
E
QNG
YV
IY
RAF
V
Q
PG
A
F
E
I
N
DL
YPTANS
GDL
T
V
I
V
K
E
A
DG
SEQR
F
IQ
P
F
SS
V
A
V
F
Q
R
E
G
HL
K
Y
SL
S
A
G
EYRA-GNYDSANPR
F
GQFNAMY
G
LPFGM
T
T
YGG
A
-
LISKD
Y
N
A
FAL
G
L
G
KNFGSI
GA
I
S
V
D
I
T
Q
A
K
S
T
L
N
-----
NNATD
-
Q
G
Q
S
Y
R
FL
Y
S
K
SF-
-
ASG
T
DFR
L
LG
Y
K
YS
TSG
F
Y
T
FQ
E
AT---
-
-
--
DVRSGADSDYG-----------RYHKR-
-
SEIQG
N
LT
Q
Q
L
GT-YG
S
VYF
N
MTQ
Q
D
YW
ND
D
GKRL
-
SLAT
G
YN
---
GRIGR
V
NY
S
I
A
YSWNKSPE
-----
WDEN
D
QLWSFN
I
SIP
-----
--
-------
-
FGRAWSNYRVTTDQDGRTIQQLGVN
G
TLLEDRN
-
L
SY
N
V
QE
G
YSSNGVGNSGNAS---LA
Y
QGGA
G
NISVG
YS
YG
--
KD
-
-YQQTNYSL
R
GG
IVAHSE
G
I
S
L
--
SQPLG
E
TIA
IV
S
A
P
-
G
ARGAK
V
L
-
NNSGVS
V
D
WQ
G
NA
V
V
P
YL
S
I
YR
E
N
D
V
S
I
R
SETLNDSVD
M
NS
A
FQTIVPTR
GA
V
VRAH
F
D
T
RV
G
YRVLMTLIRQN
G
VSV
PFG
ATATLV
-
SD
TTEQIS
GIV
G
E
D
G
QL
Y
I
S
G
MPKTGN
-
VK
I
V
WG
KDTSQQ
C
VAKYELP
fig|331112.3.peg.4253
Escherichia coli HS (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPERQQQL
L
TQLS
A
E
C
fig|550672.3.peg.4626
Escherichia coli B088 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|679205.4.peg.3587
Escherichia coli MS 124-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749533.3.peg.3126
Escherichia coli MS 84-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.375.peg.2765
Escherichia coli EC4100B (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|749533.3.peg.3127
Escherichia coli MS 84-1 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.375.peg.2764
Escherichia coli EC4100B (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|749545.3.peg.4255
Escherichia coli MS 182-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|679207.4.peg.3440
Escherichia coli MS 107-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSIIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|331112.6.peg.4426
Escherichia coli HS (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNRSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPERQQQL
L
TQLS
A
E
C
fig|679207.4.peg.3439
Escherichia coli MS 107-1 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSIIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749547.3.peg.1081
Escherichia coli MS 187-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASI
-------
SGMNLL
--
ADDA
C
V
-
PLTAMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTIWSYNSSDRSSG
--------
-SKNKWQHINTXLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGIY
G
TLLEDND
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340185.3.peg.2053
Escherichia coli E22 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTTMIQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|585034.4.peg.4402
Escherichia coli IAI1 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QTPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSIIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|585034.5.peg.4397
Escherichia coli IAI1 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QTPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSIIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|595495.4.peg.2118
Escherichia coli KO11 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAVGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|566546.3.peg.4960
Escherichia coli W (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAVGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|595495.4.peg.2117
Escherichia coli KO11 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAVGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|566546.3.peg.4959
Escherichia coli W (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAVGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|566546.4.peg.4613
Escherichia coli W (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAVGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|409438.11.peg.4774
Escherichia coli SE11 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
H
RYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656408.3.peg.4795
Escherichia coli H591 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QTPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--NHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPERQQQL
L
TQLS
A
E
C
fig|679206.4.peg.3535
Escherichia coli MS 119-7 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QTPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--NHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPERQQQL
L
TQLS
A
E
C
fig|656380.3.peg.59
Escherichia coli FVEC1412 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656419.3.peg.49
Escherichia coli M718 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749540.3.peg.3242
Escherichia coli MS 146-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749549.3.peg.189
Escherichia coli MS 198-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|573235.3.peg.5702
Escherichia coli O26:H11 str. 11368 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656408.3.peg.4796
Escherichia coli H591 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QTPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--NHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPERQQQL
L
TQLS
A
E
C
fig|679206.4.peg.3536
Escherichia coli MS 119-7 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QTPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--NHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPERQQQL
L
TQLS
A
E
C
fig|469008.4.peg.3849
Escherichia coli BL21(DE3) (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749549.3.peg.188
Escherichia coli MS 198-1 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|585056.7.peg.5087
Escherichia coli UMN026 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749531.3.peg.2134
Escherichia coli MS 69-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGLFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NAYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RVQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
S
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|679204.3.peg.2710
Escherichia coli MS 145-7 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
VQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|585057.4.peg.4936
Escherichia coli IAI39 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|585057.6.peg.4945
Escherichia coli IAI39 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|753642.3.peg.3754
Escherichia coli NC101 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGTL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749545.3.peg.4254
Escherichia coli MS 182-1 (9-855/856)
VVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|701177.3.peg.5123
Escherichia coli O55:H7 str. CB9615 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749527.3.peg.2986
Escherichia coli MS 21-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFS---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQYINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|316401.4.peg.5266
Escherichia coli ETEC H10407 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASI
-------
SGMNLL
--
ADDA
C
V
-
PLTAMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGIY
G
TLLEDND
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|550677.3.peg.285
Escherichia coli B354 (1-862/863)
M
H
I
R
KHRLAGLFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
T
V
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|679204.3.peg.2711
Escherichia coli MS 145-7 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
VQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
IANYQLPPESQQQL
L
TQLS
A
E
C
fig|749548.3.peg.575
Escherichia coli MS 196-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
V
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|405955.9.peg.4103
Escherichia coli APEC O1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749527.3.peg.2985
Escherichia coli MS 21-1 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFS---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQYINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|869729.3.peg.4787
Escherichia coli UM146 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.376.peg.833
Escherichia coli WV_060327 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|753642.3.peg.3753
Escherichia coli NC101 (9-855/856)
FVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGTL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|316401.4.peg.5267
Escherichia coli ETEC H10407 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASI
-------
SGMNLL
--
ADDA
C
V
-
PLTAMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGIY
G
TLLEDND
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.371.peg.3831
Escherichia coli 1044A (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.373.peg.2372
Escherichia coli 1125A (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.372.peg.3225
Escherichia coli 1212A (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.374.peg.2997
Escherichia coli 536A (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|83334.1.peg.5256
Escherichia coli O157:H7 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444448.5.peg.2574
Escherichia coli O157:H7 str. EC4045 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|478005.5.peg.787
Escherichia coli O157:H7 str. EC4486 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|386585.9.peg.5512
Escherichia coli O157:H7 str. Sakai (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|216593.1.peg.3674
Escherichia coli E2348/69 (1-877/878)
MSYLNLR
I
YQRN
TQCL
H
I
R
KHRLAVFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
S
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
A
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|574521.7.peg.4728
Escherichia coli O127:H6 str. E2348/69 (1-877/878)
MSYLNLR
I
YQRN
TQCL
H
I
R
KHRLAVFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
S
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
A
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|199310.1.peg.5287
Escherichia coli CFT073 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749546.3.peg.2188
Escherichia coli MS 185-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749528.3.peg.1496
Escherichia coli MS 45-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340185.4.peg.2177
Escherichia coli E22 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTTMIQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|585395.4.peg.5359
Escherichia coli O103:H2 str. 12009 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTTMIQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|478008.5.peg.334
Escherichia coli O157:H7 str. EC869 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
NN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|570506.3.peg.2814
Escherichia coli O157:H7 str. FRIK966 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
NN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.373.peg.2371
Escherichia coli 1125A (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.374.peg.2996
Escherichia coli 536A (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444454.5.peg.4364
Escherichia coli O157:H7 str. EC4024 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444449.5.peg.3817
Escherichia coli O157:H7 str. EC4042 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444450.8.peg.5654
Escherichia coli O157:H7 str. EC4115 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444447.5.peg.2740
Escherichia coli O157:H7 str. EC4206 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|478004.5.peg.1673
Escherichia coli O157:H7 str. EC4401 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|544404.4.peg.5464
Escherichia coli O157:H7 str. TW14359 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|502346.5.peg.1358
Escherichia coli O157:H7 str. TW14588 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656444.3.peg.201
Escherichia coli TA280 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHLLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
AGM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTTMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIVTQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
N
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|550676.3.peg.4559
Escherichia coli B185 (9-855/856)
VVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|637388.3.peg.3744
Escherichia coli O157:H7 str. FRIK2000 (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
NN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.371.peg.3832
Escherichia coli 1044A (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444452.5.peg.2532
Escherichia coli O157:H7 str. EC4113 (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|444451.5.peg.1637
Escherichia coli O157:H7 str. EC4196 (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|478006.5.peg.1105
Escherichia coli O157:H7 str. EC4501 (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|478007.5.peg.723
Escherichia coli O157:H7 str. EC508 (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749537.3.peg.336
Escherichia coli MS 115-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NSGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQYINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
A
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMNNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656393.3.peg.511
Escherichia coli H299 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
S
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
N
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|83333.1.peg.4227
Escherichia coli K12 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKTR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|316407.3.peg.4145
Escherichia coli W3110 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKTR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656437.3.peg.4826
Escherichia coli TA143 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QMADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749537.3.peg.337
Escherichia coli MS 115-1 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NSGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQYINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
A
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMNNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656440.3.peg.4941
Escherichia coli TA206 (9-855/856)
FVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|595496.3.peg.4403
Escherichia coli BW2952 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKTR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|536056.3.peg.3898
Escherichia coli DH1 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKTR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|511145.12.peg.4458
Escherichia coli str. K-12 substr. MG1655 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKTR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|511145.6.peg.4437
Escherichia coli str. K-12 substr. MG1655 (1-862/863)
M
H
I
R
KHRLAGFFVRLVVAC
---
AFA---A-QAPLSSADLY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKTR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|155864.1.peg.5241
Escherichia coli O157:H7 EDL933 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
X
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656444.3.peg.202
Escherichia coli TA280 (1-862/863)
M
H
I
R
KHLLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
AGM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTTMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIVTQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
N
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656437.3.peg.4827
Escherichia coli TA143 (9-855/856)
FVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QMADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656393.3.peg.512
Escherichia coli H299 (9-855/856)
FVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
S
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
N
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|155864.8.peg.5227
Escherichia coli O157:H7 EDL933 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---V-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
AGMNLL
--
ADDA
C
V
-
PLTTMVQDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-SS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLARN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
X
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|431946.3.peg.4438
Escherichia coli SE15 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASI
-------
SGMNLL
--
ADDA
C
V
-
PLTAMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDRSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGIY
G
TLLEDND
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340186.3.peg.3345
Escherichia coli E110019 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
S
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
ILAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|685038.3.peg.4444
Escherichia coli O83:H1 str. NRG 857C (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-RKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|405955.13.peg.4869
Escherichia coli APEC O1 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|714962.3.peg.4844
Escherichia coli IHE3034 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|869729.3.peg.4788
Escherichia coli UM146 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|364106.7.peg.4881
Escherichia coli UTI89 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|364106.8.peg.4882
Escherichia coli UTI89 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|562.376.peg.834
Escherichia coli WV_060327 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340186.5.peg.3489
Escherichia coli E110019 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
S
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
ILAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|344601.3.peg.798
Escherichia coli B171 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TL
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656417.3.peg.5516
Escherichia coli M605 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
RHRLAGFFVRLFVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLADNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|199310.4.peg.5054
Escherichia coli CFT073 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749546.3.peg.2189
Escherichia coli MS 185-1 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749528.3.peg.1497
Escherichia coli MS 45-1 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656417.3.peg.5517
Escherichia coli M605 (9-855/856)
FVAC
---
AFT---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLADNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340184.3.peg.2246
Escherichia coli B7A (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
E
L
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656443.3.peg.12
Escherichia coli TA271 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
E
L
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSNSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|362663.8.peg.4671
Escherichia coli 536 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|362663.9.peg.4687
Escherichia coli 536 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340197.3.peg.754
Escherichia coli F11 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340197.5.peg.782
Escherichia coli F11 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749550.3.peg.3880
Escherichia coli MS 200-1 (1-877/878)
MSYLNLRL
YQRN
TQCL
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|656443.3.peg.13
Escherichia coli TA271 (1-862/863)
M
H
I
R
KHRLAGFFVRLFVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
E
L
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSNSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|670888.3.peg.477
Escherichia coli 1827-70 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
S
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
ILAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|585035.6.peg.4836
Escherichia coli S88 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
S
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
ILAHAN
G
V
TL
--
GQPLN
E
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|344601.5.peg.829
Escherichia coli B171 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TL
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|340184.6.peg.2357
Escherichia coli B7A (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
E
L
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLVGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|749550.3.peg.3881
Escherichia coli MS 200-1 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
TF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAQL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRLR
D
NTSWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGVY
G
TLLEDNN
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKQLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
A
E
C
fig|413997.3.peg.991
Escherichia coli B str. REL606 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
G
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNFGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|511693.5.peg.1015
Escherichia coli BL21 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
G
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNFGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|469008.4.peg.2746
Escherichia coli BL21(DE3) (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
G
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNFGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|749547.3.peg.1895
Escherichia coli MS 187-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNFGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|550672.3.peg.1218
Escherichia coli B088 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|344601.3.peg.2658
Escherichia coli B171 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|340185.3.peg.3000
Escherichia coli E22 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|656408.3.peg.1008
Escherichia coli H591 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|679206.4.peg.3927
Escherichia coli MS 119-7 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|656443.3.peg.1236
Escherichia coli TA271 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|670897.3.peg.1728
Escherichia coli 2362-75 (9-855/856)
FVAC
---
AFA---A-QAPLSSAELY
FN
PRF
L
ADDPQAV
---
A
DLS
RFENGQ
-
ELP
PG
T
Y
R
VDI
YL
N
NGYMA
--
TRD
V
SF-
---
-NTGDSEQG
-
IVP
C
L
T
RAQ
L
ASM
GL
NTASV
-------
SGMNLL
--
ADDA
C
V
-
PLTSMIHDATAHL
D
VGQQR
L
N
LT
I
-
PQA
F
M
SNRAR
GY
I
PP
EL
WD
-----
P
GI
NAGLL
NYN
FSG-NSVQNRI
--
-G
G
---------------
N--SHYAYLNLQ
SG
L
N
I
G
A
WRL
C
D
NTTWSYNSSDSSSG
--------
-SKNKWQHINTWLE
R
DIIP
L
RSR
L
T
LGD
GY
T
Q
GDIFD
G
INFR
G
AQ
L
A
SD
DN
MLP
DSQR
GFAP
V
I
H
GIA
HG
T
-
A
Q
VTI
K
QNG
YD
IY
NST
VPPGPF
T
I
N
D
I
YAAGNS
GDL
Q
V
T
I
K
E
A
DG
STQI
F
TV
P
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
G
EYRS-GNAQQEKPR
F
FQSTLLH
G
LPAGW
T
I
YGG
T
-
QLADR
Y
R
A
FNF
G
I
G
KNMGAL
GA
L
S
V
D
M
T
Q
A
N
S
T
L
P
-----
DDSQH
-
D
G
Q
S
V
R
FL
Y
N
K
SLN
-
ESG
T
NIQ
L
VG
YRYS
TSG
Y
F
N
FA
D
TTYSR
M
N
GY
NIETQ-DGVIQVKPKFTDYYNLAYNKR-
-
GKLQL
T
VT
Q
Q
L
GR-TS
T
LYL
S
GSH
Q
T
YW
GT
S
NVDE
-
QFQA
G
LN
---
TAFED
I
NW
T
L
S
YSLTKNAW
-----
QKGR
D
QMLALN
V
N
IP
FSHWL
--
RSDSKSQ
-
WRHASASYSMSHDLNGRMTNLAGIY
G
TLLEDND
-
L
SY
S
V
QT
G
YAGGGDGNSGSTGYATLN
Y
RGGY
G
NANIG
YS
HS
--
DD
-
-IKRLYYGV
SGG
VLAHAN
G
V
TL
--
GQPLN
D
TVV
L
V
K
A
P
-
G
AKDAK
V
E
-
NQTGVR
TD
WR
G
YA
V
L
P
YA
T
E
YR
E
N
R
V
A
LD
TNTLADNVD
L
DN
A
VANVVPTR
GAI
VRAE
F
K
A
RV
G
IKLLMTL-THN
N
KPL
PFG
AMVTSE
---
-SSQSS
GIV
A
D
N
G
QV
YL
S
G
MPLAGK
-
VQ
V
K
WG
EEENAH
C
VANYQLPPESQQQL
L
TQLS
V
E
C
fig|331111.3.peg.3545
Escherichia coli E24377A (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVN
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|585055.6.peg.1006
Escherichia coli 55989 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|340184.3.peg.2043
Escherichia coli B7A (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|562.375.peg.4599
Escherichia coli EC4100B (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|595495.4.peg.2284
Escherichia coli KO11 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|679207.4.peg.4122
Escherichia coli MS 107-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|749545.3.peg.3339
Escherichia coli MS 182-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|749532.3.peg.2022
Escherichia coli MS 78-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|566546.3.peg.1829
Escherichia coli W (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|340184.3.peg.3201
Escherichia coli B7A (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
H
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTWENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
NGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|340184.6.peg.3341
Escherichia coli B7A (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
H
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTWENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
NGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|679204.3.peg.4273
Escherichia coli MS 145-7 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
H
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTWENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
NGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|595496.3.peg.873
Escherichia coli BW2952 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|83333.1.peg.925
Escherichia coli K12 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|749538.3.peg.2986
Escherichia coli MS 116-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|749548.3.peg.1448
Escherichia coli MS 196-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|316407.3.peg.906
Escherichia coli W3110 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|316385.5.peg.1010
Escherichia coli str. K-12 substr. DH10B (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|511145.6.peg.967
Escherichia coli str. K-12 substr. MG1655 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|550676.3.peg.1341
Escherichia coli B185 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
V
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749537.3.peg.4485
Escherichia coli MS 115-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|656419.3.peg.763
Escherichia coli M718 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGIG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656414.3.peg.1160
Escherichia coli H736 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
I
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|749540.3.peg.134
Escherichia coli MS 146-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
HLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
I
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IS
V
R
WG
EAPDQI
C
HINYELTEQQINSA
I
TRMD
A
I
C
fig|562.371.peg.2802
Escherichia coli 1044A (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|562.373.peg.3190
Escherichia coli 1125A (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|562.372.peg.1534
Escherichia coli 1212A (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|562.374.peg.5441
Escherichia coli 536A (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|83334.1.peg.666
Escherichia coli O157:H7 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444454.5.peg.5078
Escherichia coli O157:H7 str. EC4024 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444449.5.peg.5414
Escherichia coli O157:H7 str. EC4042 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444448.5.peg.3289
Escherichia coli O157:H7 str. EC4045 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444453.5.peg.630
Escherichia coli O157:H7 str. EC4076 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444452.5.peg.897
Escherichia coli O157:H7 str. EC4113 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444450.8.peg.749
Escherichia coli O157:H7 str. EC4115 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444451.5.peg.2250
Escherichia coli O157:H7 str. EC4196 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|444447.5.peg.3463
Escherichia coli O157:H7 str. EC4206 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|478004.5.peg.1457
Escherichia coli O157:H7 str. EC4401 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|478005.5.peg.2487
Escherichia coli O157:H7 str. EC4486 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|478006.5.peg.857
Escherichia coli O157:H7 str. EC4501 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|478007.5.peg.947
Escherichia coli O157:H7 str. EC508 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|478008.5.peg.1689
Escherichia coli O157:H7 str. EC869 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|637388.3.peg.1084
Escherichia coli O157:H7 str. FRIK2000 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|570506.3.peg.3621
Escherichia coli O157:H7 str. FRIK966 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|386585.9.peg.701
Escherichia coli O157:H7 str. Sakai (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|544404.4.peg.614
Escherichia coli O157:H7 str. TW14359 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|502346.5.peg.609
Escherichia coli O157:H7 str. TW14588 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|679205.4.peg.971
Escherichia coli MS 124-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
C
D
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|749533.3.peg.968
Escherichia coli MS 84-1 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
V
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
C
D
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
V
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
IPL
R
FG
AIATLD
G
IQ
TNS---
GI
I
D
D
D
G
SL
Y
M
S
G
LPAQGA
-
IT
V
R
WG
EAPDQI
C
HISYQLTEQQINSA
I
TRMD
A
I
C
fig|656414.3.peg.733
Escherichia coli H736 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTVLG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|679206.4.peg.14
Escherichia coli MS 119-7 (24-864/869)
SLA---IXPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|155864.1.peg.1123
Escherichia coli O157:H7 EDL933 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
A
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
SE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
S
R
SNDSYTS
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGVF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
ISL
R
FG
AIATLD
GV
Q
TNS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IT
V
R
WG
EAPDQI
C
HISYELTEQQINSA
I
TRMD
A
I
C
fig|413997.3.peg.514
Escherichia coli B str. REL606 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
H
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|550672.3.peg.65
Escherichia coli B088 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|511693.5.peg.520
Escherichia coli BL21 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
H
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|469008.4.peg.3231
Escherichia coli BL21(DE3) (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
H
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656408.3.peg.477
Escherichia coli H591 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749547.3.peg.1388
Escherichia coli MS 187-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
H
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|637912.3.peg.683
Escherichia coli OP50 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
H
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656443.3.peg.792
Escherichia coli TA271 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|679207.4.peg.4494
Escherichia coli MS 107-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
R
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDELIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|481805.3.peg.3310
Escherichia coli ATCC 8739 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|481805.6.peg.3298
Escherichia coli ATCC 8739 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585055.6.peg.558
Escherichia coli 55989 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
LI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SLN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585055.8.peg.559
Escherichia coli 55989 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
LI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SLN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|1040638.4.peg.5238
Escherichia coli O104:H4 str. LB226692 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
LI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|701177.3.peg.653
Escherichia coli O55:H7 str. CB9615 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
G
V
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-SNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|6666666.5357.peg.236
Escherichia coli TY-2482 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
LI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNKGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|409438.11.peg.685
Escherichia coli SE11 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMS
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRV--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656437.3.peg.630
Escherichia coli TA143 (24-864/869)
SLA---ISPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|216592.1.peg.293
Escherichia coli 042 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQRLQQA
V
TVIS
A
V
C
fig|216592.3.peg.603
Escherichia coli 042 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQRLQQA
V
TVIS
A
V
C
fig|155864.1.peg.589
Escherichia coli O157:H7 EDL933 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVXWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|155864.8.peg.603
Escherichia coli O157:H7 EDL933 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTFV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVXWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|358709.5.peg.2019
Escherichia coli 101-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
F
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
H
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585034.4.peg.532
Escherichia coli IAI1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
T
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585034.5.peg.531
Escherichia coli IAI1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
T
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749537.3.peg.4076
Escherichia coli MS 115-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
T
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSKQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|340186.3.peg.810
Escherichia coli E110019 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMS
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDELIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|340186.5.peg.845
Escherichia coli E110019 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMS
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDELIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749540.3.peg.4794
Escherichia coli MS 146-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTVLG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|550677.3.peg.978
Escherichia coli B354 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749531.3.peg.1514
Escherichia coli MS 69-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656444.3.peg.1029
Escherichia coli TA280 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749527.3.peg.4959
Escherichia coli MS 21-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGKXXWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656379.3.peg.1027
Escherichia coli FVEC1302 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|656380.3.peg.785
Escherichia coli FVEC1412 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749549.3.peg.1409
Escherichia coli MS 198-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585056.7.peg.781
Escherichia coli UMN026 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|562.371.peg.2090
Escherichia coli 1044A (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
A
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
P
X
XR
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
SE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
S
R
SNDSYTS
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGVF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
ISL
R
FG
AIATLD
GV
Q
TNS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IT
V
R
WG
EAPDQI
C
HISYELTEQQINSA
I
TRMD
A
I
C
fig|562.373.peg.1072
Escherichia coli 1125A (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
A
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
P
X
XR
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
SE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
S
R
SNDSYTS
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGVF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
ISL
R
FG
AIATLD
GV
Q
TNS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IT
V
R
WG
EAPDQI
C
HISYELTEQQINSA
I
TRMD
A
I
C
fig|562.372.peg.938
Escherichia coli 1212A (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
A
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
P
X
XR
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
SE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
S
R
SNDSYTS
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGVF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
ISL
R
FG
AIATLD
GV
Q
TNS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IT
V
R
WG
EAPDQI
C
HISYELTEQQINSA
I
TRMD
A
I
C
fig|562.374.peg.2472
Escherichia coli 536A (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VEITPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
A
-
PLAEIIPDASVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
P
X
XR
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQATLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
V
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
SE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
S
R
SNDSYTS
K
KNYAWMTSNTSIDNEGHTTQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGVF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-SKQ
G
ISL
R
FG
AIATLD
GV
Q
TNS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IT
V
R
WG
EAPDQI
C
HISYELTEQQINSA
I
TRMD
A
I
C
fig|656419.3.peg.1242
Escherichia coli M718 (2-865/866)
YRTH
R
Q
H
SLLSSGGVPSFIGGL
---
VVF---VSAAFNAQAETW
F
D
PAF
F
KDDPSMV
---
A
DLS
RFEKGQ
-
KIT
PG
V
Y
R
VDI
VL
N
QTIVD
--
TRN
V
NF-
---
-VELTPEKG
-
IAA
C
L
T
TES
L
DAM
G
V
NTDAF
-------
PAFKQL
--
DKQA
C
A
-
PLAEIIPDARVTF
N
VNKLR
L
E
IS
V
-
PQ
I
A
I
KSNAR
GY
V
PP
ER
WD
-----
E
GI
NALLL
G
Y
S
FSGANSIHSSA
D
S
DS
G
---------------
D----SYFLNLN
SG
V
N
L
G
P
WRLRN
NSTWS-----RSSG
--------
-QTAEWKNLSSYLQ
R
AVIP
L
KGE
L
T
V
GD
DY
T
A
GD
F
FDS
VSFR
G
VQ
L
A
SD
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
N
-
A
Q
I
TI
K
QNG
YT
IY
QTY
V
S
P
V
A
F
E
I
S
DL
YSTSSS
GDL
L
V
E
I
K
E
A
DG
SVNS
Y
SV
P
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
KYRT-NSNEQQESK
F
AQVTLQW
G
GPWGT
T
W
YGG
G
-
QYAEY
Y
R
A
AMF
G
L
G
FNLGDF
GA
I
S
F
D
A
T
Q
A
K
S
T
L
A
-----
DQSEH
-
K
G
Q
S
Y
R
FL
Y
A
K
TLN
-
QLG
T
NFQ
L
MG
YRYS
TSG
F
Y
T
LS
D
TMYKH
M
D
GY
EFNDG--DDED-TPMWSRYYNLFYTKR-
-
GKLQV
N
IS
Q
Q
L
GE-YG
S
FYL
S
GSQ
Q
T
YW
HT
D
QQDR
-
LLQF
G
YN
---
TQIKD
L
SL
G
I
S
WNYSKSRG
-----
QPDA
D
QVFALN
F
S
L
P
LNLLL
P
R
SNDSYTR
K
KNYAWMTSNTSIDNEGHITQNLGLT
E
TLLDDGN
-
L
SY
S
V
QQ
G
YNSEGKTANGSAS---MD
Y
KGAF
A
DARVG
Y
N
YS
--
DN
G
SQQQLNYAL
SG
S
LVAHSQ
G
I
TL
--
GQSLG
E
TNV
LI
A
A
P
-
G
AENTR
V
A
-
NSTGLK
TD
WR
G
YT
V
V
P
YA
T
S
YR
E
N
R
I
A
LD
AASLKRNVD
L
EN
A
VVNVVPTK
GA
L
VLAE
F
N
A
HA
G
ARVLMKT-TKQ
G
IPL
R
FG
AIATLD
GV
Q
ANS---
GI
I
D
D
D
G
SL
Y
M
A
G
LPAKGT
-
IT
V
R
WG
EAPDQI
C
HISYELTEQQINSA
I
TRMD
A
I
C
fig|573235.3.peg.574
Escherichia coli O26:H11 str. 11368 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
S
L
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
G
T
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|331112.3.peg.566
Escherichia coli HS (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
S
L
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
G
T
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
L
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|331112.6.peg.592
Escherichia coli HS (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
S
L
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
G
T
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
L
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|595496.3.peg.454
Escherichia coli BW2952 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|536056.3.peg.3256
Escherichia coli DH1 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|83333.1.peg.529
Escherichia coli K12 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749538.3.peg.2524
Escherichia coli MS 116-1 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749544.3.peg.277
Escherichia coli MS 175-1 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749548.3.peg.1617
Escherichia coli MS 196-1 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|316407.3.peg.515
Escherichia coli W3110 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|316385.5.peg.487
Escherichia coli str. K-12 substr. DH10B (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|316385.7.peg.494
Escherichia coli str. K-12 substr. DH10B (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|511145.12.peg.553
Escherichia coli str. K-12 substr. MG1655 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|511145.6.peg.546
Escherichia coli str. K-12 substr. MG1655 (24-862/867)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DASRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
E--YDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
VS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
L
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|595495.4.peg.4191
Escherichia coli KO11 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
LI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|566546.3.peg.4370
Escherichia coli W (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
LI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QATY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749545.3.peg.3674
Escherichia coli MS 182-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
R
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
A
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749532.3.peg.2527
Escherichia coli MS 78-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
R
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
A
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585396.4.peg.585
Escherichia coli O111:H- str. 11128 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
S
L
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
G
T
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585057.4.peg.528
Escherichia coli IAI39 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
W
S
LRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|585057.6.peg.527
Escherichia coli IAI39 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
VNTGDKSGG
-
LMP
C
F
N
QAL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
W
S
LRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGGG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|679205.4.peg.243
Escherichia coli MS 124-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
R
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEF-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
A
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPFG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|749533.3.peg.1261
Escherichia coli MS 84-1 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
R
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEF-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
VR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
A
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPFG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|344610.3.peg.1538
Escherichia coli 53638 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
S
L
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
S
Q
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
G
T
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
NPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|344610.7.peg.1141
Escherichia coli 53638 (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
G
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
S
L
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
S
Q
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
G
T
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRYS
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
NPDN
E
RIVGLN
VS
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
G
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
R
KPV
PFG
SLVREN
---
-STGIT
S
M
V
G
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|562.375.peg.3948
Escherichia coli EC4100B (24-864/869)
SLA---ILPSFLSYAESY
FN
PAF
L
LENGTSV
---
A
DLS
RFERGN
-
HQP
A
R
V
Y
R
VD
L
WR
N
DEFIG
--
SQD
I
VFE
STT
ENTGDKSGG
-
LMP
C
F
N
QVL
L
ERI
GL
NSSAF
-------
PELAQQ
--
QNNK
C
I
-
NLLKAVPDATINF
D
FAAMR
L
N
I
T
I
-
PQ
I
A
L
LSSAH
GY
I
PP
EE
WD
-----
E
GI
PALLL
NYN
FTG-NRGN---
--
--
G
---------------
N--DSYFFSEL-
SG
I
N
I
G
P
WRLRN
NGSWNYFRG--NGY
--------
-HSEQWNNIGTWVQ
R
AIIP
L
KSE
L
V
M
GD
GN
T
G
S
DIFD
G
VGFR
G
IR
L
Y
S
S
DN
M
Y
P
DSQQ
GFAP
T
V
R
GIA
RT
A
-
A
Q
L
TI
R
QNG
FI
IY
QSY
V
S
PG
A
F
E
I
T
DL
HPTSSN
GDL
D
V
T
I
D
E
R
DG
NQQN
Y
TI
P
Y
S
T
V
P
I
L
Q
R
E
G
RF
KF
DL
T
A
G
DFRS-GNSQQSSPF
F
FQGTALG
G
LPQEF
T
A
YGG
T
-
QLSAN
Y
T
A
FLL
G
L
G
RNLGNW
GA
V
S
L
D
V
T
H
A
R
S
Q
L
A
-----
DDSRH
-
E
G
D
S
I
R
FL
Y
A
K
SMN
-
TFG
T
NFQ
L
MG
YRY
L
TQG
F
Y
T
LD
D
VAYRR
M
E
GY
EYDYDYDGEHRDEPIIVNYHNLRFSRK-
-
DRLQL
N
IS
Q
S
L
ND-FG
S
LYI
S
GTH
Q
K
YW
NT
S
DSDT
-
WYQV
G
YT
---
SSWVG
I
SY
S
L
S
FSWNESVG
-----
IPDN
E
RIVDLX
X
S
V
P
FNVLT
KR
RYTRENA
-
LDRAYASFNANRNSNGQNSWLAGVG
G
TLLEGHN
-
L
SY
H
V
SQ
G
DT----SNNGYTGSATAN
W
QAAY
A
TLGVG
Y
N
YD
--
RD
-
-QHDVNWQL
SGG
VVGHEN
G
I
TL
--
SQPLG
D
TNV
LI
K
A
P
-
G
AGGVR
I
E
-
NQTGIL
TD
WR
G
YA
V
M
P
YA
T
V
YR
Y
N
R
I
A
LD
TNTMGNSID
V
EK
N
ISSVVPTQ
GA
L
VRAN
F
D
T
RI
G
VRALITV-TQG
G
KPV
PFG
SLVREN
---
-SIGIT
S
M
X
X
D
D
G
QV
YL
S
G
APLSGE
-
LL
V
Q
WG
DGANSR
C
IAHYVLPKQSLQQA
V
TVIS
A
V
C
fig|685038.3.peg.3468
Escherichia coli O83:H1 str. NRG 857C (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
TA
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|362663.8.peg.3551
Escherichia coli 536 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|362663.9.peg.3563
Escherichia coli 536 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|525281.3.peg.2865
Escherichia coli 83972 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|199310.4.peg.3961
Escherichia coli CFT073 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|340197.5.peg.1688
Escherichia coli F11 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
I
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|656440.3.peg.3673
Escherichia coli TA206 (7-837/847)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|655817.3.peg.4018
Escherichia coli ABU 83972 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|199310.1.peg.4119
Escherichia coli CFT073 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|749546.3.peg.4044
Escherichia coli MS 185-1 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|749528.3.peg.3102
Escherichia coli MS 45-1 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|340197.3.peg.1597
Escherichia coli F11 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
I
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|749550.3.peg.3036
Escherichia coli MS 200-1 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HL
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
I
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|656393.3.peg.4487
Escherichia coli H299 (13-852/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AL
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELNNP
V
SYAT
L
E
C
fig|656417.3.peg.4275
Escherichia coli M605 (13-852/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYNLPETELNNP
V
SYAT
L
E
C
fig|656417.3.peg.4276
Escherichia coli M605 (24-863/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
V
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYNLPETELNNP
V
SYAT
L
E
C
fig|753642.3.peg.4527
Escherichia coli NC101 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|753642.3.peg.4526
Escherichia coli NC101 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDENQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|405955.13.peg.3851
Escherichia coli APEC O1 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|869729.3.peg.3684
Escherichia coli UM146 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|364106.8.peg.3853
Escherichia coli UTI89 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|364106.7.peg.3853
Escherichia coli UTI89 (7-837/847)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|405955.9.peg.3222
Escherichia coli APEC O1 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|714962.3.peg.3884
Escherichia coli IHE3034 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|585035.6.peg.3747
Escherichia coli S88 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKTLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
A
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
DN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|562.376.peg.2833
Escherichia coli WV_060327 (13-843/853)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKMLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
V
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
YN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|562.376.peg.2832
Escherichia coli WV_060327 (24-854/864)
LL-----------L
---
VIM---PACSI---AGMR
FN
PAF
L
SGDTEAV
---
A
DLS
RFEKGM
-
TYL
PG
S
Y
E
V
EV
WV
N
DSPLL
--
SRT
V
TF-
---
--KADDANQ
-
LIP
C
L
S
LAD
L
LSL
G
I
NKNAL
-------
PE-QAL
AS
SENS
C
L
-
DLRIWFPDVHYMP
E
LDAQR
L
K
LT
F
-
PQA
I
I
KRDAR
GY
I
PP
EQ
WD
-----
N
GI
TAFLL
NY
D
FSGNN------
--
DR
G
---------------
DYSSNNYYLNLR
A
G
I
N
I
G
A
WR
F
R
D
YSTWS-----RGSN
--------
-SAGKLEHISSTLQ
R
VIIP
F
RSE
L
T
LGD
TW
S
S
S
D
V
FDS
VSIR
G
IK
L
E
SD
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
-
A
Q
VTI
K
QNG
YV
IY
QTY
M
PPGPF
E
I
S
DL
NPTSSA
GDL
E
V
T
I
K
E
S
D
N
SETV
Y
TV
P
Y
A
A
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
QYRS-NSYNQKSPY
I
FQGELIW
G
LPWDI
T
A
YGG
A
-
QFSED
Y
R
A
LAL
G
L
G
LNLGVF
GA
T
S
F
D
V
T
Q
A
N
S
S
L
V
-----
DGSKH
-
Q
G
Q
S
Y
R
FL
Y
S
K
SLV
-
QTG
T
AFH
I
IG
YRYS
TQG
F
Y
T
LS
D
TTYQQ
M
S
G
T
VVDPKMLDDKDYVYNWNDFYNLRYSKR-
-
GKFQA
S
VS
Q
P
F
GN-YG
S
MYL
S
ASQ
Q
T
YW
NT
D
KKDS
-
LYQV
G
YN
---
TSIKG
I
YL
N
V
V
WNYSKSPG
-----
T-NA
D
KIVSLN
VS
L
P
ISNWL
SS
TNDGRSS
-
SNAMTATYGYSQDNHGQVNQYTGVS
G
SLLEQHN
-
L
SY
N
I
QH
G
FANQDNSSSGSVG---VN
Y
RGAY
G
SLNSA
YS
Y-
--
YN
E
GNQQINYGI
SG
A
LVVHEN
G
L
TL
--
SQPLG
E
TNV
LI
K
A
P
-
G
ANNVD
V
Q
-
RGTGIS
TD
WR
G
YA
V
V
P
YA
T
E
YR
R
N
N
I
S
LD
PMSMNMHTE
L
DI
T
STEVIPGK
GA
L
VRAE
F
A
A
HI
G
IRGLFTV-RYR
N
KSV
PFG
ATASAQ
-
I
K
NSSQIT
GIV
G
D
N
G
QL
YL
S
G
LPLEGV
-
IN
I
Q
WG
DGVQQK
C
QANYKLPETELDN
fig|562.371.peg.1697
Escherichia coli 1044A (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
DQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.373.peg.5043
Escherichia coli 1125A (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.372.peg.1181
Escherichia coli 1212A (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.374.peg.2402
Escherichia coli 536A (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|155864.1.peg.1958
Escherichia coli O157:H7 EDL933 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.371.peg.1698
Escherichia coli 1044A (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
DQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|386585.9.peg.2216
Escherichia coli O157:H7 str. Sakai (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
DQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.373.peg.5044
Escherichia coli 1125A (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.372.peg.1182
Escherichia coli 1212A (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.374.peg.2401
Escherichia coli 536A (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|155864.8.peg.1781
Escherichia coli O157:H7 EDL933 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444454.5.peg.1019
Escherichia coli O157:H7 str. EC4024 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444449.5.peg.344
Escherichia coli O157:H7 str. EC4042 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444448.5.peg.4703
Escherichia coli O157:H7 str. EC4045 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444453.5.peg.2899
Escherichia coli O157:H7 str. EC4076 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444452.5.peg.1917
Escherichia coli O157:H7 str. EC4113 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444450.8.peg.2162
Escherichia coli O157:H7 str. EC4115 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444451.5.peg.1910
Escherichia coli O157:H7 str. EC4196 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|444447.5.peg.5612
Escherichia coli O157:H7 str. EC4206 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|478004.5.peg.2888
Escherichia coli O157:H7 str. EC4401 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|478005.5.peg.2935
Escherichia coli O157:H7 str. EC4486 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|478006.5.peg.1903
Escherichia coli O157:H7 str. EC4501 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|478007.5.peg.2107
Escherichia coli O157:H7 str. EC508 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|478008.5.peg.3626
Escherichia coli O157:H7 str. EC869 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|637388.3.peg.1613
Escherichia coli O157:H7 str. FRIK2000 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|570506.3.peg.2984
Escherichia coli O157:H7 str. FRIK966 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|544404.4.peg.2025
Escherichia coli O157:H7 str. TW14359 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|502346.5.peg.5256
Escherichia coli O157:H7 str. TW14588 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
A
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|701177.3.peg.1859
Escherichia coli O55:H7 str. CB9615 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
LPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSTLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|550677.3.peg.2920
Escherichia coli B354 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKENS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
IY
QTT
VPPGPF
N
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
I
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSATSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|83333.1.peg.3089
Escherichia coli K12 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749544.3.peg.921
Escherichia coli MS 175-1 (24-860/861)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|316407.3.peg.3025
Escherichia coli W3110 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|595496.3.peg.3123
Escherichia coli BW2952 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|536056.3.peg.585
Escherichia coli DH1 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|316401.4.peg.3879
Escherichia coli ETEC H10407 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749538.3.peg.2179
Escherichia coli MS 116-1 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749548.3.peg.2778
Escherichia coli MS 196-1 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|316385.5.peg.3272
Escherichia coli str. K-12 substr. DH10B (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|316385.7.peg.3343
Escherichia coli str. K-12 substr. DH10B (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|511145.12.peg.3239
Escherichia coli str. K-12 substr. MG1655 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|511145.6.peg.3224
Escherichia coli str. K-12 substr. MG1655 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|562.371.peg.2454
Escherichia coli 1044A (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|562.373.peg.1928
Escherichia coli 1125A (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|562.374.peg.4725
Escherichia coli 536A (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|155864.8.peg.3956
Escherichia coli O157:H7 EDL933 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444454.5.peg.3080
Escherichia coli O157:H7 str. EC4024 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444449.5.peg.2539
Escherichia coli O157:H7 str. EC4042 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444453.5.peg.2715
Escherichia coli O157:H7 str. EC4076 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444452.5.peg.2276
Escherichia coli O157:H7 str. EC4113 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444450.8.peg.4375
Escherichia coli O157:H7 str. EC4115 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444451.5.peg.903
Escherichia coli O157:H7 str. EC4196 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|478004.5.peg.976
Escherichia coli O157:H7 str. EC4401 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|478005.5.peg.3466
Escherichia coli O157:H7 str. EC4486 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|478006.5.peg.2567
Escherichia coli O157:H7 str. EC4501 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|478007.5.peg.597
Escherichia coli O157:H7 str. EC508 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|637388.3.peg.5238
Escherichia coli O157:H7 str. FRIK2000 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|570506.3.peg.843
Escherichia coli O157:H7 str. FRIK966 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|544404.4.peg.4185
Escherichia coli O157:H7 str. TW14359 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|502346.5.peg.3229
Escherichia coli O157:H7 str. TW14588 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|701177.3.peg.3886
Escherichia coli O55:H7 str. CB9615 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|562.372.peg.2649
Escherichia coli 1212A (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|83334.1.peg.3996
Escherichia coli O157:H7 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|155864.1.peg.4025
Escherichia coli O157:H7 EDL933 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444448.5.peg.1292
Escherichia coli O157:H7 str. EC4045 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|444447.5.peg.1449
Escherichia coli O157:H7 str. EC4206 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|478008.5.peg.1346
Escherichia coli O157:H7 str. EC869 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|386585.9.peg.4201
Escherichia coli O157:H7 str. Sakai (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YNFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|413997.3.peg.3154
Escherichia coli B str. REL606 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|469008.4.peg.611
Escherichia coli BL21(DE3) (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749537.3.peg.2527
Escherichia coli MS 115-1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749547.3.peg.2081
Escherichia coli MS 187-1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|481805.3.peg.589
Escherichia coli ATCC 8739 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|481805.6.peg.585
Escherichia coli ATCC 8739 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSVNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|550676.3.peg.3301
Escherichia coli B185 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|340186.3.peg.395
Escherichia coli E110019 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|340186.5.peg.416
Escherichia coli E110019 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|331111.12.peg.3887
Escherichia coli E24377A (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|331111.3.peg.1301
Escherichia coli E24377A (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|679207.4.peg.2590
Escherichia coli MS 107-1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|585396.4.peg.4107
Escherichia coli O111:H- str. 11128 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|409438.11.peg.3599
Escherichia coli SE11 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|573235.3.peg.4351
Escherichia coli O26:H11 str. 11368 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
N
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749531.3.peg.771
Escherichia coli MS 69-1 (1-882/883)
M
TAFHAA
FKAY
R
M
H
QVLILPRFVRLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHL
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSATSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|550676.3.peg.1843
Escherichia coli B185 (10-870/871)
FARLTIAL
---
SLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-ITDDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLVDASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
FK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPR
F
IQGSLMH
G
LEENW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
G
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGSMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|656414.3.peg.3619
Escherichia coli H736 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
G
S
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749540.3.peg.1432
Escherichia coli MS 146-1 (26-862/863)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQQ
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
V
SA
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
G
S
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-DTGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|340197.3.peg.2942
Escherichia coli F11 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TIV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPAVSPGTL
L
NQQT
A
I
C
fig|340197.5.peg.3074
Escherichia coli F11 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TIV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPAVSPGTL
L
NQQT
A
I
C
fig|749550.3.peg.183
Escherichia coli MS 200-1 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TIV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPAVSPGTL
L
NQQT
A
I
C
fig|753642.3.peg.1648
Escherichia coli NC101 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
KI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPVVSPGTL
L
NQQT
A
I
C
fig|656444.3.peg.2305
Escherichia coli TA280 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IAGDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
I
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
N
GG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNS
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|431946.3.peg.1466
Escherichia coli SE15 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GH-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPLGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPAVSPGTL
L
NQQT
A
I
C
fig|655817.3.peg.1822
Escherichia coli ABU 83972 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
V
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|199310.1.peg.1873
Escherichia coli CFT073 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
V
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|199310.4.peg.1801
Escherichia coli CFT073 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
V
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|749528.3.peg.3903
Escherichia coli MS 45-1 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
V
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|656408.3.peg.3568
Escherichia coli H591 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAK
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|595495.4.peg.772
Escherichia coli KO11 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAK
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|656443.3.peg.3877
Escherichia coli TA271 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAK
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|566546.4.peg.3380
Escherichia coli W (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAK
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|585055.6.peg.3599
Escherichia coli 55989 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|585055.8.peg.3602
Escherichia coli 55989 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749545.3.peg.3380
Escherichia coli MS 182-1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
Q
A
T
E
LS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|749532.3.peg.1639
Escherichia coli MS 78-1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
Q
A
T
E
LS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NRHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|585034.4.peg.3231
Escherichia coli IAI1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFGTPDSEPTTS
V
LQGT
A
Q
C
fig|585034.5.peg.3229
Escherichia coli IAI1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFGTPDSEPTTS
V
LQGT
A
Q
C
fig|358709.5.peg.3523
Escherichia coli 101-1 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GF
T
P
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
H
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|344610.3.peg.2823
Escherichia coli 53638 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
L
G
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|344610.7.peg.3091
Escherichia coli 53638 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
L
G
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWDYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|679205.4.peg.3340
Escherichia coli MS 124-1 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSATNG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|749533.3.peg.1872
Escherichia coli MS 84-1 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSATNG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|749532.3.peg.1015
Escherichia coli MS 78-1 (11-882/883)
Y
R
M
H
QVLILPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
EL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-NSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|656408.3.peg.1665
Escherichia coli H591 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYYSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|679206.4.peg.3260
Escherichia coli MS 119-7 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYYSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|331112.3.peg.3117
Escherichia coli HS (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNCYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|331112.6.peg.3255
Escherichia coli HS (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNCYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QTSSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FD
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
V
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|216592.1.peg.2051
Escherichia coli 042 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TEL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNST
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAEN
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|344601.3.peg.2071
Escherichia coli B171 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALYLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|344601.5.peg.2163
Escherichia coli B171 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALYLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|340185.3.peg.511
Escherichia coli E22 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALYLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|340185.4.peg.553
Escherichia coli E22 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALYLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|585395.4.peg.4069
Escherichia coli O103:H2 str. 12009 (1-837/838)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKK
I
TF-
---
--TANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
EKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALYLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|869729.3.peg.2018
Escherichia coli UM146 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAH
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
FV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVDYKLPVVSPGTL
L
NQQT
A
I
C
fig|6666666.5357.peg.170
Escherichia coli TY-2482 (1-836/837)
MPQR
HHQG
H
K
R
TPKQLALIIKRCLPM
---
VLTGSGMLCTTANAEEYY
F
D
PIM
L
ETTKSGM
-
QT
T
DLS
RFSKKY
-
AQL
PG
T
Y
Q
VDI
WL
N
KKKVS
--
QKN
Y
IY-
---
---ANAEQL
-
LQP
Q
F
T
VEQ
L
REL
G
I
KVDEI
-------
PALAEK
--
DDDS
V
I
N
SLEQIIPGTAAEF
D
FNHQR
L
N
L
S
I
-
PQ
I
A
L
YRDAR
GY
V
S
P
SR
WD
-----
D
GI
PTLFT
NY
S
FTGSDNRYRQ-
--
--
-
---------------
GNRSQRQYLNMQ
N
G
A
N
F
G
P
WRLRN
YSTWT------RND
--------
-QASSWNTISSYLQ
R
DIKA
L
KSQ
L
L
LG
E
SA
T
S
G
S
IF
S
S
YTFT
G
VQ
L
A
SD
DN
MLP
NSQR
GFAP
T
V
R
GIA
NS
S
-
A
I
VTI
R
QNG
YV
IY
QSN
VP
A
G
A
F
E
I
N
DL
YPSSNS
GDL
E
V
T
I
E
E
S
DG
TQRR
F
IQ
P
Y
SS
L
P
M
M
Q
R
P
G
HL
K
Y
SA
T
A
G
RYRADANSDSKEPE
F
AEATAIY
G
LNNTF
T
L
YGG
L
-
LGSED
Y
Y
A
LGI
G
I
G
GTLGAL
GA
L
S
M
D
I
N
R
A
D
T
Q
F
D
-----
NQHSF
-
H
G
Y
Q
W
R
TQ
Y
I
K
DIP
-
ETN
T
NIA
V
SY
YRY
T
NDG
Y
F
S
FN
E
ANTR-
-
-
--
--------------------NWNYNSRQ
K
SEIQF
N
IS
Q
T
I
FD-GV
S
LYA
S
GSQ
Q
D
YW
GN
N
DKNR
-
NISV
G
VS
---
GQQWG
I
GY
S
L
N
YQYSRYTD
-----
Q-NN
D
RALSLN
L
SIP
LERW-
--
-------
-
LPRSRVSYQMTSQKDRPTQHEMRLD
G
SLLDDGR
-
L
SY
S
L
EQ
S
LDDDNNHNSSLNA----S
Y
RSPY
G
TFSAG
YS
YG
--
ND
-
-SSQYNYGV
T
GG
VVIHPH
G
V
TL
--
SQYLG
N
AFA
LI
D
A
N
-
G
ASGVR
I
Q
-
NYPGIA
TD
PF
G
YA
V
V
P
YL
T
T
Y
Q
E
N
R
L
S
V
D
TTQLPDNVD
L
EQ
T
TQFVVPNR
GA
M
VAAR
F
N
A
NI
G
YRVLVTVSDRN
G
KPL
PFG
ALASND
---
-ETGQQ
S
IV
D
E
G
G
IL
YL
S
G
ISSKSQ
S
WT
V
R
WG
NQADQQ
C
QFAFSTPDSEPTTS
V
LQGT
A
Q
C
fig|340184.3.peg.52
Escherichia coli B7A (11-882/883)
Y
R
M
H
QVLILPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
EL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-NSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLTFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|340186.3.peg.136
Escherichia coli E110019 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKIAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYYSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|749549.3.peg.244
Escherichia coli MS 198-1 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFIT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
I
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-KNKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|656437.3.peg.1728
Escherichia coli TA143 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFIT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
I
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-KNKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|340185.3.peg.2629
Escherichia coli E22 (11-882/883)
Y
R
M
H
QVLILPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
EL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-NSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.375.peg.3835
Escherichia coli EC4100B (11-882/883)
Y
R
M
H
QVLILPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
EL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-NSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|679204.3.peg.3869
Escherichia coli MS 145-7 (11-882/883)
Y
R
M
H
QVLILPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
EL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-NSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|595495.4.peg.3471
Escherichia coli KO11 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKIAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYYSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
NE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGQ
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|566546.3.peg.1193
Escherichia coli W (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKIAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYYSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
NE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGQ
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|405955.9.peg.1320
Escherichia coli APEC O1 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FDG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVDYKLPVVSPGTL
L
NQQT
A
I
C
fig|656443.3.peg.1855
Escherichia coli TA271 (11-882/883)
Y
R
M
H
QVLIMPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAE
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASTEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYYSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFA
Q
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTNEQTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RN
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPEVSPGTL
L
NQQT
A
I
C
fig|562.376.peg.3005
Escherichia coli WV_060327 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDSKEHS
AEKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
A
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
T
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPVVSPGTL
L
NQQT
A
I
C
fig|656393.3.peg.2254
Escherichia coli H299 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
AEKHVS
--
DNSA
C
T
-
PLRDRLADASSEF
N
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DGTRH
-
S
G
Q
S
I
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDSNEKTQFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFSD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNIG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVEYKLPEVSPGTL
L
NQQT
A
I
C
fig|216593.1.peg.312
Escherichia coli E2348/69 (11-882/883)
Y
R
M
H
QVLLLPRFARLTIAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMT
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHAP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAR
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QIT
VPPGPF
T
I
D
D
I
NSAANG
GDL
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
C
Y
AL
A
M
G
EYRS-GNNLQSSPK
F
IQGSLMH
G
LEGNW
T
P
YGG
M
-
QIAEN
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
L
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FRNSNASYNMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTHGGNTSSGTSGYSSLN
Y
RGAY
G
NTNIG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
NDKNSN
C
IVDYKLPAVSPGTL
L
NQQT
A
I
C
fig|714962.3.peg.1691
Escherichia coli IHE3034 (11-882/883)
Y
R
M
H
QVLILPRFARLTFAL
---
GLA---T-AVFPVDAEYY
FN
PRF
L
SNDLAES
---
V
DLS
AFTKGR
-
EAP
PG
T
Y
R
VDI
YL
N
DEFMA
--
SRD
I
TF-
---
-IADDNNAD
-
LIP
C
L
S
TDL
L
VSL
G
I
KKSAL
LDNKEHS
ADKHVP
--
DNSA
C
T
-
PLQDRLADASSEF
D
VGQQH
L
S
L
S
V
-
PQ
I
Y
V
GRMAH
GY
V
S
P
DL
W
E
-----
E
GI
NAGLL
NY
S
FNG-NSINNRS
N
H
NA
G
---------------
K--SNYAYLNLQ
SG
I
N
I
G
S
WRLR
D
NSTWSYNSGSSNSS
--------
-DSNKWQHINTSAE
R
DIIP
L
RSR
L
T
V
GD
SY
T
D
GDIFDS
VNFR
G
LK
I
N
S
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
-
A
Q
V
SV
K
QNG
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NSAANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQT
L
YV
P
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
G
EYRS-GNNLQSTPK
F
VQASLMH
G
LKGNW
T
P
YGG
M
-
QIAED
Y
Q
A
FNL
G
I
G
KDLGLF
GA
F
S
F
D
I
T
Q
A
N
T
T
L
A
-----
DDTRH
-
S
G
Q
S
V
K
SV
Y
S
K
SFY
-
QTG
T
NIQ
V
AG
YRYS
TQG
F
Y
N
LS
D
SAYSR
M
S
GY
TVKPP-TGDTSEQTLFIDYFNLFYSKR-
-
GQEQI
S
IS
Q
Q
L
GN-YG
T
TFF
S
ASR
Q
S
YW
NT
S
RSDQ
-
QISF
G
LN
---
VPFGD
I
TT
S
L
N
YSYSNNIW
-----
QNDR
D
HLLAFT
L
N
V
P
FSHWM
--
RTDSQSA
-
FHNSNASYSMSNDLKGGMTNLSGVY
G
TLLPDNN
-
L
N
Y
S
V
QV
G
NTQGGNTSSGTSGYSSLN
Y
RGAY
G
NTNVG
YS
RS
--
GD
-
-SSQIYYGM
SGG
IIAHAD
G
I
T
F
--
GQPLG
D
TMV
L
V
K
A
P
-
G
ADNVK
I
E
-
NQTGIH
TD
WR
G
YA
I
L
P
FA
T
E
YR
E
N
R
V
A
L
N
ANSLADNVE
L
DE
T
VVTVIPTH
GAI
ARAT
F
N
A
QI
G
GKVLMTL-KYG
N
KSV
PFG
AIVTHG
---
-ENKNG
S
IV
A
E
N
G
QV
YL
T
G
LPQSGK
-
LQ
V
S
WG
KDKNSN
C
IVDYKLPVVSPGTL
L
NQQT
A
I
C
fig|216593.1.peg.3415
Escherichia coli E2348/69 (1-811/825)
MSLPMHRTFVLT
---
GIIFALSAVYSLSYARDE
FN
LRI
L
ELD-SPL
EN
T
Q
V
L
A
DFINNN
-
NLT
PG
V
Y
L
TS
V
MW
G
QDSLD
--
KRN
I
TF-
---
--VLSSDKK
S
LIP
R
F
T
KAD
L
REF
GL
KVDVI
-------
PALKVM
--
NDDT
E
V
G
DIAQIIDGARYDF
Q
LDSQT
L
W
L
R
I
-
PQ
I
Y
Q
NAIAA
G
S
I
A
P
KY
W
N
-----
D
G
E
SAAWL
S
Y
Y
ASGSRQNSDGD
N
L
SS
N
---------------
-------WLNLN
SG
I
N
L
G
A
WRLRN
NTVYN---------
--------
--ESNWESISTSLQ
R
DIKA
L
RSQ
M
E
I
G
Q
TF
T
N
GD
L
FDS
VQMT
G
IK
L
E
T
D
TS
MLP
DSEQ
GFAP
V
V
R
V
IA
NS
D
-
A
Q
V
V
I
K
QNG
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQVTAG
S
DL
E
V
T
I
K
E
T
N
G
QEHS
F
IQ
A
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
A
G
KYRD-SDNNAETPV
F
GVATAIY
G
LPYGI
T
I
YGG
I
-
LGASM
Y
H
S
GVT
G
I
G
ADLGRL
G
S
V
S
V
D
I
T
A
A
K
T
K
F
D
-----
DGRDD
A
T
G
L
S
W
R
AQ
Y
A
K
DFP
-
DTD
T
TVT
L
AS
YRYS
TSQ
F
Y
T
FQ
E
AL---
-
-
--
--------DQRDTPDDKGIYSYRQTNNR
R
NRLQI
N
LS
Q
N
I
GR-WG
S
VYL
N
GYQ
Q
D
YW
GM
H
GAER
-
SIGM
G
YS
---
TTWNN
I
NW
S
V
N
YTLTKTPG
-----
M-TG
E
QQFSLT
L
N
IP
LSRW-
--
-------
-
LPDSWAMYNVNRSDKSNTSHQLGIG
G
TALQDNN
-
L
SY
N
L
QQ
S
YTDNNVGYGASMN---GR
Y
RSSV
G
EFGLG
YS
YD
--
KN
-
-SRQWNYSA
Q
G
A
VVAHAH
G
V
TL
--
GQSVQ
D
SFA
IV
H
I
N
-
E
GANVK
V
Q
-
NAQGVY
TD
FW
G
NA
I
V
P
NM
T
N
YR
H
N
A
I
T
V
N
TQG-HDSLD
I
SD
A
TQDVIPSK
GA
V
VGVD
F
D
A
RS
G
MRALLTL-VHN
K
ERV
PFG
ALLTLG
---
---NST
A
IV
G
E
D
G
EV
Y
I
T
G
VQESMT
-
FT
V
Q
WG
KEINQQ
C
TGVITEPE
fig|574521.7.peg.162
Escherichia coli O127:H6 str. E2348/69 (1-811/825)
MSLPMHRTFVLT
---
GIIFALSAVYSLSYARDE
FN
LRI
L
ELD-SPL
EN
T
Q
V
L
A
DFINNN
-
NLT
PG
V
Y
L
TS
V
MW
G
QDSLD
--
KRN
I
TF-
---
--VLSSDKK
S
LIP
R
F
T
KAD
L
REF
GL
KVDVI
-------
PALKVM
--
NDDT
E
V
G
DIAQIIDGARYDF
Q
LDSQT
L
W
L
R
I
-
PQ
I
Y
Q
NAIAA
G
S
I
A
P
KY
W
N
-----
D
G
E
SAAWL
S
Y
Y
ASGSRQNSDGD
N
L
SS
N
---------------
-------WLNLN
SG
I
N
L
G
A
WRLRN
NTVYN---------
--------
--ESNWESISTSLQ
R
DIKA
L
RSQ
M
E
I
G
Q
TF
T
N
GD
L
FDS
VQMT
G
IK
L
E
T
D
TS
MLP
DSEQ
GFAP
V
V
R
V
IA
NS
D
-
A
Q
V
V
I
K
QNG
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQVTAG
S
DL
E
V
T
I
K
E
T
N
G
QEHS
F
IQ
A
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
A
G
KYRD-SDNNAETPV
F
GVATAIY
G
LPYGI
T
I
YGG
I
-
LGASM
Y
H
S
GVT
G
I
G
ADLGRL
G
S
V
S
V
D
I
T
A
A
K
T
K
F
D
-----
DGRDD
A
T
G
L
S
W
R
AQ
Y
A
K
DFP
-
DTD
T
TVT
L
AS
YRYS
TSQ
F
Y
T
FQ
E
AL---
-
-
--
--------DQRDTPDDKGIYSYRQTNNR
R
NRLQI
N
LS
Q
N
I
GR-WG
S
VYL
N
GYQ
Q
D
YW
GM
H
GAER
-
SIGM
G
YS
---
TTWNN
I
NW
S
V
N
YTLTKTPG
-----
M-TG
E
QQFSLT
L
N
IP
LSRW-
--
-------
-
LPDSWAMYNVNRSDKSNTSHQLGIG
G
TALQDNN
-
L
SY
N
L
QQ
S
YTDNNVGYGASMN---GR
Y
RSSV
G
EFGLG
YS
YD
--
KN
-
-SRQWNYSA
Q
G
A
VVAHAH
G
V
TL
--
GQSVQ
D
SFA
IV
H
I
N
-
E
GANVK
V
Q
-
NAQGVY
TD
FW
G
NA
I
V
P
NM
T
N
YR
H
N
A
I
T
V
N
TQG-HDSLD
I
SD
A
TQDVIPSK
GA
V
VGVD
F
D
A
RS
G
MRALLTL-VHN
K
ERV
PFG
ALLTLG
---
---NST
A
IV
G
E
D
G
EV
Y
I
T
G
VQESMT
-
FT
V
Q
WG
KEINQQ
C
TGVITEPE
fig|656417.3.peg.249
Escherichia coli M605 (1-811/825)
MLLPLHRTFVLT
---
GITFALSAVYSLSYARDE
FN
LRI
L
ELD-SPL
EN
T
Q
V
L
E
DFVNNN
-
NLT
PG
V
Y
L
TS
V
MW
G
QEYLD
--
KRN
I
TF-
---
--ILSSDKK
R
LIP
R
F
T
KAD
L
REF
GL
KVDDI
-------
PALQVM
--
DDDT
E
F
G
DIAQIIDGARYDF
Q
LDSQT
L
C
L
R
I
-
PQ
I
Y
Q
NARAA
G
S
I
S
P
KY
W
S
-----
D
G
E
SAVWL
S
Y
Y
ASGSRQNSDGD
N
L
NS
N
---------------
-------WLNLN
SG
I
N
L
G
V
WRLRN
NTVYS---------
--------
--DSSWESISTSLQ
R
DIKA
L
RSQ
M
E
V
G
Q
TF
T
N
GD
L
FDS
VQMT
G
IK
L
E
T
D
TS
MLP
DSEQ
GFAP
V
V
R
GIA
NS
D
-
A
Q
V
V
I
K
QNG
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQVTAG
A
DL
E
V
T
I
K
E
T
N
G
QEHS
F
IQ
A
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
T
G
KYRD-NDNHAETPV
F
GVATAIY
G
LPYGI
T
I
YGG
I
-
LGASI
Y
H
S
GVT
G
I
G
ADLGRL
G
S
V
S
V
D
I
T
A
A
E
T
K
F
D
-----
DGRDD
A
T
G
L
S
W
R
AQ
Y
A
K
DFP
-
DTD
T
TVT
L
AS
YRYS
TSQ
F
Y
T
FQ
E
AL---
-
-
--
--------DQRDTPDDKGIYSYRQTNNR
R
NRLQI
N
LS
Q
N
I
GR-WG
S
VYL
N
GYQ
Q
D
YW
GM
H
GAER
-
SIGM
G
YS
---
TTWSN
I
NW
S
V
N
YTLTKTPG
-----
M-AG
E
QQFSLT
L
N
IP
LSRW-
--
-------
-
LPDSWAMYNVNRSDKSNTSHQLGIG
G
TALQDNN
-
L
SY
N
L
QQ
S
YTDNNVGYGASIN---GR
Y
RSSV
G
EFGLG
YS
YD
--
KN
-
-SRQWNYSA
Q
G
A
VVAHAH
G
V
TL
--
GQSVQ
D
SFA
IV
H
I
N
-
E
GANVK
V
Q
-
NAQGVY
TD
YW
G
NA
I
V
P
NM
T
N
YR
H
N
A
I
T
V
N
TQG-HDSLD
I
SD
A
TQDVIPSK
GA
V
VGVD
F
D
A
RS
G
IRALLTL-VHN
K
ERV
PFG
ALLTLG
---
---NST
A
IV
G
E
D
G
EV
Y
I
T
G
VQESMT
-
FT
V
Q
WG
KEINQQ
C
TGVVTVPE
fig|431946.3.peg.168
Escherichia coli SE15 (1-811/825)
MLLPLHRTFVLT
---
GITFALSAVYSLSYARDE
FN
LRI
L
ELD-SPL
EN
T
Q
V
L
E
DFVNNN
-
NLT
PG
V
Y
L
TS
V
MW
G
QEYLD
--
KRN
I
TF-
---
--ILSSDKK
R
LIP
R
F
T
KAD
L
REF
GL
KVDDI
-------
PALQVM
--
DDDT
E
F
G
DIAQIIDGARYDF
Q
LDSQT
L
C
L
R
I
-
PQ
I
Y
Q
NARAA
G
S
I
S
P
KY
W
S
-----
D
G
E
SAVWL
S
Y
Y
ASGSRQNSDGD
N
L
NS
N
---------------
-------WLNLN
SG
I
N
L
G
V
WRLRN
NTVYS---------
--------
--DSSWESISTSLQ
R
DIKA
L
RSQ
M
E
V
G
Q
TF
T
N
GD
L
FDS
VQMT
G
IK
L
E
T
D
TS
MLP
DSEQ
GFAP
V
V
R
GIA
NS
D
-
A
Q
V
V
I
K
QNG
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQVTAG
A
DL
E
V
T
I
K
E
T
N
G
QEHS
F
IQ
A
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
T
G
KYRD-NDNHAETPV
F
GVATAIY
G
LPYGI
T
I
YGG
I
-
LGASI
Y
H
S
GVT
G
I
G
ADLGRL
G
S
V
S
V
D
I
T
A
A
E
T
K
F
D
-----
DGRDD
A
T
G
L
S
W
R
AQ
Y
A
K
DFP
-
DTD
T
TVT
L
AS
YRYS
TSQ
F
Y
T
FQ
E
AL---
-
-
--
--------DQRDTPDDKGIYSYRQTNNR
R
NRLQI
N
LS
Q
N
I
GR-WG
S
VYL
N
GYQ
Q
D
YW
GM
H
GAER
-
SIGM
G
YS
---
TTWSN
I
NW
S
V
N
YTLTKTPG
-----
M-AG
E
QQFSLT
L
N
IP
LSRW-
--
-------
-
LPDSWAMYNVNRSDKSNTSHQLGIG
G
TALQDNN
-
L
SY
N
L
QQ
S
YTDNNVGYDASMN---GR
Y
RSSV
G
EFGLG
YS
YD
--
KN
-
-SRQWNYSA
Q
G
A
VVAHAH
G
V
TL
--
GQSVQ
D
SFA
IV
H
I
N
-
E
GANVK
V
Q
-
NAQGVY
TD
YW
G
NA
I
V
P
NM
T
N
YR
H
N
A
I
T
V
N
TQG-HDSLD
I
SD
A
TQDVIPSK
GA
V
VGVD
F
D
A
RS
G
IRALLTL-VHN
K
ERV
PFG
ALLTLG
---
---NST
A
IV
G
E
D
G
EV
Y
I
T
G
VQESMT
-
FT
V
Q
WG
KEINQQ
C
TGVVTVPE
fig|409438.11.peg.16
Escherichia coli SE11 (3-831/835)
YSKLFLSV
GLA
LVTLSGW------GRTYT
F
D
PSL
V
ESSGGDS
---
V
D
V
S
LFNQG-
-
LQL
PG
E
Y
F
V
S
I
FV
N
GEKVG
--
SDN
I
NF-
---
RIENHNGED
T
LSP
C
L
N
ADQ
L
TKY
G
I
DIHKY
-------
SDLF--
N
A
GPEQ
C
A
-
NL-WAIPQADIQF
D
FNQQK
L
S
L
L
L
-
P
TQ
A
L
LPKLN
G
I
A
P
E
QL
WD
-----
D
GI
PALFM
NY
Q
TNMQQREYQGA
Y
-
--
-
---------------
KSHDESYYAQLQ
P
G
L
N
I
G
P
WR
F
R
S
AASWQ-----KEQG
--------
-----WQRSYIYAE
R
GLNT
I
KGR
L
T
LG
E
SY
S
D
G
S
IFDS
IPFT
G
GK
L
A
SD
ET
MLP
YDQW
S
F
S
P
V
I
R
G
V
A
RT
Q
-
A
R
V
E
V
Q
QNG
YT
V
S
NDL
I
P
S
GPF
E
L
T
N
L
PLGGGS
GDL
K
V
I
V
H
E
S
DG
TQQV
F
TV
P
Y
D
T
P
A
V
A
L
R
Q
G
YF
E
Y
SV
M
G
G
EYRP-ANDAVQTTP
V
GALEMKY
G
LPWNL
T
L
YGG
L
-
QGAGN
Y
Q
A
AAL
G
I
G
SLLGDF
GA
L
S
A
D
V
V
Q
S
N
S
K
K
D
-----
NQQKE
-
S
G
Q
R
W
R
VR
Y
N
K
SLD
-
-SG
T
SVN
I
AS
EE
Y
A
TEG
F
N
T
LS
D
TL---
-
-
--
--------NTYCKPDAGNICYSDYKKP-
K
NKVNL
S
IS
Q
T
T
DG-WG
T
FNF
N
GYR
Q
N
YW
ND
K
STTT
-
SFTA
G
YS
---
RMFDS
G
-I
S
L
N
VNLSKTQN
IDKNG
KKTN
D
RLTSLW
L
S
F
P
LSRWL
SN
S------
-
S--VNANYQMTSDTRGDSMHEFGVY
G
DAF-NRQ
-
L
H
W
D
L
RE
R
YRDNASDNKASSA-LSLN
Y
RGTY
G
ELRGN
YS
YD
--
KK
-
-QRQLGIGI
N
G
N
IVATQY
G
I
T
A
--
GQSSG
D
TMA
L
V
Q
A
P
-
G
VDGAS
V
G
-
YWPGMK
TD
FR
G
YT
S
Y
G
YL
T
P
YR
E
N
N
I
D
I
N
PVTLPKNAE
I
SQ
T
STRVVPTK
GA
V
VLAK
F
D
T
RI
G
GRLLLQLKRSD
N
KPV
PFG
SV---A
TVE
GQASSS
GIV
G
D
N
S
QV
YL
T
G
VPKEAT
-
VK
I
Q
WG
KDKTQS
C
HARVLLPEDVNTTG
I
YNLT
A
V
C
fig|749546.3.peg.1803
Escherichia coli MS 185-1 (8-849/850)
YELSALYIAV
---
LSSLPFFFCADVAARSYT
F
E
PSM
L
NVDGND-
---
I
DLS
IFESG-
-
AQL
PG
T
Y
Y
VDI
ML
N
GKLVD
--
TKE
M
EF-
--
S
RERNKDGEF
V
LSS
C
L
T
QSM
L
NRY
G
V
KVGDY
-------
PELFVN
SS
NGKV
C
G
-
DL-SVIPGAFSYF
D
FYNQQ
L
N
L
S
I
-
P
NV
A
L
YPKYK
G
I
A
S
E
EL
WD
-----
N
GI
NAFLM
NY
Q
ANAQINQYRNK
K
-
--
-
---------------
NREVSSYWARIE
P
G
M
N
I
G
S
WR
I
RN
LTTFT-----KENG
--------
-NSEKRESVYTYAE
R
GLTS
I
KSN
L
L
I
G
E
SY
T
N
S
DIFDS
ISFR
G
IM
L
H
SD
ES
M
V
P
YSKY
A
FAP
V
I
R
GIA
QS
Q
-
A
L
I
E
V
R
QNG
YL
I
H
TVS
V
A
PG
A
F
E
I
S
DL
PVTGSG
GDL
Q
V
S
V
I
E
T
N
G
KNQS
F
TV
P
Y
TT
P
V
I
A
L
R
E
N
YL
K
Y
SL
V
G
G
MYRS-AYSGVDNTA
L
IQMTAMY
G
MPWNL
T
T
F
V
G
F
-
QGSEH
Y
N
S
VAT
G
V
G
LSMGDM
GA
I
S
L
D
G
I
Y
A
R
G
Q
K
E
-----
KQNKE
-
D
G
Y
S
W
R
VR
Y
S
K
VFD
-
ITG
T
NFI
A
AS
HQ
YS
SDG
Y
Q
T
LS
D
VL---
-
-
--
--------DTY---GHSNYSYGGYANR-
S
MRNSL
T
IS
Q
S
M
GE-WG
T
FSF
G
GVR
D
E
Y
R
GN
R
SPQN
-
SINA
L
YS
---
NSMEW
G
TL
S
L
N
WSQNKITD
SSRSV
KDKK
E
NIFSFW
VSIP
LYRLL
GN
T------
-
SNNINATTQIQKYDNQKMQYEFGMN
G
RAF-NRQ
-
L
Y
W
D
I
SQ
R
LAPGNENYNDASR-LNLE
W
YGTY
G
QIRGG
Y
G
YS
--
DS
-
-LRQMNAGI
SG
T
AIVHSN
G
V
T
F
--
GQKQG
G
TIA
L
V
E
A
Q
-
G
VDGAE
V
I
-
GWPGVK
TD
FR
G
YT
A
L
G
HL
T
P
Y
Q
E
N
T
V
S
L
N
PASFPEYAE
V
LQ
T
DTKVIPTK
GA
V
VSAR
F
K
T
SI
G
KKALFKLTRHD
G
KKV
PFG
AVVSSA
TAD
DNKRVV
GIV
N
E
S
G
EV
Y
M
S
G
LSEKGQ
-
LD
V
K
W
-
-NSHGS
C
KAVYKLSDNKSIVN
I
YNAS
L
T
C
fig|340197.3.peg.849
Escherichia coli F11 (38-854/863)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSARINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
L
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
V
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|340197.5.peg.891
Escherichia coli F11 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSARINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
L
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
V
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|749550.3.peg.1517
Escherichia coli MS 200-1 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSARINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
L
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
V
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|869729.3.peg.4667
Escherichia coli UM146 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSARINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
L
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
V
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|364106.7.peg.4774
Escherichia coli UTI89 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSARINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
L
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
V
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|364106.8.peg.4773
Escherichia coli UTI89 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSARINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
L
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
V
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|362663.8.peg.3815
Escherichia coli 536 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSASINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
SQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
I
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGES-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
L
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAA
I
N
-
NASGSR
L
D
FW
G
NG
I
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|362663.9.peg.3829
Escherichia coli 536 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
A
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSASINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
SQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYSSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
I
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGES-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
L
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFGA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAA
I
N
-
NASGSR
L
D
FW
G
NG
I
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|525281.3.peg.1568
Escherichia coli 83972 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
V
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSASINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYNSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
I
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFSA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
I
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|655817.3.peg.5064
Escherichia coli ABU 83972 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
V
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSASINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYNSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
I
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFSA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
I
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|749546.3.peg.3032
Escherichia coli MS 185-1 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
V
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSASINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYNSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
I
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFSA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
I
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|749528.3.peg.1477
Escherichia coli MS 45-1 (19-835/844)
FAAL
---
GLT---VTNHSFAAEEAE
F
D
SEF
L
HLDKGIN
--
V
I
D
I
R
RFSHGN
-
PVP
E
G
R
Y
Y
S
DI
YV
N
NVWKG
--
KAD
L
QY-
---
--LRTANTG
A
PTL
C
L
T
PEL
L
SLI
D
L
VKDTM
-------
------
-
S
GNTS
C
F
-
PASTGLSSASINF
D
LSTLR
L
N
I
E
I
-
PQA
L
L
NTRPR
GY
I
S
P
AQ
W
Q
-----
S
G
V
PAAFI
NY
D
ANYYQYNSS--
--
--
-
---------------
GTSNEQTYLGLK
A
G
F
N
L
W
G
W
A
LR
H
RGSES-----WNNS
--------
-YPAGYQNIETSIM
H
DLAP
L
RAQ
F
T
LGD
FY
T
N
G
EL
M
DS
LSLR
G
VR
L
A
SD
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
N
-
A
K
VTI
Y
QN
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YPSGYA
GDL
I
V
K
I
T
E
S
N
G
QTRM
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
KYRY--ANKTYNDL
I
AQGTYQY
G
LTNDI
T
L
NS
G
L
-
TTASG
Y
T
A
GLA
G
L
A
FNT-PL
GA
I
A
S
D
I
T
L
S
R
T
A
F
R
-----
YSGVT
R
K
G
Y
S
L
H
SS
Y
S
I
NIP
-
ASN
T
NIT
L
AA
YRYS
SKD
F
Y
H
LK
D
ALSAN
H
N
AF
IDDVSVKS------------TAFYRPR-
-
NQFQI
S
IN
Q
E
L
GEKWG
G
MYL
T
GTT
Y
N
YW
GH
K
GSRN
-
EYQM
G
YS
---
NFWKQ
L
GY
Q
I
G
LSQSRDNE
-----
QQRR
D
DRFYIN
F
TL
P
LGGS-
--
-------
-
VQSPVFSTVLNYSKEEKNSIQTSIS
G
TGGEDNQ
-
F
SY
G
I
SG
N
SQENGPSGYAMNG----G
Y
RSPY
V
NITTT
VG
HD
--
TQ
-
NNNQRSFSA
SG
A
VVAHPY
G
V
TL
--
SNDLS
D
TFA
I
I
H
A
E
-
G
AQGAV
I
N
-
NASGSR
L
D
FW
G
NG
I
V
P
YV
T
P
Y
E
K
N
Q
I
S
I
D
PSNLDLNVE
L
SA
T
EQEIIPRA
N
S
A
TLVK
F
D
T
KT
G
RSLLFDIRMST
G
NPP
P
MA
SEVLDE
---
-HGQLA
G
Y
V
A
Q
A
G
KV
F
T
R
G
LPEKGH
-
LS
V
V
WG
PDNKDR
C
SFVYHVAHNKDDMQ
S
QLVP
V
L
C
fig|585397.7.peg.2478
Escherichia coli ED1a (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
V
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
E
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRHDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|585397.9.peg.2476
Escherichia coli ED1a (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
V
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
E
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRHDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|405955.13.peg.2323
Escherichia coli APEC O1 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|405955.9.peg.1894
Escherichia coli APEC O1 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|714962.3.peg.2382
Escherichia coli IHE3034 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|753642.3.peg.2987
Escherichia coli NC101 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|585035.6.peg.2231
Escherichia coli S88 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|869729.3.peg.1329
Escherichia coli UM146 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|364106.7.peg.2409
Escherichia coli UTI89 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|364106.8.peg.2412
Escherichia coli UTI89 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQVGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|562.376.peg.4023
Escherichia coli WV_060327 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
VA
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGVN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|362663.8.peg.2160
Escherichia coli 536 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGGWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSKSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|362663.9.peg.2165
Escherichia coli 536 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGGWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSKSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|340197.3.peg.167
Escherichia coli F11 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGGWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|340197.5.peg.172
Escherichia coli F11 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGGWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|749550.3.peg.1555
Escherichia coli MS 200-1 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGGWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|656440.3.peg.2089
Escherichia coli TA206 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
V
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|685038.3.peg.2163
Escherichia coli O83:H1 str. NRG 857C (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
E
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGHQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRHDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|749546.3.peg.2884
Escherichia coli MS 185-1 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGAFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|749528.3.peg.1573
Escherichia coli MS 45-1 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|525281.3.peg.3908
Escherichia coli 83972 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSNK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|655817.3.peg.2549
Escherichia coli ABU 83972 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSNK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|199310.1.peg.2567
Escherichia coli CFT073 (15-818/831)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
X
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|199310.4.peg.2479
Escherichia coli CFT073 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
V
S
NIRLED
N
QPL
PG
P
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDSF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EDLES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
Q
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
I
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
X
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|656417.3.peg.2772
Escherichia coli M605 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
VA
NIRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RF
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|670897.3.peg.4574
Escherichia coli 2362-75 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
VA
NIRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGM
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
C
YS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
GYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQHQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
ST
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
H
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
LLVN
F
D
T
DQ
R
KPCFIKALRAD
G
QPL
T
FG
YEVNDL
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|216593.1.peg.2647
Escherichia coli E2348/69 (15-818/831)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
VA
NIRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGM
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
C
YS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
GYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQHQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
ST
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
H
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
LLVN
F
D
T
DQ
R
KPCFIKALRAD
G
QPL
T
FG
YEVNDL
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|574521.7.peg.2303
Escherichia coli O127:H6 str. E2348/69 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
VA
NIRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELES
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGM
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
C
YS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
GYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQHQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
ST
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
H
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
LLVN
F
D
T
DQ
R
KPCFIKALRAD
G
QPL
T
FG
YEVNDL
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|431946.3.peg.2077
Escherichia coli SE15 (10-813/826)
AIVAL
---
------LIGIEAYAAEET
F
D
THF
M
IGGMKD-
---
Q
Q
VA
NIRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
QET
C
L
S
REM
I
KRL
G
I
NTDNF
-------
------
-
A
SGKQ
C
L
-
TFKQLIQGGSYTW
D
IGVFR
L
D
F
S
V
-
PQA
W
V
EELEI
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
MSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTNN
--------
-NPGVWKSNTLYLE
R
GFAQ
L
LGT
L
R
V
GD
MY
T
S
S
DIFDS
VRFS
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
S
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RF
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FVS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
HVWA-
-
-
--
NNKDNYRRDENDIYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNLRR
I
SY
T
L
A
ASHAYDEN
-----
-HHE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGVASNNTGLS
G
TVGSRDQ
-
F
N
Y
G
V
NL
S
YQYQGNETTAGAN---LT
W
NAPV
A
TVNGS
YS
QS
--
SA
-
-YRQAGASV
SGG
IVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
N
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
RN
G
VV
V
Y
D
GM
T
P
YR
E
N
Y
L
M
LD
VSQSDSEAE
L
RG
N
RKIAAPYR
GA
V
VLVN
F
D
T
DQ
R
KPWFIKALRAD
G
QPL
T
FG
YEVNDI
---
-HGHNI
G
V
V
G
Q
G
S
QL
F
I
R
T
NEVPPS
-
VN
V
A
ID
KQQGLS
C
TITF
fig|656419.3.peg.991
Escherichia coli M718 (4-806/816)
YRLSVLS
---
CLAMVTPPALA-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
NFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
IQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TPLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITA
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|550676.3.peg.174
Escherichia coli B185 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TPLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
V
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|562.371.peg.4313
Escherichia coli 1044A (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|562.373.peg.4575
Escherichia coli 1125A (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|562.372.peg.5748
Escherichia coli 1212A (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|562.374.peg.1823
Escherichia coli 536A (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|83334.1.peg.2919
Escherichia coli O157:H7 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|155864.1.peg.2923
Escherichia coli O157:H7 EDL933 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|155864.8.peg.2806
Escherichia coli O157:H7 EDL933 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444454.5.peg.1823
Escherichia coli O157:H7 str. EC4024 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444449.5.peg.1298
Escherichia coli O157:H7 str. EC4042 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444448.5.peg.60
Escherichia coli O157:H7 str. EC4045 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444453.5.peg.5133
Escherichia coli O157:H7 str. EC4076 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444452.5.peg.3101
Escherichia coli O157:H7 str. EC4113 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444450.8.peg.3091
Escherichia coli O157:H7 str. EC4115 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444451.5.peg.3476
Escherichia coli O157:H7 str. EC4196 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|444447.5.peg.192
Escherichia coli O157:H7 str. EC4206 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|478004.5.peg.3098
Escherichia coli O157:H7 str. EC4401 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|478005.5.peg.3911
Escherichia coli O157:H7 str. EC4486 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|478006.5.peg.2333
Escherichia coli O157:H7 str. EC4501 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|478007.5.peg.1743
Escherichia coli O157:H7 str. EC508 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|478008.5.peg.4185
Escherichia coli O157:H7 str. EC869 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|637388.3.peg.2120
Escherichia coli O157:H7 str. FRIK2000 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|570506.3.peg.3546
Escherichia coli O157:H7 str. FRIK966 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|386585.9.peg.3046
Escherichia coli O157:H7 str. Sakai (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|544404.4.peg.2953
Escherichia coli O157:H7 str. TW14359 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|502346.5.peg.4469
Escherichia coli O157:H7 str. TW14588 (10-813/826)
AIVAL
---
------LLGIEAHAAEET
F
D
THF
M
MGGMKG-
---
E
Q
V
T
NLRLDD
N
QPL
PG
Q
Y
D
I
DI
YV
N
KQWRG
--
KYE
I
IV-
---
------KDN
P
HET
C
L
T
REI
V
KRL
G
I
NSDNF
-------
------
-
A
RENQ
C
L
-
TFEQLVQGGSYSW
D
IGIFR
L
D
L
A
V
-
PQA
W
V
EELEN
GY
V
PP
EN
W
E
-----
R
GI
NAFYT
S
Y
Y
VSQYYSDYKAS
--
--
-
---------------
GNSK-STYVRFN
SG
L
N
L
L
G
W
Q
L
H
S
DASFS-----KTDN
--------
-NPGEWKSNTLYLE
H
GFSQ
I
LGT
L
R
I
GD
MY
T
S
A
DIFDS
VRFT
G
VR
L
F
R
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
-
A
L
VTI
E
QNG
FV
V
Y
QKE
VPPGPF
S
I
S
DL
QLAGGG
A
DL
D
V
S
V
K
E
A
DG
SVTT
Y
LV
P
Y
A
A
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
RSHIEGASKQSD--
F
VQAGYQY
G
FNNLL
T
L
YGG
T
-
MVANN
Y
Y
A
FTL
G
T
G
WNT-RI
GA
I
S
V
D
A
T
K
S
H
S
K
Q
D
-----
NGDVF
-
D
G
Q
S
Y
Q
IA
Y
N
K
FLS
-
QTS
T
RFG
L
AA
W
RYS
SRD
Y
R
T
FN
D
YVWA-
-
-
--
NNKDNYRRDKNDVYDIADYYQNDFGRK-
-
NSFSA
N
MS
Q
S
L
PEGWG
S
VSL
S
TLW
R
D
YW
GR
S
GSSK
-
DYQL
S
YS
---
NNWRR
I
SY
T
L
A
ASQAYDEN
-----
-HAE
E
KRFNIF
I
SIP
FD---
--
WGDDVTT
P
RRQIYMSNSTTFDDQGFASNNTGLS
G
TVGNRDQ
-
F
N
Y
G
I
NL
S
HQHQGNETTAGAN---LT
W
TAPA
A
TVNGS
YS
QS
--
ST
-
-YRQVGASV
SGG
LVAWSG
G
V
N
L
--
ANRLS
E
TFA
VM
H
A
P
-
G
IKDAY
V
N
-
GQKYRT
T
N
CN
G
VV
V
Y
D
GL
T
P
YR
E
N
H
L
M
M
D
VSQSDSETE
L
RG
N
RKMTAPYR
GA
V
VLVD
F
D
T
DQ
R
KPWFIKALRSD
G
QPL
T
FG
YEVNDM
---
-HGHNI
G
V
V
G
Q
G
S
QI
F
I
R
T
NEIPPA
-
VN
V
A
ID
KQQGLS
C
TITF
fig|550677.3.peg.400
Escherichia coli B354 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NH
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKV
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLSTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LI
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|656379.3.peg.252
Escherichia coli FVEC1302 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
V
YV
N
HQWRG
--
KQD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDDDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|656380.3.peg.191
Escherichia coli FVEC1412 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
V
YV
N
HQWRG
--
KQD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDDDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|749549.3.peg.1045
Escherichia coli MS 198-1 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
V
YV
N
HQWRG
--
KQD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDDDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|656437.3.peg.69
Escherichia coli TA143 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
V
YV
N
HQWRG
--
KQD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDDDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|585056.7.peg.203
Escherichia coli UMN026 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
V
YV
N
HQWRG
--
KQD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDDDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|701177.3.peg.870
Escherichia coli O55:H7 str. CB9615 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-TQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|562.373.peg.3015
Escherichia coli 1125A (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|562.372.peg.1711
Escherichia coli 1212A (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|562.374.peg.5266
Escherichia coli 536A (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444454.5.peg.5236
Escherichia coli O157:H7 str. EC4024 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444449.5.peg.5572
Escherichia coli O157:H7 str. EC4042 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444448.5.peg.3447
Escherichia coli O157:H7 str. EC4045 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444453.5.peg.790
Escherichia coli O157:H7 str. EC4076 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444452.5.peg.3574
Escherichia coli O157:H7 str. EC4113 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444450.8.peg.906
Escherichia coli O157:H7 str. EC4115 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|444451.5.peg.4518
Escherichia coli O157:H7 str. EC4196 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|478005.5.peg.1294
Escherichia coli O157:H7 str. EC4486 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|478007.5.peg.3992
Escherichia coli O157:H7 str. EC508 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|478008.5.peg.2136
Escherichia coli O157:H7 str. EC869 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|637388.3.peg.1495
Escherichia coli O157:H7 str. FRIK2000 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|570506.3.peg.422
Escherichia coli O157:H7 str. FRIK966 (7-809/819)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|544404.4.peg.771
Escherichia coli O157:H7 str. TW14359 (4-806/816)
YRLSVLS
---
CLAMVTPPALT-----AE
FN
LNV
L
DKSIRDS
---
V
D
I
S
LLNQKG
-
VVA
PG
D
Y
F
V
S
V
TV
N
NNKIS
-
N
GQQ
I
RWQ
---
KSGDK----
-
IIP
C
I
N
ESL
I
ELF
GL
KSDFR
-------
KKLPA-
--
-IKE
C
V
-
DFSV-FPEIIFTF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
N
-----
N
GI
PGFLM
D
YN
LFA-STYRPQS
--
--
-
---------------
GSSSNNLNAYGT
T
G
L
N
A
G
A
WRLR
S
DYQ--LSQS-DSGD
--------
NREQSGAISRTYLF
R
PLPQ
I
GSR
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
RPRSSMSHHTEDET
F
ISHEVSW
G
MLSNT
S
L
YGG
M
L
LAGDD
Y
R
S
GAL
G
I
G
QNMLWM
GA
L
S
F
D
V
T
W
A
D
S
H
F
D
-----
TQQDE
-
Q
G
Y
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YIDH-
-
-
--
-----------------KYNDAD-AQDE
K
QTISL
S
FG
Q
P
I
TLLNL
N
LYA
N
ILH
Q
S
W
W
NA
D
TSTT
A
NITV
G
FN
VDI
GDWKD
I
SV
S
T
S
FNTTHYED
-----
-KDR
D
NQIYFS
I
S
L
P
I----
--
-------
-
GESGRLGYDMQ-NNSNTTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDLT
G
T
YA
--
AN
-
DYTSASASW
SG
S
FTATQH
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VGDIP
I
-
-
QGNIDY
T
N
RF
G
IA
V
V
P
FV
S
S
Y
Q
P
T
T
V
A
V
N
MNDLPDGVT
V
SE
N
VVKETWTE
GAI
GFKS
L
A
S
RA
G
KDLNVIISDAN
G
HFP
P
L
G
ADVRQA
---
EGGVSV
G
M
V
G
E
N
G
HA
W
L
S
G
VDENQQ
-
FT
V
H
WG
DQKTCA
I
HLPEHLED
fig|749531.3.peg.1264
Escherichia coli MS 69-1 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NH
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
M
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYNDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
IIDGS
YS
HS
--
KN
-
-TWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LI
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
NQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|749527.3.peg.4010
Escherichia coli MS 21-1 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DVFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPV
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPRYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|656419.3.peg.170
Escherichia coli M718 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
A
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKHNHQSEYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDITK
T
RHQINLSNSTSFSKDGYSSNNTGVT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
IIDGS
YS
HS
--
KN
-
-TWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|749548.3.peg.2362
Escherichia coli MS 196-1 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
A
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKHNHQSEYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDITK
T
RHQINLSNSTSFSKDGYSSNNTGVT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
IIDGS
YS
HS
--
KN
-
-TWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|749532.3.peg.4255
Escherichia coli MS 78-1 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
V
YV
N
HQWRG
--
KQD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
I
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
A
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKHNHQSEYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDITK
T
RHQINLSNSTSFSKDGYSSNNTGVT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
IIDGS
YS
HS
--
KN
-
-TWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
S
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|550676.3.peg.4681
Escherichia coli B185 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSHTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NNWQH
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
LL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|585057.4.peg.21
Escherichia coli IAI39 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DVFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FH---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLC
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|585057.6.peg.21
Escherichia coli IAI39 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMRG-
---
E
K
V
S
EYRFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
HQWRG
--
KKD
I
TI-
---
------PES
P
AKP
C
L
P
KVL
L
TTL
G
V
KTDNL
-------
------
-
N
TEDN
C
I
-
LLDEAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DVFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FH---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLC
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|216592.1.peg.4400
Escherichia coli 042 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NNWQH
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|216592.3.peg.22
Escherichia coli 042 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGKWQSNTLYLE
R
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YT
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
VS
N
F
DF
I
A
G
RSQIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDTRH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLL
-
QTA
T
HFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NNWQH
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
IANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|562.371.peg.3950
Escherichia coli 1044A (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|562.373.peg.2255
Escherichia coli 1125A (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|562.372.peg.3106
Escherichia coli 1212A (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|562.374.peg.2878
Escherichia coli 536A (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|83334.1.peg.109
Escherichia coli O157:H7 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|155864.1.peg.21
Escherichia coli O157:H7 EDL933 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|155864.8.peg.18
Escherichia coli O157:H7 EDL933 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|478006.5.peg.3208
Escherichia coli O157:H7 str. EC4501 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|478007.5.peg.837
Escherichia coli O157:H7 str. EC508 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|478008.5.peg.447
Escherichia coli O157:H7 str. EC869 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|637388.3.peg.3856
Escherichia coli O157:H7 str. FRIK2000 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|570506.3.peg.1020
Escherichia coli O157:H7 str. FRIK966 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|386585.9.peg.117
Escherichia coli O157:H7 str. Sakai (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|502346.5.peg.1246
Escherichia coli O157:H7 str. TW14588 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|701177.3.peg.18
Escherichia coli O55:H7 str. CB9615 (10-811/816)
AII-F
---
------LYSFPGY-AEET
F
D
THF
M
IGGMKG-
---
E
K
V
S
EYHFDN
K
QPL
PG
N
Y
E
L
D
F
YV
N
NQWRG
--
KQD
I
TI-
---
------PES
P
VKP
C
L
P
KVL
L
TKL
G
V
KTGNL
-------
------
-
N
TEDN
C
I
-
LLDKAVHGGQYQW
D
ISEHR
L
N
LT
V
-
PQA
Y
I
NELER
GY
V
PP
ES
WD
-----
R
GI
DAFYT
S
YN
LSQYRS-YDSN
--
--
-
---------------
NNSNTASYGRFN
SG
L
N
L
F
S
W
Q
L
H
S
DASYS-----KPDD
--------
-MKGTWQSNTLYLE
H
GWSQ
I
LST
V
Q
I
G
E
NY
T
S
S
L
IFDS
LRFS
G
IR
L
F
R
D
MQ
MLP
DSMQ
S
F
T
P
L
V
Q
G
V
A
QS
N
-
A
L
I
T
V
S
QNG
YI
IY
QKE
VPPGPF
T
I
A
DL
QLSGSG
S
DL
D
V
S
I
K
E
A
DG
SVRS
F
LV
P
Y
SS
V
P
N
M
L
Q
P
G
IS
N
F
DF
I
A
G
RSKIYGVKNQED--
F
LEANYIY
G
LNNLL
T
L
YGG
T
-
ILSDN
Y
N
A
ITL
G
N
G
WNT-PL
GA
I
S
F
D
A
T
R
S
S
S
K
L
N
-----
NDITH
-
E
G
T
S
Y
Q
VA
Y
N
K
YLV
-
QTA
T
RFS
V
AA
W
RY
A
SQD
Y
R
T
FS
D
HLYE-
-
-
--
NDKINHQSDYDDFYDI--------GRK-
-
NSLSA
N
IM
Q
P
L
SNNLG
N
VSL
S
ALW
R
N
YW
GR
S
GNAK
-
DYQF
S
YS
---
NSWQR
I
SY
T
F
S
ASQSYDEN
-----
-DKE
E
ERFNLF
I
SIP
FY---
--
WGDDIAK
T
RHQINLSNSTSFSKDGYSSNNTGIT
G
IAGEHDQ
-
L
N
Y
G
I
YV
N
QQQQNNDTSLGTN---LS
W
RTPI
A
TIDGS
YS
HS
--
KN
-
-AWQSGGSI
S
S
G
LVVWPG
G
I
N
I
--
TNQLS
D
TFA
IL
D
A
P
-
G
LEGAH
I
N
-
GQKYNR
T
N
SK
G
QV
V
Y
D
LM
I
P
H
R
E
N
H
L
V
LD
TANSESETE
L
QG
N
RQIIAPYR
GA
V
SYVQ
F
T
T
DQ
R
KPWYIQALRPD
G
SPL
T
FG
YDVLDL
---
-QENNI
G
V
V
G
Q
G
S
RL
F
I
R
V
DEIPTG
-
IK
V
A
LN
DEQNLF
C
TITFQHVIDENK
fig|656437.3.peg.789
Escherichia coli TA143 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKGKG
-
GIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNF
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSNHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
V
WG
DSQHCS
L
YLPEHMENTA
fig|749531.3.peg.1848
Escherichia coli MS 69-1 (4-808/816)
YRLSFIS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNF
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSNHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
S
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
V
WG
DSQRCS
I
HLPEHMEDTA
fig|216592.3.peg.769
Escherichia coli 042 (4-808/816)
YRLSFIS
---
CLVMAMPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFDS
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
T
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDN
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSGHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
I
WG
DSQRCS
I
HLPEHMEDTA
fig|216592.1.peg.465
Escherichia coli 042 (21-825/833)
YRLSFIS
---
CLVMAMPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFDS
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
T
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDN
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSGHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
I
WG
DSQRCS
I
HLPEHMEDTA
fig|656379.3.peg.1492
Escherichia coli FVEC1302 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKGKG
-
GIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRTSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDN
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSNHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNLIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
I
WG
DSQRCS
I
HLPEHMEDTA
fig|656380.3.peg.1320
Escherichia coli FVEC1412 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKGKG
-
GIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRTSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDN
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSNHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNLIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
I
WG
DSQRCS
I
HLPEHMEDTA
fig|749549.3.peg.4481
Escherichia coli MS 198-1 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKGKG
-
GIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRTSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDN
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSNHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNLIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
I
WG
DSQRCS
I
HLPEHMEDTA
fig|585056.7.peg.1000
Escherichia coli UMN026 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKGKG
-
GIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRTSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDN
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSNHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AD
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNLIIRNAS
G
QFP
P
L
G
ADIRQE
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
I
WG
DSQRCS
I
HLPEHMEDTA
fig|656393.3.peg.1403
Escherichia coli H299 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
GIA
PG
E
Y
F
V
S
V
TV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RANN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRTSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATD
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNNSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGVQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
NSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQM
-
LT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|749547.3.peg.1737
Escherichia coli MS 187-1 (4-808/816)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TSLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|670888.3.peg.1279
Escherichia coli 1827-70 (4-808/816)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|413997.3.peg.720
Escherichia coli B str. REL606 (4-808/816)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|511693.5.peg.717
Escherichia coli BL21 (4-808/816)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|469008.4.peg.3039
Escherichia coli BL21(DE3) (4-808/816)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|358709.5.peg.3965
Escherichia coli 101-1 (4-808/816)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
MADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|550677.3.peg.1123
Escherichia coli B354 (4-808/816)
YRLSFIS
---
CLVMAIPSALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
DWK
---
KNGDQ----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPR-
--
-LNQ
C
V
-
DFSS-RPEILFIF
D
QASQQ
L
N
I
T
I
-
PQA
W
L
AWHSD
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSNSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
Q
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDND-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GNWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSQSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSTTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
NSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQM
-
LT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|340184.3.peg.2282
Escherichia coli B7A (7-811/819)
YRLSLVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQN
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|340184.6.peg.2396
Escherichia coli B7A (4-808/816)
YRLSLVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQN
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|679204.3.peg.4494
Escherichia coli MS 145-7 (4-808/816)
YRLSLVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQN
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|585055.6.peg.714
Escherichia coli 55989 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
KT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
I
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|585055.8.peg.716
Escherichia coli 55989 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
KT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
I
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|562.375.peg.1438
Escherichia coli EC4100B (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|656408.3.peg.660
Escherichia coli H591 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|679207.4.peg.1372
Escherichia coli MS 107-1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|679206.4.peg.2773
Escherichia coli MS 119-7 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|585396.4.peg.766
Escherichia coli O111:H- str. 11128 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|656443.3.peg.983
Escherichia coli TA271 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|566546.3.peg.4582
Escherichia coli W (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|566546.4.peg.772
Escherichia coli W (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|749548.3.peg.3226
Escherichia coli MS 196-1 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDH-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|573235.3.peg.793
Escherichia coli O26:H11 str. 11368 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNDVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|340186.3.peg.3855
Escherichia coli E110019 (7-811/819)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|331111.3.peg.3259
Escherichia coli E24377A (7-811/819)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|340186.5.peg.4048
Escherichia coli E110019 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|331111.12.peg.1040
Escherichia coli E24377A (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|481805.3.peg.3149
Escherichia coli ATCC 8739 (7-811/819)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
TE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|481805.6.peg.3134
Escherichia coli ATCC 8739 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
TE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|331112.3.peg.712
Escherichia coli HS (7-811/819)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
TE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|331112.6.peg.743
Escherichia coli HS (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
TE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|409438.11.peg.903
Escherichia coli SE11 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
N
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|595496.3.peg.644
Escherichia coli BW2952 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|536056.3.peg.3079
Escherichia coli DH1 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|656414.3.peg.899
Escherichia coli H736 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|749538.3.peg.653
Escherichia coli MS 116-1 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|749544.3.peg.3267
Escherichia coli MS 175-1 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|316407.3.peg.692
Escherichia coli W3110 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|316385.5.peg.781
Escherichia coli str. K-12 substr. DH10B (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|316385.7.peg.793
Escherichia coli str. K-12 substr. DH10B (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|511145.12.peg.748
Escherichia coli str. K-12 substr. MG1655 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|511145.6.peg.739
Escherichia coli str. K-12 substr. MG1655 (4-805/815)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|83333.1.peg.710
Escherichia coli K12 (7-808/818)
YRLSFVS
---
CLVMAMPCAMA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNKIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
A
G
A
WRLR
S
DYQ--LNKT-DSED
--------
NHDQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
ISDDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AS
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|679205.4.peg.3671
Escherichia coli MS 124-1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|749533.3.peg.4665
Escherichia coli MS 84-1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|550672.3.peg.961
Escherichia coli B088 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
VI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
K
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|585034.4.peg.688
Escherichia coli IAI1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
VI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
K
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|585034.5.peg.687
Escherichia coli IAI1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
I
T
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
A
G
I
N
T
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGEISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAI
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
VI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
K
G
HA
W
L
S
G
VAENQK
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|749532.3.peg.2590
Escherichia coli MS 78-1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
G
T
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
W
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
GSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|316401.4.peg.846
Escherichia coli ETEC H10407 (4-805/815)
YRLSFVS
---
CLVMAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWQ
---
KKGDK----
-
TIP
C
I
N
DSL
V
DKF
GL
KPDIR
-------
QSLPQ-
--
-IDR
C
I
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
A
PP
ST
W
K
-----
E
G
V
AGVLM
D
YN
LFA-SNYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
S
WRLR
S
DYQ--LNNT-DSED
--------
SHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
R
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTIHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QYP
P
L
G
ADIRQD
---
DSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQL
-
FT
V
V
WG
E-QSCI
I
HLPERLED
fig|749545.3.peg.3930
Escherichia coli MS 182-1 (4-808/816)
YRLSFVS
---
CLVVAMPCALA-----VE
FN
LNV
L
DKSMRDR
---
I
D
I
S
LLKEKG
-
VIA
PG
E
Y
F
V
S
V
AV
N
NNQIS
-
N
GQK
I
NWH
---
KNDDK----
-
TIP
C
I
N
DLL
V
DKF
GL
KPEVR
-------
QSLPL-
--
-INQ
C
V
-
DFSS-RPEMLFNF
D
QANQQ
L
N
IS
I
-
PQA
W
L
AWHSE
NW
T
PP
ST
W
K
-----
E
G
V
AGILM
D
YN
LFA-SSYRPQD
--
--
-
---------------
GSSSTNLNAYGT
T
G
I
N
A
G
A
WRLR
S
DYQ--LNQT-DSDD
--------
NHEQSGGISRTYLF
R
PLPQ
L
GSK
L
T
LG
E
TD
F
S
S
N
IFD
G
FSYT
G
AA
L
A
SD
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
N
-
A
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
N-QSVQ
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
QV
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
QPRPSMSHQTENET
F
FSNEVSW
G
MLSNT
S
L
YGG
L
L
LSGDD
Y
H
S
AAM
G
I
G
QNMLWL
GA
L
S
F
D
V
T
W
A
S
S
H
F
D
-----
TQQDE
-
R
G
L
S
Y
R
FN
Y
S
K
QVD
-
ATN
S
TIS
L
AA
YR
F
S
DRH
F
H
S
YA
N
YLDH-
-
-
--
-----------------KYNDSD-AQDE
K
QTISL
S
VG
Q
P
I
TPLNL
N
LYA
N
LLH
Q
T
G
W
NA
D
ASTT
A
NITA
G
FN
VDI
GDWRD
I
SI
S
T
S
FNTTHYED
-----
-KDR
D
NQIYLS
I
S
L
P
F----
--
-------
-
GNGGRVGYDMQ-NSSHSTTHRMSWN
D
TL--DER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQVSGNYQ
H
LSSA
G
EWDIS
G
T
YA
--
AN
-
DYSSVSSSW
SG
S
FTATQY
G
A
A
FHR
RSSTN
E
PRL
MV
S
T
D
-
G
VADIP
V
-
-
QGNLDY
T
N
HF
G
IA
V
V
P
LI
S
S
Y
Q
P
S
T
V
A
V
N
MNDLPDGVT
V
AE
N
VIKETWIE
GAI
GYKS
L
A
S
RS
G
KDVNVIIRNAS
G
QFP
P
L
G
ADIRQD
---
GSGISV
G
M
V
G
E
E
G
HA
W
L
S
G
VAENQQ
-
FT
V
V
WG
DSQHCS
L
HLPEHMEDTA
fig|749545.3.peg.1807
Escherichia coli MS 182-1 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAT
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYII
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|749532.3.peg.624
Escherichia coli MS 78-1 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYII
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|340185.3.peg.409
Escherichia coli E22 (8-808/822)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
T
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|340185.4.peg.449
Escherichia coli E22 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
T
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|340184.3.peg.3927
Escherichia coli B7A (8-808/822)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|340184.6.peg.4106
Escherichia coli B7A (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|562.375.peg.681
Escherichia coli EC4100B (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|656408.3.peg.3455
Escherichia coli H591 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|585034.4.peg.3127
Escherichia coli IAI1 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|585034.5.peg.3125
Escherichia coli IAI1 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|679207.4.peg.3758
Escherichia coli MS 107-1 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|679206.4.peg.1409
Escherichia coli MS 119-7 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|679204.3.peg.768
Escherichia coli MS 145-7 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|585395.4.peg.3888
Escherichia coli O103:H2 str. 12009 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|409438.11.peg.3494
Escherichia coli SE11 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|656443.3.peg.3991
Escherichia coli TA271 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|573235.3.peg.4243
Escherichia coli O26:H11 str. 11368 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|344601.3.peg.433
Escherichia coli B171 (8-808/822)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
D
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|344601.5.peg.431
Escherichia coli B171 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
D
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|340186.3.peg.493
Escherichia coli E110019 (8-808/822)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
GA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|340186.5.peg.514
Escherichia coli E110019 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
GA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|749547.3.peg.3987
Escherichia coli MS 187-1 (21-820/834)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNL----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQT
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWED
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|1040638.4.peg.2197
Escherichia coli O104:H4 str. LB226692 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SFSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
NYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|6666666.5357.peg.4212
Escherichia coli TY-2482 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SFSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
NYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|585055.6.peg.3494
Escherichia coli 55989 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SFSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
NYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|585055.8.peg.3497
Escherichia coli 55989 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SFSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
NYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|550672.3.peg.3288
Escherichia coli B088 (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--GRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|595495.4.peg.886
Escherichia coli KO11 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--GRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|566546.3.peg.562
Escherichia coli W (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--GRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|566546.3.peg.561
Escherichia coli W (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--GRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|566546.4.peg.3274
Escherichia coli W (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--GRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|331111.12.peg.3781
Escherichia coli E24377A (20-820/834)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
V
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|331111.3.peg.1197
Escherichia coli E24377A (8-808/822)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EVAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
V
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|585396.4.peg.4003
Escherichia coli O111:H- str. 11128 (23-823/837)
SLS
---
VLIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
N
A
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
L
T
PEQ
L
TLL
G
F
TDEII
-------
EEAQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
WD
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQANNLDFPRIYLF
R
PIPA
I
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
VS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNKISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
AEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIMGVIRLAD
G
SHP
P
L
G
ISVKDK
---
TSHKEL
G
L
V
A
D
G
C
FV
YL
N
G
IQDDSK
-
LT
L
R
WG
D-KSCF
I
QPPNSSN
fig|316401.4.peg.3765
Escherichia coli ETEC H10407 (24-823/837)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYI
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|358709.5.peg.1197
Escherichia coli 101-1 (21-820/834)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|656414.3.peg.3507
Escherichia coli H736 (24-823/837)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|749538.3.peg.3863
Escherichia coli MS 116-1 (24-823/837)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTSVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|656444.3.peg.4132
Escherichia coli TA280 (16-821/835)
L
---
LLSLAGAPAYA-----VD
FN
TDV
L
DAADRQN
---
I
D
F
S
RFSQAG
-
YIM
PG
Q
Y
Q
ME
I
MV
N
DQGIS
PS
AFP
V
TFL
EPP
VSGQDGKKP
L
PQA
C
L
T
PEM
V
SRM
GL
TVASQ
-------
EKVTYW
--
NNGQ
C
A
-
DLSQ-LPGVEIRP
N
PAEGM
L
Y
I
N
M
-
PQA
W
L
EYSDA
S
W
L
PP
SR
WD
-----
N
GI
PGLLF
D
YN
ING-TVNKPHK
--
--
-
---------------
GKQSQSLSYNGT
A
G
A
N
F
G
A
WRLR
A
DYQGNLNHTTGSVQ
--------
GTDSQFTWSRFYMY
R
AIPR
W
RAS
L
T
LG
E
NY
I
N
S
E
IF
S
S
WRYT
G
AS
L
E
SD
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
N
-
A
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
D-SSVR
G
R
L
D
V
E
V
I
E
Q
DG
RKKT
F
QV
D
T
A
Y
V
P
Y
L
T
R
P
G
QI
RY
KL
V
S
G
RSRN-YEHTTEGPV
F
AAGEASW
G
ISNKW
S
L
YGG
G
-
IVAGD
Y
N
A
LAV
G
L
G
RDLSEF
G
T
V
S
A
D
V
T
Q
S
V
A
R
I
P
-----
GEETK
-
Q
G
K
S
W
R
LS
Y
S
K
RFD
-
DVN
A
DIT
F
AG
YR
F
S
ERN
Y
M
T
MD
Q
YLNA-
-
-
--
-----------------RYRNDF-TGRE
K
ELYTV
T
LN
K
N
F
EDWKT
S
VNL
Q
YSH
Q
T
YW
DR
R
TSDY
Y
TLSV
N
RY
F
D
A
FGFKN
I
SL
G
L
S
ASRSKYQN
-----
-RDN
D
-SAFVR
L
S
V
P
W----
--
-------
-
G-TGTASYSGS-MSNDRYTNTVGYS
D
TL--NKG
L
S
SY
S
L
NA
G
VSSGGGQPSQSQMSAYYN
H
SSPL
A
--NLS
A
N
FS
AV
EN
-
GYTSFGMSA
SGG
ATITAK
G
A
A
L
H
A
GGMNG
G
TRL
L
V
D
T
D
-
G
VGGVP
V
-
-
DGGRVS
T
N
RW
G
IG
V
V
T
DV
S
S
Y
Y
R
N
T
T
S
V
D
LNKLPEDME
A
TR
S
VVESVLTE
GAI
GYRE
F
E
V
LK
G
SRLFAVLRLAD
N
SHP
PFG
ASVTNA
---
K-GREL
G
M
V
A
D
S
G
LA
W
L
S
G
VNPGET
-
LN
V
G
W
D
GRTQCV
V
DIPAKL
fig|413997.3.peg.3053
Escherichia coli B str. REL606 (21-820/834)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTNVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|511693.5.peg.3062
Escherichia coli BL21 (24-823/837)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTNVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|469008.4.peg.712
Escherichia coli BL21(DE3) (24-823/837)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTNVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|637912.3.peg.374
Escherichia coli OP50 (24-823/837)
LS
---
VIIIGCASAYA-----VE
FN
KDL
I
EAEDREN
---
V
N
LS
QFETDG
-
QLP
V
G
K
Y
S
L
ST
LI
N
NKRTP
-
I
HLD
L
QWV
---
LIDNQ----
-
TAV
C
V
T
PEQ
L
TLL
G
F
TDEFI
-------
EKTQQN
--
LIDG
C
Y
-
PIEK-EKQITTYL
D
KGKMQ
L
S
IS
A
-
PQA
W
L
KYKDA
NW
T
PP
EL
W
N
-----
H
GI
AGAFL
D
YN
LYA-SHYAPHQ
--
--
-
---------------
GDNSQNISSYGQ
A
G
V
N
L
G
A
WRLR
T
DYQ--YDQSFNNGK
--------
SQATNLDFPRIYLF
R
PIPA
M
NAK
L
T
I
G
Q
YD
T
E
S
S
IFDS
FHFS
G
IS
L
K
SD
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
N
-
A
K
VT
V
S
QN
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
F-NTLQ
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
QV
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
KPTNVGGDSLQQPF
F
WTGEFSW
G
WLNNV
S
L
YGG
S
V
LTNRD
Y
Q
S
LAA
G
V
G
FNLNSL
G
S
L
S
F
D
V
T
R
S
D
A
Q
L
H
-----
NQDKE
-
T
G
Y
S
Y
R
AN
Y
S
K
RFE
-
STG
S
QLT
F
AG
YR
F
S
DKN
F
V
T
MN
E
YIND-
-
-
--
-----------------TNHYTN-YQNE
K
ESYIV
T
FN
Q
Y
L
ESLRL
N
TYV
S
LAR
N
T
YW
DA
S
SNVN
Y
SLSL
S
RD
F
DI
GPLKN
V
ST
S
L
T
FSRINWEE
-----
-DNQ
D
-QLYLN
I
SIP
W----
--
-------
-
GTSRTLSYGMQRNQDNEISHTASWY
D
SS--DRN
-
N
S
W
S
V
SA
S
GDNDEFKDMKASLRASYQ
H
NTEN
G
RLYLS
G
T
SQ
--
RD
-
SYYSLNASW
N
G
S
FTATRH
G
A
A
FH
D
YSGSA
D
SRF
M
I
D
A
D
-
G
TEDIP
L
-
-
NNKRAV
T
N
RY
G
IG
V
I
P
SV
S
S
Y
I
T
T
S
L
S
V
D
TRNLPENVD
I
EN
S
VITTTLTE
GAI
GYAK
L
D
T
RK
G
YQIIGVIRLAD
G
SHP
P
L
G
ISVKDE
---
TSHKEL
G
L
V
A
D
G
G
FV
YL
N
G
IQDDNK
-
LA
L
R
WG
D-KSCF
I
QPPNSSN
fig|481805.3.peg.1408
Escherichia coli ATCC 8739 (16-841/881)
I
---
ALAISGSYSSVWAEDDIQ
F
D
SRF
L
ELKGDTK
---
I
DL
K
RFSSQG
-
YVE
PG
K
Y
N
L
Q
V
QL
N
KQPLA
-
E
EYD
I
YWY
---
AGEDDASK-
-
SYA
C
L
T
PEL
V
AQF
GL
KEDVA
-------
KNLQWS
--
HDAK
C
L
-
KSGQ-LEGMEIKA
D
LSQSA
L
V
IS
L
-
PQA
Y
L
EYTYP
D
W
D
PP
SR
WD
-----
D
GI
SGIVA
D
Y
S
INAQTRHEENG
--
--
-
---------------
GDDSNEISGNGT
V
G
V
N
L
G
P
WR
M
R
A
DWQTNYQHTRSNDD
-
DEEFSGD
DTQKKWEWSRYYAW
R
ALPS
L
KAK
L
A
LG
E
DY
L
N
S
DIFD
G
FNYV
G
GS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
-
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
G-DSVS
G
T
L
H
I
R
I
E
E
Q
N
G
QVQE
Y
DI
S
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
RPQE-WGHHVEGEF
F
SGAEASW
G
IANGW
S
L
YGG
A
-
LGDEN
Y
Q
S
AAL
G
V
G
RDLSTF
GA
V
A
F
D
V
T
H
S
H
T
K
L
D
KDTAY
GKGSL
-
D
G
N
S
F
R
VS
Y
S
K
DFD
-
QLN
S
RVT
F
AG
YR
F
S
EEN
F
M
T
MS
E
YLDA-
-
-
--
-----------------SDSGMVRTGND
K
EMYTA
T
YN
Q
N
F
RDAGV
S
VYL
N
YTR
H
T
YW
DR
E
EQTN
Y
NIML
S
HY
FNM
GSIRN
M
SV
S
L
T
GYRYEYDN
-----
-RAD
K
-GMYIS
L
S
M
P
W----
--
-------
-
GDNSTVSYNGN-YGSGTDSSQVGYF
S
RV--DDA
-
T
H
Y
Q
L
NV
G
TSD-----KHTSVDGYYS
H
DGSL
A
QVDLC
A
N
YH
--
EG
-
QYTSAGLSL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
VADVP
V
E
G
NGAAVY
T
N
MF
G
KA
V
V
S
DV
N
N
Y
Y
R
N
Q
A
Y
I
D
LNKLPENAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
A
V
IS
G
QKAMAVLRLQD
G
SHP
PFG
AEVKND
---
N-EQTV
G
L
V
D
D
D
G
NV
YL
A
G
VKPGEH
-
MR
V
F
W
S
GVAHCD
I
NLPDPLPADLFNGL
L
L
fig|481805.6.peg.1406
Escherichia coli ATCC 8739 (16-841/881)
I
---
ALAISGSYSSVWAEDDIQ
F
D
SRF
L
ELKGDTK
---
I
DL
K
RFSSQG
-
YVE
PG
K
Y
N
L
Q
V
QL
N
KQPLA
-
E
EYD
I
YWY
---
AGEDDASK-
-
SYA
C
L
T
PEL
V
AQF
GL
KEDVA
-------
KNLQWS
--
HDAK
C
L
-
KSGQ-LEGMEIKA
D
LSQSA
L
V
IS
L
-
PQA
Y
L
EYTYP
D
W
D
PP
SR
WD
-----
D
GI
SGIVA
D
Y
S
INAQTRHEENG
--
--
-
---------------
GDDSNEISGNGT
V
G
V
N
L
G
P
WR
M
R
A
DWQTNYQHTRSNDD
-
DEEFSGD
DTQKKWEWSRYYAW
R
ALPS
L
KAK
L
A
LG
E
DY
L
N
S
DIFD
G
FNYV
G
GS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
-
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
G-DSVS
G
T
L
H
I
R
I
E
E
Q
N
G
QVQE
Y
DI
S
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
RPQE-WGHHVEGEF
F
SGAEASW
G
IANGW
S
L
YGG
A
-
LGDEN
Y
Q
S
AAL
G
V
G
RDLSTF
GA
V
A
F
D
V
T
H
S
H
T
K
L
D
KDTAY
GKGSL
-
D
G
N
S
F
R
VS
Y
S
K
DFD
-
QLN
S
RVT
F
AG
YR
F
S
EEN
F
M
T
MS
E
YLDA-
-
-
--
-----------------SDSGMVRTGND
K
EMYTA
T
YN
Q
N
F
RDAGV
S
VYL
N
YTR
H
T
YW
DR
E
EQTN
Y
NIML
S
HY
FNM
GSIRN
M
SV
S
L
T
GYRYEYDN
-----
-RAD
K
-GMYIS
L
S
M
P
W----
--
-------
-
GDNSTVSYNGN-YGSGTDSSQVGYF
S
RV--DDA
-
T
H
Y
Q
L
NV
G
TSD-----KHTSVDGYYS
H
DGSL
A
QVDLC
A
N
YH
--
EG
-
QYTSAGLSL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
VADVP
V
E
G
NGAAVY
T
N
MF
G
KA
V
V
S
DV
N
N
Y
Y
R
N
Q
A
Y
I
D
LNKLPENAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
A
V
IS
G
QKAMAVLRLQD
G
SHP
PFG
AEVKND
---
N-EQTV
G
L
V
D
D
D
G
NV
YL
A
G
VKPGEH
-
MR
V
F
W
S
GVAHCD
I
NLPDPLPADLFNGL
L
L
fig|656417.3.peg.3017
Escherichia coli M605 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQSLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
TYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
SGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
EDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADEH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAH
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------SQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
VYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NTD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
N
RI--DDA
-
T
H
Y
Q
I
NV
G
TSE-----QHGSVDGYLS
H
DGTL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
INAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|585057.4.peg.2590
Escherichia coli IAI39 (4-835/883)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
TYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
EDDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADEH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQSMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MT
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|585057.6.peg.2593
Escherichia coli IAI39 (4-835/883)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
TYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
EDDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADEH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQSMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MT
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|749527.3.peg.1680
Escherichia coli MS 21-1 (4-835/883)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
TYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
EDDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHRTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADEH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQSMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MT
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|431946.3.peg.2313
Escherichia coli SE15 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQSLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
TYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
SGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
EDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADEH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------SQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
VYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NTD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
N
RI--DDA
-
T
H
Y
Q
I
NV
G
TSE-----QHGSVDGYLS
H
DGTL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGALVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
INAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|685038.3.peg.2402
Escherichia coli O83:H1 str. NRG 857C (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
V
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|439855.10.peg.2662
Escherichia coli SMS-3-5 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
TYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADEH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
V
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQSMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MT
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|362663.8.peg.2404
Escherichia coli 536 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|362663.9.peg.2409
Escherichia coli 536 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|405955.9.peg.2108
Escherichia coli APEC O1 (6-836/884)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|525281.3.peg.4146
Escherichia coli 83972 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|655817.3.peg.2790
Escherichia coli ABU 83972 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|199310.1.peg.2808
Escherichia coli CFT073 (6-836/884)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|199310.4.peg.2720
Escherichia coli CFT073 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|749546.3.peg.2772
Escherichia coli MS 185-1 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|749528.3.peg.2685
Escherichia coli MS 45-1 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|753642.3.peg.2745
Escherichia coli NC101 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KHPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|749550.3.peg.2880
Escherichia coli MS 200-1 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
EEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|585397.7.peg.2809
Escherichia coli ED1a (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KHPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|585397.9.peg.2806
Escherichia coli ED1a (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KHPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNADEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|340197.3.peg.3888
Escherichia coli F11 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
G
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|340197.5.peg.4063
Escherichia coli F11 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
G
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|340197.3.peg.4806
Escherichia coli F11 (6-836/884)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KQPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
VADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
G
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSVDGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|656440.3.peg.2410
Escherichia coli TA206 (4-834/882)
HSNFRLRGIACYI
---
ALAISGGSVNAWADDSIQ
F
D
PRF
L
ELKGDTK
---
I
DL
G
KFSKKG
-
YVD
A
G
K
Y
N
L
R
V
FI
N
KHPLS
-
D
EYD
I
NWY
---
VSENDPTK-
-
NYA
C
L
T
PEL
V
AAL
GL
KEGIA
-------
KSLQWT
--
HNDE
C
L
-
KPGQ-LDGMEVEN
D
LSQSA
L
L
LT
V
-
PQA
Y
L
EYTSS
D
W
D
PP
SR
WD
-----
D
GI
PGLIA
D
Y
S
LNAQTRHQEQG
--
--
-
---------------
GEDSHDISGNGT
V
G
A
N
L
G
A
WR
F
R
A
DWQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRYYAW
R
ALPS
L
KAK
L
S
LG
E
DY
L
N
S
DIFD
G
FNYI
G
SS
V
S
T
D
DQ
MLP
PNLR
G
Y
AP
D
V
S
G
V
A
HS
S
-
A
K
VTI
S
Q
M
G
RV
L
Y
ETQ
VP
A
GPF
R
I
Q
D
I
G-DSVS
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QV
RY
KV
M
M
G
RPED-WNHKTEGGF
F
SGGEASW
G
GADGW
S
L
YGG
A
-
LADKH
Y
Q
S
AAM
G
V
G
RDLAQF
GA
L
A
F
D
V
T
H
S
H
V
N
L
D
HDSAY
GKGKL
-
D
G
N
S
F
R
VS
Y
A
K
DFD
-
ELN
S
RVT
F
AG
YR
F
S
EKN
F
M
T
MS
E
YLDA-
-
-
--
-----------------NQSDMARTGND
K
EMYTI
T
YN
Q
N
F
AAAGV
S
IYL
N
YSH
R
T
YW
DR
P
EQTN
Y
NLMF
S
HY
FNM
GSIRN
M
SI
S
V
T
GYRYEYDD
-----
-NAD
K
-GMYLS
M
SIP
W----
--
-------
-
SDSSTVTYNGS-YGSGSDSSQVGYF
K
RV--DDA
-
T
H
Y
Q
V
NV
G
TSE-----QHGSADGYLS
H
DGSL
A
KVDLS
A
N
YH
--
EG
-
EYRSAGIAL
Q
GG
ATLTAH
G
G
A
L
HR
TQNMG
G
TRL
LI
D
A
D
-
G
IANVP
V
E
S
NGAPVY
T
N
MF
G
KA
V
V
A
DI
N
N
Y
Y
R
N
Q
A
Y
I
D
LNNLPEDAE
A
TQ
S
VVQATLTE
GAI
GYRK
F
K
V
IS
G
QKAMAVLRLRD
G
SYP
PFG
AEVKND
---
E-QQQV
GIV
D
D
E
G
NV
YL
A
G
VNAGEH
-
MM
V
F
W
E
GSAQCE
I
VLPKPLPAD
fig|749540.3.peg.1503
Escherichia coli MS 146-1 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RGDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|595496.3.peg.3195
Escherichia coli BW2952 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|536056.3.peg.513
Escherichia coli DH1 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|656414.3.peg.3692
Escherichia coli H736 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|83333.1.peg.3159
Escherichia coli K12 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|749538.3.peg.2251
Escherichia coli MS 116-1 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|749544.3.peg.1590
Escherichia coli MS 175-1 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|316407.3.peg.3095
Escherichia coli W3110 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|316385.5.peg.3344
Escherichia coli str. K-12 substr. DH10B (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|316385.7.peg.3415
Escherichia coli str. K-12 substr. DH10B (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|511145.12.peg.3311
Escherichia coli str. K-12 substr. MG1655 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|511145.6.peg.3296
Escherichia coli str. K-12 substr. MG1655 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HQFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
fig|749548.3.peg.4359
Escherichia coli MS 196-1 (5-778/793)
TLLAY
TIG
FAFSPPANADGIEIAAVD
F
D
RET
L
KSLGVD-
---
P
NI
S
HYFSRS
A
RFL
PG
E
Y
S
L
I
V
SV
N
GEKKG
--
NIA
T
RF-
---
-------DE
N
GDI
C
L
D
QAF
L
QQA
GL
KIPSE
-------
------
--
EKNG
C
Y
-
DYILSYPGTTITP
L
PNQEA
L
D
I
I
V
S
PQA
I
I
-----
--
-
-
P
IG
L
D
LTNAA
T
G
G
TAALL
NY
S
LMSSRAEFS--
--
--
-
---------------
NGSSDYSQAALE
G
G
I
N
I
N
D
W
M
LR
S
HHFLTQTNGTFSNQ
--------
-------NSSTYLQ
R
TFTD
L
KTL
M
R
A
G
E
VN
L
N
NS
V
L
E
G
ASIY
G
IE
I
A
P
D
NA
L
--
-QTS
G
SGV
Q
V
T
GIA
NT
S
Q
A
R
V
E
I
R
Q
Q
G
VL
I
H
SIL
VP
A
G
A
F
T
I
P
D
V
PVRNGN
S
DL
N
V
T
V
V
E
T
DG
SSHN
Y
IV
P
-
S
T
L
F
N
Q
H
V
E
S
FQ
G
Y
RF
A
I
G
RVDD---DYDESPW
V
ISASSGW
N
LTRWS
A
M
N
GG
V
-
IVAEN
Y
Q
A
AS-
-
I
R
SSLVPL
PD
L
T
V
S
S
Q
I
S
T
S
Q
-
-
-----
DTKDS
L
Q
G
Q
K
Y
R
LD
A
N
Y
NLP
F
SLG
L
TTS
L
TR
---
S
DRH
Y
R
E
LS
E
AIDD-
-
-
--
----------------------DYTDPT
K
STYAL
G
LN
W
S
N
SI-LG
G
FNI
S
GYK
T
Y
S
Y
DG
D
NDSS
-
NLNI
N
WN
---
KAFKH
A
TV
S
V
N
WQHQLSAS
---
EN
NEDD
G
DLFYVN
I
SIP
F----
--
-------
-
-GRSNTATLYTRHDDHKTHYGTGVM
G
VVSDE--
-
M
SY
Y
V
NA
E
RDHDERETSLNGS---IS
S
NLHY
T
QVSLA
AG
AS
--
GS
-
DSRTYNGTM
SGG
IAVHDQ
G
V
T
F
-
S
PWTIN
D
TFA
I
A
K
M
D
NN
IAGVR
I
T
S
QAGPVW
TD
FR
G
NA
V
I
P
SI
Q
P
W
R
T
S
G
V
E
I
D
TASLPKNVD
I
GN
G
TKMIKQGR
GA
V
GKVG
F
S
A
IT
Q
RRALLNITLSD
G
KKL
P
R
G
VAIEDS
---
-EGNYL
TTS
V
D
D
G
VV
F
L
N
N
IKPDMV
-
LD
I
K
--
-DEQQS
C
RIHLTFPED
Consen1
Primary consensus
MSYLNLRLmpqr
h
r
---
Fn
l
---
dls
-
pG
Y
vdi
N
--
i
---
-
c
t
l
Gl
-------
--
c
-
d
L
lt
-
PQa
l
gy
pP
Wd
-----
Gi
nYn
--
g
---------------
sG
N
g
WrLrn
--------
R
l
l
lGd
t
gdiFds
G
l
sd
MlP
gfaP
v
GiA
n
-
A
vti
QnG
iY
VppGpF
I
Dl
gdL
V
i
E
DG
f
p
ss
P
l
r
G
ry
t
G
F
G
t
YGG
-
Y
a
G
G
GA
S
D
t
a
s
l
-----
-
G
s
r
Y
K
-
t
l
yRys
f
t
d
-
--
-
n
Q
l
s
s
q
yW
s
-
g
---
i
s
s
-----
d
vsiP
--
-
g
-
sy
v
g
y
g
ys
--
-
sGg
G
tl
--
d
li
a
-
G
v
-
Td
G
v
p
t
Yr
n
v
ld
l
n
GAi
f
a
g
g
pfG
---
giV
d
g
yl
g
-
v
wg
c
l
A
CeK
Consen2
Secondary consensus
yqrn
r
h
d
qt
t
n
a
l
n
stt
p
q
v
a
v
n
n
is
i
nw
s
v
ns
-
l
q
h
i
v
e
f
s
v
sg
i
rs
y
nyt
i
v
lsv
s
v
sa
a
i
at
v
s
a
m
q
kf
s
l
s
n
s
t
f
q
s
w
ft
y
m
gy
k
i
n
r
w
a
s
vdi
e
n
kr
p
d
nw
s
a
g
s
fhr
t
i
n
i
d
n
l
r
n
tl
iq
s
s
i
t
s
i
id
i
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character