fig|1040638.4.peg.5238
Escherichia coli O104:H4 str. LB226692
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
LI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|6666666.5357.peg.236
Escherichia coli TY-2482
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
LI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNK
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585055.6.peg.558
Escherichia coli 55989
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
LI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SLNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585055.8.peg.559
Escherichia coli 55989
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
LI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SLNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|550672.3.peg.65
Escherichia coli B088
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656408.3.peg.477
Escherichia coli H591
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656443.3.peg.792
Escherichia coli TA271
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656419.3.peg.763
Escherichia coli M718
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GI
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|550676.3.peg.1341
Escherichia coli B185
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQ
V
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656414.3.peg.733
Escherichia coli H736
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTVLG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|413997.3.peg.514
Escherichia coli B str. REL606
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
H
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|511693.5.peg.520
Escherichia coli BL21
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
H
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|469008.4.peg.3231
Escherichia coli BL21(DE3)
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
H
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749547.3.peg.1388
Escherichia coli MS 187-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
H
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|637912.3.peg.683
Escherichia coli OP50
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
H
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585034.4.peg.532
Escherichia coli IAI1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
T
PGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585034.5.peg.531
Escherichia coli IAI1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
T
PGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|562.371.peg.2802
Escherichia coli 1044A
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|562.373.peg.3190
Escherichia coli 1125A
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|562.372.peg.1534
Escherichia coli 1212A
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|562.374.peg.5441
Escherichia coli 536A
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|83334.1.peg.666
Escherichia coli O157:H7
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444454.5.peg.5078
Escherichia coli O157:H7 str. EC4024
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444449.5.peg.5414
Escherichia coli O157:H7 str. EC4042
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444448.5.peg.3289
Escherichia coli O157:H7 str. EC4045
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444453.5.peg.630
Escherichia coli O157:H7 str. EC4076
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444452.5.peg.897
Escherichia coli O157:H7 str. EC4113
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444450.8.peg.749
Escherichia coli O157:H7 str. EC4115
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444451.5.peg.2250
Escherichia coli O157:H7 str. EC4196
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|444447.5.peg.3463
Escherichia coli O157:H7 str. EC4206
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|478004.5.peg.1457
Escherichia coli O157:H7 str. EC4401
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|478005.5.peg.2487
Escherichia coli O157:H7 str. EC4486
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|478006.5.peg.857
Escherichia coli O157:H7 str. EC4501
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|478007.5.peg.947
Escherichia coli O157:H7 str. EC508
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|478008.5.peg.1689
Escherichia coli O157:H7 str. EC869
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|637388.3.peg.1084
Escherichia coli O157:H7 str. FRIK2000
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|570506.3.peg.3621
Escherichia coli O157:H7 str. FRIK966
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|386585.9.peg.701
Escherichia coli O157:H7 str. Sakai
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|544404.4.peg.614
Escherichia coli O157:H7 str. TW14359
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|502346.5.peg.609
Escherichia coli O157:H7 str. TW14588
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|701177.3.peg.653
Escherichia coli O55:H7 str. CB9615
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
G
V
PAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SS
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|595495.4.peg.4191
Escherichia coli KO11
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
LI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|566546.3.peg.4370
Escherichia coli W
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
LI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|481805.3.peg.3310
Escherichia coli ATCC 8739
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|481805.6.peg.3298
Escherichia coli ATCC 8739
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656379.3.peg.1027
Escherichia coli FVEC1302
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656380.3.peg.785
Escherichia coli FVEC1412
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749549.3.peg.1409
Escherichia coli MS 198-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585056.7.peg.781
Escherichia coli UMN026
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|155864.1.peg.589
Escherichia coli O157:H7 EDL933
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVXWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|155864.8.peg.603
Escherichia coli O157:H7 EDL933
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTFV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVXWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|358709.5.peg.2019
Escherichia coli 101-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
F
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
H
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|679206.4.peg.14
Escherichia coli MS 119-7
MKIPTTTDIPQRYTWCL
------AGICYSSLAIXPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|550677.3.peg.978
Escherichia coli B354
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656444.3.peg.1029
Escherichia coli TA280
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|409438.11.peg.685
Escherichia coli SE11
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
S
CF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRV
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749540.3.peg.4794
Escherichia coli MS 146-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTVLG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|340186.3.peg.810
Escherichia coli E110019
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
S
CF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDE
L
IIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|340186.5.peg.845
Escherichia coli E110019
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
S
CF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDE
L
IIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
ATY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749537.3.peg.4076
Escherichia coli MS 115-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSKQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|216592.1.peg.293
Escherichia coli 042
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQRLQQAVTVIS
A
V
CTHPGS
fig|216592.3.peg.603
Escherichia coli 042
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQRLQQAVTVIS
A
V
CTHPGS
fig|573235.3.peg.574
Escherichia coli O26:H11 str. 11368
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
S
L
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
G
T
VS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749545.3.peg.3674
Escherichia coli MS 182-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
A
R
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
A
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749532.3.peg.2527
Escherichia coli MS 78-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
A
R
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
A
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749527.3.peg.4959
Escherichia coli MS 21-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GKXXWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|340184.3.peg.3201
Escherichia coli B7A
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
H
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
W
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
NGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|340184.6.peg.3341
Escherichia coli B7A
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
H
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
W
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
NGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|679204.3.peg.4273
Escherichia coli MS 145-7
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
H
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
W
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
NGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|331112.3.peg.566
Escherichia coli HS
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
S
L
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
G
T
VS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
L
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|331112.6.peg.592
Escherichia coli HS
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
S
L
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
G
T
VS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
L
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749531.3.peg.1514
Escherichia coli MS 69-1
MKIPTTTDIPQRY
S
WCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTH
S
GS
fig|585396.4.peg.585
Escherichia coli O111:H- str. 11128
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
S
L
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
G
T
VS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|679207.4.peg.4494
Escherichia coli MS 107-1
MKIPTTTDIPQRY
S
WCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
A
R
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
T
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDE
L
IIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|679205.4.peg.243
Escherichia coli MS 124-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
A
R
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
F
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
A
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
F
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749533.3.peg.1261
Escherichia coli MS 84-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
A
R
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
F
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
A
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
F
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585057.4.peg.528
Escherichia coli IAI39
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPW
S
LR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|585057.6.peg.527
Escherichia coli IAI39
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPW
S
LR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|656437.3.peg.630
Escherichia coli TA143
MKIPTTTDIPQRYTWCL
------AGICYSSLAISPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTVNTGDKSG-GLM
PCF
NQAL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTH
S
GS
fig|344610.3.peg.1538
Escherichia coli 53638
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
S
L
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
I
S
QI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
G
T
VS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGN
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|344610.7.peg.1141
Escherichia coli 53638
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
S
L
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
I
S
QI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
G
T
VS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGN
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
R
K
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|562.375.peg.3948
Escherichia coli EC4100B
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
A
R
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
IR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADD-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRY
L
T
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYEYDY
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
I
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVD
L
X
X
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
A
T
L
GV
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MPYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSIGIT
-
S
M
X
X
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|595496.3.peg.454
Escherichia coli BW2952
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|536056.3.peg.3256
Escherichia coli DH1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|83333.1.peg.529
Escherichia coli K12
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749538.3.peg.2524
Escherichia coli MS 116-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749544.3.peg.277
Escherichia coli MS 175-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|749548.3.peg.1617
Escherichia coli MS 196-1
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|316407.3.peg.515
Escherichia coli W3110
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|316385.5.peg.487
Escherichia coli str. K-12 substr. DH10B
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|316385.7.peg.494
Escherichia coli str. K-12 substr. DH10B
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|511145.12.peg.553
Escherichia coli str. K-12 substr. MG1655
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|511145.6.peg.546
Escherichia coli str. K-12 substr. MG1655
MKIPTTTDIPQRYTWCL
------AGICYSSLAILPSF-----LSYAESY
FNP
A
FL
LENGTSV
---
A
DL
S
RF
ERG
N
HQP
-
AG
V
Y
R
V
D
LW
R
N
DEFIGS
Q
-
DI
V
F
-----
ESTTENTGDKSG-GLM
PCF
NQVL
--
L
ERI
GL
NSSAFPEL
--------
AQQQ
N
NK
CI
-
NLLKA
V
PDATINF
D
FAAMR
L
N
I
T
IPQI
A
L
LSSAHG
Y
IP
P
EE
WD
E
GIPAL
LL
NYN
F
TG
----NRGN
G
-
---------------------
ND
S
YFFSE
L
-S
G
I
NIGPWRLR
NNGSW
N
YFRG
NG
------------
Y
--
HSEQ
W
NNIG
T
WVQ
RAI
IP
LKSEL
V
MGD
GN
T
G
S
---
DIFDG
VGF
RG
VR
L
Y
S
S
D
N
M
Y
P
D
SQ
Q
GFAP
T
V
R
GIA
R
TAA
Q
L
TI
R
QNG
FI
IYQS
Y
V
S
P
GAF
E
I
T
DL
HPT
S
SN
GDL
D
VTIDE
R
DG
NQ
Q
N
Y
TI
P
Y
S
T
VP
I
L
Q
R
E
G
RFKFD
L
TA
G
DF
R
SG
N
SQQ
--
SS
P
FFF
Q
GTALG
G
LPQEF
T
A
YGG
T
Q
-
LSAN
Y
T
A
F
L
L
GLG
R
NL
GNW
GAVS
L
DVTHA
RSQ
L
ADA-S
----
RHE
G
D
S
I
R
F
LYAK
SMNTFG
T
NFQ
L
M
GYRYST
QG
F
YTLD
D
V
A
Y
-----
---
------
RRMEGYE--Y
DY
DG
EHRDEPIIVNYH
NL
RFSR
K
DRLQL
N
V
SQ
S
L
---
N-DF
GSLY
I
S
G
T
HQK
YW
NTSDSDTWYQV
GY
T
SSWVGI
S
---
Y
SLS
F
S
WNESVGI
---------
PDN
E
RIVG
LN
V
SVP
FN
VLTKRRYT
R
ENALDR---AY
----
A
S
FNA
N
R
N
S
N
GQNSWLA
G
VG
GT
LL
E
G
H
-
NL
SY
H
V
-
--------
---
S
QGDTSNN
G
Y
TG
-
SATAN
W
Q
---
AAY
G
T
L
GG
GYNY
-DR
D
QHDVNWQLS
GG
--
V
VG
H
EN
GITLS
-
Q
P
L
---
G
D
T
NV
LI
K
APGA
G
G
VR
IE
-
N
QTGIL
TD
WR
G
YA
V
MLYA
T
V
Y
RY
NR
IA
LD
T
N
TMG
-
N
S
I
D
VEK
N
ISSV
VPTQGALV
R
A
N
F
D
T
RI
G
VRA
L
IT
V
TQ-G
GK
PV
PFG
S
---
L
V
R
E
NSTGIT
-
S
M
V
G
D
D
G
Q
VYL
S
G
APLS
G
E
L
L
V
Q
WG
DGANSR
C
IAH
Y
V
L
PKQSLQQAVTVIS
A
V
CTHPGS
fig|679207.4.peg.3440
Escherichia coli MS 107-1 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSAELY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVSGM
--------
NLLA
D
DA
C
V
-
PLTSI
I
HDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RS
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|656380.3.peg.59
Escherichia coli FVEC1412 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|656419.3.peg.49
Escherichia coli M718 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|749540.3.peg.3242
Escherichia coli MS 146-1 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|749549.3.peg.189
Escherichia coli MS 198-1 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|749545.3.peg.4255
Escherichia coli MS 182-1 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSAELY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVSGM
--------
NLLA
D
DA
C
V
-
PLTSM
I
HDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
E
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
IAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|749548.3.peg.575
Escherichia coli MS 196-1 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
V
PGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|83333.1.peg.4227
Escherichia coli K12 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
T
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|316407.3.peg.4145
Escherichia coli W3110 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGFFVRLVVACAFAAQAPLSSADLY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVAGM
--------
NLLA
D
DA
C
V
-
PLTTM
V
QDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
G
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DR
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
T
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|749531.3.peg.2134
Escherichia coli MS 69-1 (10-877/878)
QR
N
T
Q
CL
HIRKHRLAGLFVRLFVACAFAAQAPLSSAELY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NAYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRVQ
--
L
ASM
GL
NTASVSGM
--------
NLLA
D
DA
C
V
-
PLTSM
I
HDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
S
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DS
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
T
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|216593.1.peg.3674
Escherichia coli E2348/69 (8-877/878)
I
Y
QR
N
T
Q
CL
HIRKHRLAVFFVRLFVACAFAAQAPLSSAELY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVSGM
--------
NLLA
D
DA
C
V
-
PLTSM
I
HDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
S
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DS
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
A
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|574521.7.peg.4728
Escherichia coli O127:H6 str. E2348/69 (8-877/878)
I
Y
QR
N
T
Q
CL
HIRKHRLAVFFVRLFVACAFAAQAPLSSAELY
FNP
R
FL
ADDPQAV
---
A
DL
S
RF
ENG
Q
ELP
-
P
G
T
Y
R
V
D
IY
L
N
NGYMAT
R
-
D
V
T
F
-----
-----NTGDSEQ-GIV
PC
L
TRAQ
--
L
ASM
GL
NTASVSGM
--------
NLLA
D
DA
C
V
-
PLTSM
I
HDATAHL
D
VGQQR
L
N
L
T
IPQ
A
F
M
SNRARG
Y
IP
P
EL
WD
P
GI
N
A
G
LL
NYN
F
S
G
NSVQNRIG
S
-
---------------------
NS
H
YAYLN
L
QS
G
L
NIG
A
WRLR
DNTTW
S
YNSS
DS
------------
S
S
G
SKNK
W
QHIN
T
WLE
R
D
I
IP
L
R
S
R
L
T
L
GD
GY
T
Q
G
---
DIFDG
INF
RG
AQ
L
A
SDD
N
MLP
D
SQ
R
GFAP
V
I
H
GIA
R
GT
A
Q
VTI
K
QNG
YD
IY
N
S
T
VP
P
G
P
F
T
I
N
D
I
YAA
G
NS
GDL
Q
VTI
K
E
A
DG
ST
Q
I
F
TV
P
Y
SSVP
L
L
Q
R
E
G
HTRYS
I
TA
G
EY
R
SG
N
AQQ
--
EK
P
RFF
Q
STLLH
G
LPAGW
T
I
YGG
A
Q
-
LADR
Y
R
A
F
N
F
G
I
G
K
N
M
GAL
GA
L
S
V
D
M
T
Q
A
NST
L
PDD-S
----
QHD
G
Q
S
V
R
F
LY
N
K
SLNESG
T
NIQ
L
V
GYRYST
SG
Y
FNFA
D
T
T
Y
-----
---
------
SRMNGYNIET
Q
-
DG
VIQVK
P
KFTD
Y
Y
NL
AYNK
R
GKLQL
T
V
T
Q
Q
L
---
G-RT
S
T
LYLS
G
S
HQT
YW
GTSNVDEQFQA
G
L
N
TAFEDI
N
---
W
T
LS
Y
S
LTKNAWQ
---------
KGR
D
QMLA
LN
V
N
I
P
FS
-----
HWL
R
SDSKSQWRHAS
----
A
S
YSM
S
H
D
L
N
GRMTNLA
G
VY
GT
LL
E
D
N
-
NL
SY
S
V
Q
--------
TGY
A
GGGDGNS
G
S
TG
-
YATLN
Y
R
---
GGY
G
N
A
NI
GY
SH
-SD
D
IKQLYYGVS
GG
--
V
LA
H
AN
G
V
TL
G
-
Q
P
L
---
N
D
T
VV
L
V
K
APGA
K
D
AK
V
E
-
N
QTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
NR
VA
LD
T
N
TLA
-
D
N
V
D
LDN
A
VANV
VPT
R
GA
I
V
R
A
E
F
K
A
RV
G
IKL
L
MT
L
TH-N
N
K
PL
PFGA
---
M
V
T
S
ESSQSS
-
G
I
V
A
D
N
G
Q
VYL
S
G
MPLA
G
K
V
Q
V
K
WG
EEENAH
C
VAN
Y
Q
L
PPESQQQLLTQLS
A
E
C
fig|656419.3.peg.4577
Escherichia coli M718 (15-856/857)
ISVVAVAVASTF------SAHAGK
FNP
K
FL
-EDVQGV
GQH
V
DL
T
M
F
EKG
Q
EQQ
L
P
G
I
Y
R
V
S
VY
V
N
EQRMET
R
-
T
L
E
F
-----
KEATEAQRKAMGESLV
PC
L
SRTQ
--
L
AEM
G
V
RVESFPAL
--------
NLVP
A
EA
C
V
-
PFDEI
I
PQASSHF
D
FSEQK
L
V
L
S
F
PQ
A
A
M
HQVARG
T
VP
E
SL
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNSEYDST
G
S
SSSYVDDNGTVHHDDGKDTLK
SD
S
Y-YLN
L
RS
G
L
N
L
G
A
WRLR
NYSTW
S
HSGG
--------------
-
--
-KAQ
W
DNIG
T
SLS
RAI
IP
F
K
A
Q
L
T
MGD
TA
T
A
G
---
DIFD
S
VQM
RG
AM
L
A
SD
E
E
MLP
D
SQ
R
GFAP
I
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
Y
V
Q
P
GAF
E
I
N
DL
YPT
A
NS
GDL
T
V
I
I
K
E
A
DG
SE
Q
R
F
IQ
P
F
SSVP
I
F
Q
R
E
G
HLKYS
F
AA
G
EY
Q
AG
N
YDS
--
AS
P
RFG
Q
LDLIY
G
LPWGM
T
A
YGG
V
L
-
ISNN
Y
N
A
F
A
L
G
I
G
K
N
F
GYI
GA
I
S
I
DVT
Q
A
KSE
L
NND-R
----
DSQ
G
Q
S
Y
R
F
LY
S
K
SF-ESG
T
DFR
L
A
GYRYST
SG
F
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDYN
---------
--
RYHK
R
SEIQG
N
L
T
Q
Q
L
---
G-AY
GS
V
YL
N
L
T
QQD
YW
NDAGKQNTVSA
GY
N
GRIGKV
S
---
Y
S
IA
Y
S
WNKSPEW
---------
DES
D
RLWS
F
N
I
SVP
LG
--------
-
--------RAW
----
S
N
YRV
T
T
D
Q
D
GRTNQQV
G
VS
GT
LL
E
D
R
-
NL
SY
S
V
Q
--------
EGY
A
SNGVGNS
-
-
-
G
-
NANVG
Y
Q
---
GGS
G
N
V
NV
GY
S
Y
-GK
D
YRQLNYSVR
GG
--
V
IV
H
SE
G
V
TLS
-
Q
P
L
---
G
E
T
MT
LI
S
V
PGA
R
N
AR
V
V
-
N
NGGVQ
V
D
WM
G
NA
I
VPYA
M
P
Y
RE
N
E
IS
L
R
S
D
SLG
-
D
D
V
D
VEN
A
FQKV
VPT
R
GA
I
V
R
A
R
F
D
T
RV
G
YRV
L
MT
L
LRSA
G
S
PV
PFGA
T
A
T
L
I
T
D
KQNEVS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
R
V
L
I
K
WG
NDASQQ
C
VAP
Y
K
L
SLELKQGGIVPVS
A
N
C
fig|155864.1.peg.4437
Escherichia coli O157:H7 EDL933 (15-856/857)
ISVVAVAVASTF------SAHAGK
FNP
K
FL
-EDVQGV
GQH
V
DL
T
M
F
EKG
Q
EQQ
L
P
G
I
Y
R
V
S
VY
V
N
EQRMET
R
-
T
L
E
F
-----
KEATEAQRKAMGESLV
PC
L
SRTQ
--
L
AEM
G
V
RVESFPAL
--------
NLVS
A
EA
C
V
-
PFDEI
I
PLASSHF
D
FSEQK
L
V
L
S
F
PQ
A
A
M
HQVARG
T
VP
E
SL
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNSEYDST
G
S
SSSYVDDNGTVHHDDGKDTLK
SD
S
Y-YLN
L
RS
G
L
N
L
G
A
WRLR
NYSTW
S
HSGG
--------------
-
--
-KAQ
W
DNIG
T
SLS
RAI
IP
F
K
A
Q
L
T
MGD
TA
T
A
G
---
DIFD
S
VQM
RG
AM
L
A
SD
E
E
MLP
D
SQ
R
GFAP
I
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
Y
V
Q
P
GAF
E
I
N
DL
YPT
A
NS
GDL
T
V
I
I
K
E
A
DG
SE
Q
R
F
IX
P
F
SSVP
I
F
Q
R
E
G
HLKYS
F
AA
G
EY
Q
AG
N
YDS
--
AS
P
RFG
Q
LDLIY
G
LPWGM
T
A
YGG
V
L
-
ISNN
Y
N
A
F
T
L
G
I
G
K
N
F
GYI
GA
I
S
I
DVT
Q
A
KSE
L
NND-R
----
DSQ
G
Q
S
Y
R
F
LY
S
K
SF-ESG
T
DFR
L
A
GYRYST
SG
F
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDYN
---------
--
RYHK
R
SEIQG
N
L
T
Q
Q
L
---
G-AY
GS
V
YL
N
L
T
QQD
YW
NDAGKQNTVSA
GY
N
GRIGKV
S
---
Y
S
IA
Y
S
WNKSPEW
---------
DES
D
RLWS
F
N
I
SVP
LG
--------
-
--------RAW
----
S
N
YRV
T
T
D
Q
D
GRTNQQV
G
VS
GT
LL
E
D
R
-
NL
SY
S
V
Q
--------
EGY
A
SNGVGNS
-
-
-
G
-
NANVG
Y
Q
---
GGS
G
N
V
NV
GY
S
Y
-GK
D
YRQLNYSVR
GG
--
V
IV
H
SE
G
V
TLS
-
Q
P
L
---
G
E
T
MT
LI
S
V
PGA
R
N
AR
V
V
-
N
NGGVQ
V
D
WM
G
NA
I
VPYA
M
P
Y
RE
N
E
IS
L
R
S
D
SLG
-
D
D
V
D
VEN
A
FQKV
VPT
R
GA
I
V
R
A
R
F
D
T
RV
G
YRV
L
MT
L
LRSA
G
S
PV
PFGA
T
A
T
L
I
T
D
KQNEVS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
R
V
L
I
K
WG
NDASQQ
C
VAP
Y
K
L
SLELKQGGIIPVS
A
N
C
fig|701177.3.peg.4309
Escherichia coli O55:H7 str. CB9615 (15-856/857)
ISVVAVAVASTF------SAHAGK
FNP
K
FL
-EDVQGV
GQH
V
DL
T
M
F
EKG
Q
EQQ
L
P
G
I
Y
R
V
S
VY
V
N
EQRMET
R
-
T
L
E
F
-----
KEATEAQRKAMGESLV
PC
L
SRTQ
--
L
AEM
G
V
RVESFPAL
--------
NLVS
A
EA
C
V
-
PFDEI
I
PLASSHF
D
FSEQK
L
V
L
S
F
PQ
A
A
M
HQVARG
T
VP
E
SL
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNSEYDST
G
S
SSSYVDDNGTVHHDDGKDTLK
SD
S
Y-YLN
L
RS
G
L
N
L
G
A
WRLR
NYSTW
S
HSGG
--------------
-
--
-KAQ
W
DNIG
T
SLS
RAI
IP
F
K
A
Q
L
T
MGD
TA
T
A
G
---
DIFD
S
VQM
RG
AM
L
A
SD
E
E
MLP
D
SQ
R
GFAP
I
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
Y
V
Q
P
GAF
E
I
N
DL
YPT
A
NS
GDL
T
V
I
I
K
E
A
DG
SE
Q
R
F
IQ
P
F
SSVP
I
F
Q
R
E
G
HLKYS
F
AA
G
EY
Q
AG
N
YDS
--
AS
P
RFG
Q
LDLIY
G
LPWGM
T
A
YGG
V
L
-
ISNN
Y
N
A
F
A
L
G
I
G
K
N
F
GYI
GA
I
S
I
DVT
Q
A
KSE
L
NND-R
----
DSQ
G
Q
S
Y
R
F
LY
S
K
SF-ESG
T
DFR
L
A
GYRYST
SG
F
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDYN
---------
--
RYHK
R
SEIQG
N
L
T
Q
Q
L
---
G-AY
GS
V
YL
N
L
T
QQD
YW
NDAGKQNTVSA
GY
N
GRIGKV
S
---
Y
S
IA
Y
S
WNKSPEW
---------
DES
D
RLWS
F
N
I
SVP
LG
--------
-
--------RAW
----
S
N
YRV
T
T
D
Q
D
GRTNQQV
G
VS
GT
LL
E
D
R
-
NL
SY
S
V
Q
--------
EGY
A
SNGVGNS
-
-
-
G
-
NANVG
Y
Q
---
GGS
G
N
V
NV
GY
S
Y
-GK
D
YRQLNYSVR
GG
--
V
IV
H
SE
G
V
TLS
-
Q
P
L
---
G
E
T
MT
LI
S
V
PGA
R
N
AR
V
V
-
N
NGGVQ
V
D
WM
G
NA
I
VPYA
M
P
Y
RE
N
E
IS
L
R
S
D
SLG
-
D
D
V
D
VEN
A
FQKV
VPT
R
GA
I
V
R
A
R
F
D
T
RV
G
YRV
L
MT
L
LRSA
G
S
PV
PFGA
T
A
T
L
I
T
D
KQNEVS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
R
V
L
I
K
WG
NDASQQ
C
VAP
Y
K
L
SLELKQGGIIPVS
A
N
C
fig|685038.3.peg.3604
Escherichia coli O83:H1 str. NRG 857C (13-841/843)
AHFSFSLLALTIASAL------PAYGGK
FNP
K
FL
-ENVQGI
DQH
V
DL
S
V
Y
DSP
V
GQQ
I
P
G
K
Y
R
V
F
VF
V
N
EEKMAS
R
-
T
L
D
F
-----
STASEAQRKASGESLM
PC
L
SRVQ
--
L
EEM
G
V
RIDSFPAL
--------
KILP
P
EA
C
V
-
AFDEI
I
PQATSRF
D
FNTQT
L
H
L
T
F
PQ
A
A
M
MMTARG
T
VD
P
SR
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNGRNEGS
G
S
SPDST
----------------
SD
S
Y-YLN
L
RS
G
L
N
V
GPWRLR
NNSIW
N
RTDG
--------------
-
--
-KNQ
W
DNVG
T
SLN
RAI
IP
LKS
Q
I
T
L
GD
TA
T
P
G
---
E
IFD
S
VQM
RG
TL
L
A
SDD
E
MLP
D
SQ
R
GFAP
V
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
F
V
Q
P
GAF
E
I
N
DL
YAT
S
GS
GDL
T
V
I
I
K
E
S
DG
SE
Q
R
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YLKYS
L
AA
G
EY
R
AG
N
YDS
--
GK
P
RFG
Q
FTAMY
G
LPWGM
T
A
YGG
A
L
-
LSAD
Y
N
A
L
A
L
GLG
K
N
F
GTI
GAVS
V
DVT
Q
A
KSQ
L
RNN-E
----
KDE
G
Q
S
Y
R
F
LY
S
K
SF-EGG
T
DLR
L
L
GY
K
YST
SG
Y
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDY
R
---------
--
RYHK
R
SQIQG
N
I
T
Q
Q
L
---
G-DY
GS
V
Y
F
N
M
T
QQD
YW
NVDGKENSLSA
GY
H
GHIGRV
N
---
Y
S
IA
Y
T
WTRSPEW
---------
DED
D
RLWS
F
S
L
S
I
P
LG
--------
-
--------GAW
----
G
S
YRM
T
T
D
Q
N
GKTSQQA
S
VS
GT
LL
E
D
R
-
NL
N
Y
N
V
Q
--------
QGY
T
SNGVGNS
-
-
-
G
-
SVNMG
Y
M
---
GGS
G
N
I
DV
GYNY
-SK
D
NQQVNYGVR
GG
--
V
IV
H
SE
GITLS
-
Q
P
L
---
G
E
S
LA
IV
S
APGA
R
G
GH
V
V
-
N
SSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
TI
VE
L
R
S
D
TLG
-
Q
N
V
E
LQE
A
FQKV
VPT
R
GA
I
V
R
S
R
F
D
T
RV
G
YRV
L
MS
L
KRAN
G
N
AV
PFGA
T
A
A
L
-
S
D
ESKPAS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
E
L
Q
V
S
WG
HEQAQR
C
RVP
F
R
L
PEKKDNSGIVMVN
A
V
C
fig|585397.7.peg.4224
Escherichia coli ED1a (13-841/843)
AHFSFSLLALTIASAL------PAYGGK
FNP
K
FL
-ENVQGI
DQH
V
DL
S
V
Y
DFP
V
GQQ
I
P
G
K
Y
R
V
F
VF
V
N
EEKMAS
R
-
T
L
D
F
-----
STASEAQRKASGESLM
PC
L
SRVQ
--
L
EEM
G
V
RVDSFPAL
--------
KILP
P
EA
C
V
-
AFDEI
I
PQATSRF
D
FNTQT
L
H
L
T
F
PQ
A
A
M
MMTARG
T
VD
P
SR
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNGRNEGS
G
S
SPDST
----------------
SD
S
Y-YLN
L
RS
G
L
N
V
GPWRLR
NNSIW
N
RTDG
--------------
-
--
-KNQ
W
DNVG
T
SLN
RAI
IP
LKS
Q
I
T
L
GD
TA
T
P
G
---
E
IFD
S
VQM
RG
AL
L
A
SDD
E
MLP
D
SQ
R
GFAP
V
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
F
V
Q
P
GAF
E
I
N
DL
YAT
S
GS
GDL
T
V
I
I
K
E
S
DG
SE
Q
R
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YLKYS
L
AA
G
EY
R
AG
N
YDS
--
GK
P
RFG
Q
FTAMY
G
LPWGM
T
A
YGG
A
L
-
LSAD
Y
N
A
L
A
L
GLG
K
N
F
GTI
GAVS
V
DVT
Q
A
KSQ
L
RNN-E
----
KDE
G
Q
S
Y
R
F
LY
S
K
SF-EGG
T
DLR
L
L
GY
K
YST
SG
Y
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDY
R
---------
--
RYHK
R
SQIQG
N
I
T
Q
Q
L
---
G-DY
GS
V
Y
F
N
M
T
QQD
YW
NVDGKENSLSA
GY
H
GHIGRV
N
---
Y
S
IA
Y
T
WTRSPEW
---------
DED
D
RLWS
F
S
L
S
I
P
LG
--------
-
--------GAW
----
G
S
YRM
T
T
D
Q
N
GKTSQQA
S
VS
GT
LL
E
D
R
-
NL
N
Y
N
V
Q
--------
QGY
T
SNGVGNS
-
-
-
G
-
SVNMG
Y
M
---
GGS
G
N
I
DV
GYNY
-SK
D
NQQVNYGVR
GG
--
V
IV
H
SE
GITLS
-
Q
P
L
---
G
E
S
LA
IV
S
APGA
R
G
GH
V
V
-
N
SSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
TI
VE
L
R
S
D
TLG
-
Q
N
V
E
LQE
A
FQKV
VPT
R
GA
I
V
R
S
R
F
D
T
RV
G
YRV
L
MS
L
KRAN
G
N
AV
PFGA
T
A
A
L
-
S
D
ESKPAS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
E
L
Q
V
S
WG
HEQAQR
C
RVP
F
R
L
PEKKDNSGIVMVN
A
V
C
fig|585397.9.peg.4221
Escherichia coli ED1a (13-841/843)
AHFSFSLLALTIASAL------PAYGGK
FNP
K
FL
-ENVQGI
DQH
V
DL
S
V
Y
DFP
V
GQQ
I
P
G
K
Y
R
V
F
VF
V
N
EEKMAS
R
-
T
L
D
F
-----
STASEAQRKASGESLM
PC
L
SRVQ
--
L
EEM
G
V
RVDSFPAL
--------
KILP
P
EA
C
V
-
AFDEI
I
PQATSRF
D
FNTQT
L
H
L
T
F
PQ
A
A
M
MMTARG
T
VD
P
SR
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNGRNEGS
G
S
SPDST
----------------
SD
S
Y-YLN
L
RS
G
L
N
V
GPWRLR
NNSIW
N
RTDG
--------------
-
--
-KNQ
W
DNVG
T
SLN
RAI
IP
LKS
Q
I
T
L
GD
TA
T
P
G
---
E
IFD
S
VQM
RG
AL
L
A
SDD
E
MLP
D
SQ
R
GFAP
V
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
F
V
Q
P
GAF
E
I
N
DL
YAT
S
GS
GDL
T
V
I
I
K
E
S
DG
SE
Q
R
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YLKYS
L
AA
G
EY
R
AG
N
YDS
--
GK
P
RFG
Q
FTAMY
G
LPWGM
T
A
YGG
A
L
-
LSAD
Y
N
A
L
A
L
GLG
K
N
F
GTI
GAVS
V
DVT
Q
A
KSQ
L
RNN-E
----
KDE
G
Q
S
Y
R
F
LY
S
K
SF-EGG
T
DLR
L
L
GY
K
YST
SG
Y
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDY
R
---------
--
RYHK
R
SQIQG
N
I
T
Q
Q
L
---
G-DY
GS
V
Y
F
N
M
T
QQD
YW
NVDGKENSLSA
GY
H
GHIGRV
N
---
Y
S
IA
Y
T
WTRSPEW
---------
DED
D
RLWS
F
S
L
S
I
P
LG
--------
-
--------GAW
----
G
S
YRM
T
T
D
Q
N
GKTSQQA
S
VS
GT
LL
E
D
R
-
NL
N
Y
N
V
Q
--------
QGY
T
SNGVGNS
-
-
-
G
-
SVNMG
Y
M
---
GGS
G
N
I
DV
GYNY
-SK
D
NQQVNYGVR
GG
--
V
IV
H
SE
GITLS
-
Q
P
L
---
G
E
S
LA
IV
S
APGA
R
G
GH
V
V
-
N
SSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
TI
VE
L
R
S
D
TLG
-
Q
N
V
E
LQE
A
FQKV
VPT
R
GA
I
V
R
S
R
F
D
T
RV
G
YRV
L
MS
L
KRAN
G
N
AV
PFGA
T
A
A
L
-
S
D
ESKPAS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
E
L
Q
V
S
WG
HEQAQR
C
RVP
F
R
L
PEKKDNSGIVMVN
A
V
C
fig|656440.3.peg.3823
Escherichia coli TA206 (13-841/843)
AHFSFSLLALTIASAL------PAYGGK
FNP
K
FL
-ENVQGI
DQH
V
DL
S
V
Y
DSP
V
GQQ
I
P
G
K
Y
R
V
F
VF
V
N
EEKMAS
R
-
T
L
D
F
-----
STASEAQRKASGESLM
PC
L
SRVQ
--
L
EEM
G
V
RIDSFPAL
--------
KILP
P
EA
C
V
-
AFDEI
I
PQATSRF
D
FNTQT
L
H
L
T
F
PQ
A
A
M
MMTARG
T
VD
P
SR
WD
E
GIPAL
LL
D
Y
S
F
S
G
SNGRNEGS
G
S
SPDST
----------------
SN
S
Y-YLN
L
RS
G
L
N
V
GPWRLR
NNSIW
N
RTDG
--------------
-
--
-KNQ
W
DNVG
T
SLN
RAI
IP
LKS
Q
I
T
L
GD
TA
T
P
G
---
E
IFD
S
VQM
RG
AL
L
A
SDD
E
MLP
D
SQ
R
GFAP
V
V
R
GIA
K
S
N
A
E
V
S
I
E
QNG
YV
IY
R
T
F
V
Q
P
GAF
E
I
N
DL
YAT
S
GS
GDL
T
V
I
I
K
E
S
DG
SE
Q
R
F
IQ
P
F
S
A
V
A
I
F
Q
R
E
G
YLKYS
L
AA
G
EY
R
AG
N
YDS
--
GK
P
RFG
Q
FTAMY
G
LPWGM
T
A
YGG
A
L
-
LSAD
Y
N
A
L
A
L
GLG
K
N
F
GTI
GAVS
V
DVT
Q
A
KSQ
L
RNN-E
----
KDE
G
Q
S
Y
R
F
LY
S
K
SF-EGG
T
DLR
L
L
GY
K
YST
SG
Y
YTFQ
E
A
T
-
-----
---
------
------DVRS
D
A
D
SDY
R
---------
--
RYHK
R
SQIQG
N
I
T
Q
Q
L
---
G-DY
GS
V
Y
F
N
M
T
QQD
YW
NVDGKENSLSA
GY
H
GHIGRV
N
---
Y
S
IA
Y
T
WTRSPEW
---------
DED
D
RLWS
F
S
L
S
I
P
LG
--------
-
--------GAW
----
G
S
YRM
T
T
D
Q
N
GKTSQQA
S
VS
GT
LL
E
D
R
-
NL
N
Y
N
V
Q
--------
QGY
T
SNGVGNS
-
-
-
G
-
SVNMG
Y
M
---
GGS
G
N
I
DV
GYNY
-SK
D
NQQVNYGVR
GG
--
V
IV
H
SE
GITLS
-
Q
P
L
---
G
E
S
LA
IV
S
APGA
R
G
GH
V
V
-
N
SSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
TI
VE
L
R
S
D
TLG
-
Q
N
V
E
LQE
A
FQKV
VPT
R
GA
I
V
R
S
R
F
D
T
RV
G
YRV
L
MS
L
KRAN
G
N
AV
PFGA
T
A
A
L
-
S
D
ESKPAS
-
S
I
V
G
E
E
G
Q
L
Y
I
S
G
MPEE
G
E
L
Q
V
S
WG
HEQAQR
C
RVP
F
R
L
PEKKDNSGIVMVN
A
V
C
fig|585057.4.peg.4465
Escherichia coli IAI39 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
RV
G
YRV
L
FR
V
AGTQ
GK
PA
PFGA
IA
-
T
V
Q
N
TSSADS
-
G
I
V
G
D
Q
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENVATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|585057.6.peg.4475
Escherichia coli IAI39 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
RV
G
YRV
L
FR
V
AGTQ
GK
PA
PFGA
IA
-
T
V
Q
N
TSSADS
-
G
I
V
G
D
Q
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENVATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|439855.10.peg.4236
Escherichia coli SMS-3-5 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
RV
G
YRV
L
FR
V
AGTK
GK
PA
PFGA
IA
-
T
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
I
SIPESESGLIEQG
V
T
fig|749527.3.peg.5299
Escherichia coli MS 21-1 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
TQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
RV
G
YRV
L
FR
V
AGTQ
GK
PA
PFGA
IA
-
T
V
Q
N
TSSADS
-
G
I
V
G
D
Q
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENVATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|656379.3.peg.4582
Escherichia coli FVEC1302 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|656380.3.peg.4867
Escherichia coli FVEC1412 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|749549.3.peg.383
Escherichia coli MS 198-1 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|656437.3.peg.4212
Escherichia coli TA143 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|585056.7.peg.4427
Escherichia coli UMN026 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|749545.3.peg.3466
Escherichia coli MS 182-1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
V
S
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|749532.3.peg.99
Escherichia coli MS 78-1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
V
S
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|656393.3.peg.4901
Escherichia coli H299 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
SKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
VQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
E
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHA-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
RV
G
YRV
L
FR
V
AGTK
GK
PA
PFGA
IA
-
T
V
Q
N
TSSADS
-
G
I
V
G
D
Q
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|216592.1.peg.1077
Escherichia coli 042 (9-841/843)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
GTY
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|216592.3.peg.4233
Escherichia coli 042 (9-841/843)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
GTY
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|340186.3.peg.2411
Escherichia coli E110019 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
MF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|340186.5.peg.2489
Escherichia coli E110019 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
MF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|749531.3.peg.1891
Escherichia coli MS 69-1 (9-838/840)
GLTAGTCLIFSQSLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
F
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
VAIK
KT
L
TTF
G
V
KVDALKSL
--------
NDVD
E
TV
CI
-
DPGPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
SR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TTDG
--------------
-
--
-SAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
K
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVI
Q
AQILH
G
FPYGF
T
L
YGG
M
Q
-
AAEK
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAK
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNNPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNN
N
ASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GTY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FK
V
VNAK
GK
PA
PFGA
IA
-
A
V
Q
N
TSSADS
-
G
I
V
G
D
L
G
E
L
YL
S
G
LPEK
G
Q
V
M
L
S
WG
ENAATT
C
TFD
Y
S
L
SIPESESGLIEQG
V
T
fig|6666666.5357.peg.3363
Escherichia coli TY-2482 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|585055.6.peg.4233
Escherichia coli 55989 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|585055.8.peg.4236
Escherichia coli 55989 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|550672.3.peg.3806
Escherichia coli B088 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|340184.3.peg.835
Escherichia coli B7A (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|340184.6.peg.869
Escherichia coli B7A (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|562.375.peg.261
Escherichia coli EC4100B (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|656408.3.peg.4190
Escherichia coli H591 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|585034.4.peg.3824
Escherichia coli IAI1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|585034.5.peg.3821
Escherichia coli IAI1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|595495.4.peg.2724
Escherichia coli KO11 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|679205.4.peg.4094
Escherichia coli MS 124-1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|749533.3.peg.1067
Escherichia coli MS 84-1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|585396.4.peg.4721
Escherichia coli O111:H- str. 11128 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|566546.3.peg.3263
Escherichia coli W (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|566546.4.peg.3985
Escherichia coli W (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|331111.12.peg.4478
Escherichia coli E24377A (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPALR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|331111.3.peg.1880
Escherichia coli E24377A (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPALR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|679204.3.peg.2849
Escherichia coli MS 145-7 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
R
TGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|409438.11.peg.4189
Escherichia coli SE11 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSLS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|573235.3.peg.4997
Escherichia coli O26:H11 str. 11368 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
V
S
T
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
PV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|749537.3.peg.278
Escherichia coli MS 115-1 (4-838/840)
TRIVVGLTAGTCLIFSQNLMAEVSV
FNP
A
L
L
EIDHQSG
---
V
D
I
R
Q
F
NRA
N
LMP
-
P
G
V
Y
S
V
D
IF
I
N
GKMFER
Q
-
D
V
T
F
-----
------VQDNPDADLH
A
CF
IAIK
KT
L
SSF
G
I
KVDALKSF
--------
NDVD
E
TV
C
L
-
DPAPR
I
EGSSWQF
D
SDKLQ
L
N
I
S
IPQI
Y
M
DAMAYD
Y
IS
P
TR
WD
E
GI
N
AL
TI
NY
D
F
S
G
SHTLRSDY
G
S
Q
--------------------
ET
D
TSYLN
L
RN
G
L
NIGPWRLR
NYSTL
N
TSDG
--------------
-
--
-RAE
Y
NSIS
T
WIQ
R
D
I
AA
L
R
S
Q
I
M
I
GD
TW
T
A
S
---
DIFD
S
TQI
RG
AR
L
Y
T
D
N
D
MLP
A
SQ
N
GFAP
V
V
R
GIA
K
S
N
A
T
V
I
I
R
QNG
YV
IYQS
A
VP
Q
GAF
E
I
T
DL
NTA
S
TG
GDL
D
VTI
K
E
E
DG
SE
Q
R
F
TQ
P
Y
A
S
L
A
I
L
K
R
E
G
QTDVD
V
SV
G
EL
R
--
D
EDG
--
FT
P
DVL
Q
AQILH
G
FSHGI
T
L
YGG
M
Q
-
AAEN
Y
G
S
A
A
L
G
V
G
K
D
L
GAL
GA
I
S
F
DVTHA
RAN
F
SHD-D
----
TET
G
Q
S
Y
R
F
LY
S
K
RFDDTD
T
SLR
L
V
GYRYST
EG
Y
YTLN
E
W
A
S
-----
---
------
RRNSPEDFWE
--
--
------------
--
TGNR
R
SRVEG
T
L
T
Q
S
L
---
GRDY
G
N
LYL
T
L
S
RQQ
YW
HTDDVERLMQF
GYS
SSWKRL
S
---
W
NV
S
W
S
YSNTARQ
GTGNNHASD
NTS
E
QIYM
L
S
L
SVP
LS
--------
-
----GWWGNSY
----
A
T
YSV
S
Q
N
D
N
SGSSHQL
G
LS
GT
AL
E
R
N
-
NL
S
W
N
L
M
--------
QSY
N
SHDDEVG
-
-
-
G
-
NMSLT
Y
D
---
GSY
G
T
V
NG
S
YNY
-SQ
N
SQRLNYGIR
GG
--
I
LA
H
SE
G
V
TLS
-
Q
E
L
---
G
E
T
IA
L
V
K
APGA
A
G
LE
I
D
-
N
MRGAA
TD
WR
G
YT
V
KTQL
N
P
Y
DE
NR
VA
IS
D
N
YFS
K
S
N
I
E
LDN
T
VVTM
VPT
R
GA
V
V
K
A
E
F
V
T
HV
G
YRV
L
FR
V
LNAN
GK
LV
PFGA
IA
-
A
I
Q
D
ASLADS
-
G
I
V
G
D
R
G
E
L
YL
S
G
LPEK
G
Q
V
T
L
S
WG
ENASTK
C
IFN
Y
S
L
STPESESGLIEQG
V
T
fig|216593.1.peg.3415
Escherichia coli E2348/69 (3-812/825)
LPMHRTFVLTGIIFALSAVYSLSYARDE
FN
L
R
I
L
ELDSPLE
N
TQ
V
-
L
A
D
F
INN
N
NLT
-
P
G
V
Y
L
T
S
V
M
W
G
QDSLDK
R
-
N
I
T
F
-----
------VLSSDKKSLI
P
R
F
TKAD
--
L
REF
GL
KVDVIPAL
--------
KVMN
D
DT
E
V
G
DIAQI
I
DGARYDF
Q
LDSQT
L
W
L
R
IPQI
Y
Q
NAIAAG
S
IA
P
KY
W
N
D
G
ES
A
A
WL
S
Y
Y
A
S
G
SRQNSDGD
-
-
---------------------
NL
S
SNWLN
L
NS
G
I
N
L
G
A
WRLR
NNTVY
N
----
--------------
-
--
-ESN
W
ESIS
T
SLQ
R
D
I
KA
L
R
S
QM
E
I
G
Q
TF
T
N
G
---
D
L
FD
S
VQM
T
G
IK
L
E
T
D
T
S
MLP
D
S
E
Q
GFAP
V
V
R
V
IA
N
S
D
A
Q
V
V
I
K
QNG
YV
IYQ
T
W
V
S
A
G
P
F
E
I
K
DL
SQV
T
AG
S
DL
E
VTI
K
E
T
N
G
QE
H
S
F
IQ
A
S
S
T
VP
I
L
Q
R
E
G
ALKYS
L
AA
G
KY
R
DS
D
NNA
--
ET
P
VFG
V
ATAIY
G
LPYGI
T
I
YGG
I
L
-
GASM
Y
H
S
G
V
T
G
I
G
A
D
L
GRL
G
S
VS
V
D
I
T
A
A
KTK
F
DDGRD
----
DAT
G
L
S
W
R
A
Q
YAK
DFPDTD
T
TVT
L
A
S
YRYST
SQ
F
YTFQ
E
A
L
D
-----
---
------
QRDTPDD---
--
--
------
KGIYSYRQ
TNNR
R
NRLQI
N
L
SQ
N
I
---
G-RW
GS
V
YL
N
G
Y
QQD
YW
GMHGAERSIGM
GYS
TTWNNI
N
---
W
S
VN
Y
T
LTKTPGM
---------
-TG
E
QQFS
L
T
L
N
I
P
LS
--------
-
----RWLPDSW
----
A
M
YNV
N
R
S
D
K
SNTSHQL
G
IG
GT
AL
Q
D
N
-
NL
SY
N
L
Q
--------
QSY
T
DNNVGYG
-
-
-
A
-
SMNGR
Y
R
---
SSV
G
E
F
GL
GY
S
Y
-DK
N
SRQWNYSAQ
G
A
--
V
VA
H
AH
G
V
TL
G
-
Q
S
V
---
Q
D
S
FA
IV
H
INEG
A
N
VK
V
Q
-
N
AQGVY
TD
FW
G
NA
I
VPNM
T
N
Y
RH
N
A
IT
V
-
-
N
TQG
H
D
S
L
D
ISD
A
TQDV
I
P
SK
GA
V
V
G
V
D
F
D
A
RS
G
MRA
L
LT
L
VH-N
K
E
RV
PFGA
LLT
L
-
-
-
--GNST
-
A
I
V
G
E
D
G
E
VY
I
T
G
VQES
M
T
F
T
V
Q
WG
KEINQQ
C
TGV
I
T
E
PEK
fig|574521.7.peg.162
Escherichia coli O127:H6 str. E2348/69 (3-812/825)
LPMHRTFVLTGIIFALSAVYSLSYARDE
FN
L
R
I
L
ELDSPLE
N
TQ
V
-
L
A
D
F
INN
N
NLT
-
P
G
V
Y
L
T
S
V
M
W
G
QDSLDK
R
-
N
I
T
F
-----
------VLSSDKKSLI
P
R
F
TKAD
--
L
REF
GL
KVDVIPAL
--------
KVMN
D
DT
E
V
G
DIAQI
I
DGARYDF
Q
LDSQT
L
W
L
R
IPQI
Y
Q
NAIAAG
S
IA
P
KY
W
N
D
G
ES
A
A
WL
S
Y
Y
A
S
G
SRQNSDGD
-
-
---------------------
NL
S
SNWLN
L
NS
G
I
N
L
G
A
WRLR
NNTVY
N
----
--------------
-
--
-ESN
W
ESIS
T
SLQ
R
D
I
KA
L
R
S
QM
E
I
G
Q
TF
T
N
G
---
D
L
FD
S
VQM
T
G
IK
L
E
T
D
T
S
MLP
D
S
E
Q
GFAP
V
V
R
V
IA
N
S
D
A
Q
V
V
I
K
QNG
YV
IYQ
T
W
V
S
A
G
P
F
E
I
K
DL
SQV
T
AG
S
DL
E
VTI
K
E
T
N
G
QE
H
S
F
IQ
A
S
S
T
VP
I
L
Q
R
E
G
ALKYS
L
AA
G
KY
R
DS
D
NNA
--
ET
P
VFG
V
ATAIY
G
LPYGI
T
I
YGG
I
L
-
GASM
Y
H
S
G
V
T
G
I
G
A
D
L
GRL
G
S
VS
V
D
I
T
A
A
KTK
F
DDGRD
----
DAT
G
L
S
W
R
A
Q
YAK
DFPDTD
T
TVT
L
A
S
YRYST
SQ
F
YTFQ
E
A
L
D
-----
---
------
QRDTPDD---
--
--
------
KGIYSYRQ
TNNR
R
NRLQI
N
L
SQ
N
I
---
G-RW
GS
V
YL
N
G
Y
QQD
YW
GMHGAERSIGM
GYS
TTWNNI
N
---
W
S
VN
Y
T
LTKTPGM
---------
-TG
E
QQFS
L
T
L
N
I
P
LS
--------
-
----RWLPDSW
----
A
M
YNV
N
R
S
D
K
SNTSHQL
G
IG
GT
AL
Q
D
N
-
NL
SY
N
L
Q
--------
QSY
T
DNNVGYG
-
-
-
A
-
SMNGR
Y
R
---
SSV
G
E
F
GL
GY
S
Y
-DK
N
SRQWNYSAQ
G
A
--
V
VA
H
AH
G
V
TL
G
-
Q
S
V
---
Q
D
S
FA
IV
H
INEG
A
N
VK
V
Q
-
N
AQGVY
TD
FW
G
NA
I
VPNM
T
N
Y
RH
N
A
IT
V
-
-
N
TQG
H
D
S
L
D
ISD
A
TQDV
I
P
SK
GA
V
V
G
V
D
F
D
A
RS
G
MRA
L
LT
L
VH-N
K
E
RV
PFGA
LLT
L
-
-
-
--GNST
-
A
I
V
G
E
D
G
E
VY
I
T
G
VQES
M
T
F
T
V
Q
WG
KEINQQ
C
TGV
I
T
E
PEK
fig|431946.3.peg.168
Escherichia coli SE15 (3-812/825)
LPLHRTFVLTGITFALSAVYSLSYARDE
FN
L
R
I
L
ELDSPLE
N
TQ
V
-
L
E
D
F
VNN
N
NLT
-
P
G
V
Y
L
T
S
V
M
W
G
QEYLDK
R
-
N
I
T
F
-----
------ILSSDKKRLI
P
R
F
TKAD
--
L
REF
GL
KVDDIPAL
--------
QVMD
D
DT
EFG
DIAQI
I
DGARYDF
Q
LDSQT
L
C
L
R
IPQI
Y
Q
NARAAG
S
IS
P
KY
W
S
D
G
ES
A
V
WL
S
Y
Y
A
S
G
SRQNSDGD
-
-
---------------------
NL
N
SNWLN
L
NS
G
I
N
L
G
V
WRLR
NNTVY
S
----
--------------
-
--
-DSS
W
ESIS
T
SLQ
R
D
I
KA
L
R
S
QM
E
V
G
Q
TF
T
N
G
---
D
L
FD
S
VQM
T
G
IK
L
E
T
D
T
S
MLP
D
S
E
Q
GFAP
V
V
R
GIA
N
S
D
A
Q
V
V
I
K
QNG
YV
IYQ
T
W
V
S
A
G
P
F
E
I
K
DL
SQV
T
AG
A
DL
E
VTI
K
E
T
N
G
QE
H
S
F
IQ
A
S
S
T
VP
I
L
Q
R
E
G
ALKYS
L
AT
G
KY
R
DN
D
NHA
--
ET
P
VFG
V
ATAIY
G
LPYGI
T
I
YGG
I
L
-
GASI
Y
H
S
G
V
T
G
I
G
A
D
L
GRL
G
S
VS
V
D
I
T
A
A
ETK
F
DDGRD
----
DAT
G
L
S
W
R
A
Q
YAK
DFPDTD
T
TVT
L
A
S
YRYST
SQ
F
YTFQ
E
A
L
D
-----
---
------
QRDTPDD---
--
--
------
KGIYSYRQ
TNNR
R
NRLQI
N
L
SQ
N
I
---
G-RW
GS
V
YL
N
G
Y
QQD
YW
GMHGAERSIGM
GYS
TTWSNI
N
---
W
S
VN
Y
T
LTKTPGM
---------
-AG
E
QQFS
L
T
L
N
I
P
LS
--------
-
----RWLPDSW
----
A
M
YNV
N
R
S
D
K
SNTSHQL
G
IG
GT
AL
Q
D
N
-
NL
SY
N
L
Q
--------
QSY
T
DNNVGYD
-
-
-
A
-
SMNGR
Y
R
---
SSV
G
E
F
GL
GY
S
Y
-DK
N
SRQWNYSAQ
G
A
--
V
VA
H
AH
G
V
TL
G
-
Q
S
V
---
Q
D
S
FA
IV
H
INEG
A
N
VK
V
Q
-
N
AQGVY
TD
YW
G
NA
I
VPNM
T
N
Y
RH
N
A
IT
V
-
-
N
TQG
H
D
S
L
D
ISD
A
TQDV
I
P
SK
GA
V
V
G
V
D
F
D
A
RS
G
IRA
L
LT
L
VH-N
K
E
RV
PFGA
LLT
L
-
-
-
--GNST
-
A
I
V
G
E
D
G
E
VY
I
T
G
VQES
M
T
F
T
V
Q
WG
KEINQQ
C
TGV
V
T
V
PEK
fig|656417.3.peg.249
Escherichia coli M605 (3-812/825)
LPLHRTFVLTGITFALSAVYSLSYARDE
FN
L
R
I
L
ELDSPLE
N
TQ
V
-
L
E
D
F
VNN
N
NLT
-
P
G
V
Y
L
T
S
V
M
W
G
QEYLDK
R
-
N
I
T
F
-----
------ILSSDKKRLI
P
R
F
TKAD
--
L
REF
GL
KVDDIPAL
--------
QVMD
D
DT
EFG
DIAQI
I
DGARYDF
Q
LDSQT
L
C
L
R
IPQI
Y
Q
NARAAG
S
IS
P
KY
W
S
D
G
ES
A
V
WL
S
Y
Y
A
S
G
SRQNSDGD
-
-
---------------------
NL
N
SNWLN
L
NS
G
I
N
L
G
V
WRLR
NNTVY
S
----
--------------
-
--
-DSS
W
ESIS
T
SLQ
R
D
I
KA
L
R
S
QM
E
V
G
Q
TF
T
N
G
---
D
L
FD
S
VQM
T
G
IK
L
E
T
D
T
S
MLP
D
S
E
Q
GFAP
V
V
R
GIA
N
S
D
A
Q
V
V
I
K
QNG
YV
IYQ
T
W
V
S
A
G
P
F
E
I
K
DL
SQV
T
AG
A
DL
E
VTI
K
E
T
N
G
QE
H
S
F
IQ
A
S
S
T
VP
I
L
Q
R
E
G
ALKYS
L
AT
G
KY
R
DN
D
NHA
--
ET
P
VFG
V
ATAIY
G
LPYGI
T
I
YGG
I
L
-
GASI
Y
H
S
G
V
T
G
I
G
A
D
L
GRL
G
S
VS
V
D
I
T
A
A
ETK
F
DDGRD
----
DAT
G
L
S
W
R
A
Q
YAK
DFPDTD
T
TVT
L
A
S
YRYST
SQ
F
YTFQ
E
A
L
D
-----
---
------
QRDTPDD---
--
--
------
KGIYSYRQ
TNNR
R
NRLQI
N
L
SQ
N
I
---
G-RW
GS
V
YL
N
G
Y
QQD
YW
GMHGAERSIGM
GYS
TTWSNI
N
---
W
S
VN
Y
T
LTKTPGM
---------
-AG
E
QQFS
L
T
L
N
I
P
LS
--------
-
----RWLPDSW
----
A
M
YNV
N
R
S
D
K
SNTSHQL
G
IG
GT
AL
Q
D
N
-
NL
SY
N
L
Q
--------
QSY
T
DNNVGYG
-
-
-
A
-
SINGR
Y
R
---
SSV
G
E
F
GL
GY
S
Y
-DK
N
SRQWNYSAQ
G
A
--
V
VA
H
AH
G
V
TL
G
-
Q
S
V
---
Q
D
S
FA
IV
H
INEG
A
N
VK
V
Q
-
N
AQGVY
TD
YW
G
NA
I
VPNM
T
N
Y
RH
N
A
IT
V
-
-
N
TQG
H
D
S
L
D
ISD
A
TQDV
I
P
SK
GA
V
V
G
V
D
F
D
A
RS
G
IRA
L
LT
L
VH-N
K
E
RV
PFGA
LLT
L
-
-
-
--GNST
-
A
I
V
G
E
D
G
E
VY
I
T
G
VQES
M
T
F
T
V
Q
WG
KEINQQ
C
TGV
V
T
V
PEK
fig|679206.4.peg.4998
Escherichia coli MS 119-7 (11-851/852)
FVMAMCSICHANSGVERTYS
F
DS
T
L
L
NSDAKN-
---
V
DL
T
L
F
EAG
A
QL-
-
P
G
T
Y
H
V
D
I
I
L
N
DSIVES
R
-
EM
F
F
-----
HTAQDSEGKT---YLK
T
C
L
TRDM
--
L
IRY
G
V
KTEMYPEL
FHTSGKKN
NVGA
E
ED
C
A
-
DL-SV
I
PHATEMF
Q
FASQQ
L
R
L
G
IPQ
A
A
L
RPPLRG
I
AP
E
AL
WD
D
GI
T
A
F
LM
N
W
Q
A
NV
SQSEYRKY
G
H
S
--------------------
VS
D
NFWAS
I
EP
G
F
N
L
GPWR
V
R
NLMTW
S
KSSD
--------------
-
--
QPGN
W
ETVY
T
RAE
R
G
V
NN
M
KS
R
L
T
L
GD
DY
T
P
S
---
DIFD
S
LPF
RG
IM
L
G
SD
E
S
M
V
P
Y
N
Q
R
A
FAP
V
V
R
G
V
A
R
T
Q
A
R
I
E
V
R
QNG
YL
I
QSQ
T
V
A
P
GAF
A
L
T
DL
PLT
S
SG
GDL
Q
VT
V
L
E
S
DG
TT
Q
V
F
TV
P
F
T
T
P
A
V
A
L
R
E
G
YMKYN
V
TM
G
QY
R
PS
D
SAV
--
ER
S
LLG
Q
LTSIY
G
LPYGL
T
V
F
GG
V
Q
-
MAEH
Y
L
A
G
A
L
G
G
G
W
S
L
GGL
GA
I
S
V
D
SI
Y
A
RSQ
L
KGK-D
----
NEA
G
N
T
W
R
I
R
Y
N
K
SFELTD
T
SFT
V
A
S
Y
Q
YS
S
AG
Y
HSLP
N
V
L
D
-----
---
------
SYRDSRTGSF
D
H
--
------------
--
SENK
L
RRTTL
N
L
T
Q
P
L
---
G-VL
GS
AS
L
Y
G
S
RDE
Y
R
GNRAKQDSVGV
TLG
GSWNNI
S
---
W
S
VN
G
S
RNRNFGL
----
YKGQE
GKT
E
NRIS
L
W
M
SVP
LE
--------
-
----RWLGNAA
NDIR
A
T
TQI
L
K
S
S
G
QKTRYEV
G
MN
G
N
AF
-
D
R
-
RL
Y
W
D
I
S
--------
EEM
V
PGSENSS
-
D
NS
-
RLNLR
W
Q
---
GTY
G
E
L
TG
M
Y
G
Y
-SS
H
MRQISAGIS
GG
--
M
IA
H
SE
GITL
G
-
Q
K
T
---
G
D
T
TA
L
V
V
APG
V
S
G
AS
V
E
-
G
WPGVK
TD
YR
G
YT
L
AGYM
S
A
Y
QE
N
V
IT
M
D
P
S
TFK
-
E
N
A
E
VVR
T
DTKV
I
PT
K
GA
V
V
K
A
N
F
E
T
RV
G
ART
L
IT
L
TRHD
G
S
PV
PFG
S
VVT
L
E
E
E
KESHPS
V
G
V
M
G
N
N
G
E
VY
M
S
G
LPKK
G
N
L
K
V
V
WG
EK--NQ
C
NAS
Y
Q
L
PEQKGTAGIFLAR
S
V
C
fig|409438.11.peg.4936
Escherichia coli SE11 (4-850/870)
KSTFKLHNLSVAIIACL---SSAAYAEDY
F
D
P
D
L
L
SLGNRDM
SL
-
T
DL
S
A
F
SEQ
G
YSA
-
P
G
V
Y
I
V
D
IY
V
N
GNYLKT
D
-
S
I
R
F
-----
EHDKTN-------TLK
P
L
F
SLND
--
L
NEI
G
V
NLHSLKGT
--------
EHLP
H
DR
ASI
DNLSL
I
PFSSFVF
D
NSKQR
L
N
I
N
V
A
Q
V
H
M
QKETDN
R
LA
R
KF
WD
Q
GIPAL
FV
NY
S
Y
S
G
SQGQTRGN
K
N
KS
-------------------
ST
T
SDFLS
L
NA
G
A
N
L
G
A
WRLR
SNMNW
T
QSGF
ESEQYDTFE
D
AYRK
Q
KS
SQSK
W
DTGD
T
YLQ
R
D
V
QF
L
N
SEL
T
I
GD
YR
T
T
S
ITEQ
L
I
DG
FQF
RG
VS
L
S
S
S
E
Y
M
I
P
A
A
L
R
GFAP
V
I
T
G
Y
A
R
T
N
A
E
V
I
V
T
QNG
YS
IYQ
T
H
V
A
P
G
P
F
R
I
D
DL
PGG
S
SA
GD
I
Y
V
S
V
K
E
S
DG
TV
H
G
F
RQ
A
Y
S
T
L
P
E
M
Q
R
Q
G
DFKFE
Y
SV
G
RY
K
QS
G
YST
YE
NT
P
LFS
N
TSFLY
G
LPHNV
T
A
M
G
N
L
L
-
YSGD
Y
Q
S
V
S
L
G
AA
F
S
L
GML
G
T
L
S
T
S
VT
S
S
VTE
G
RDN-D
----
KLR
G
Y
S
V
N
A
R
Y
S
K
SLTETG
T
LFQ
L
A
S
YRYST
PD
F
RTFS
E
A
N
V
-----
---
------
EEYRGSSYIN
--
--
-----------
YM
L
SGRR
K
DTWSL
I
L
N
Q
S
I
---
T-SG
L
S
V
G
V
S
G
R
RDN
YW
DRHST-TSLSA
G
L
N
GTFRQT
S
---
W
SL
N
Y
N
IDRVRGN
-------
GS
WPE
N
RELS
L
S
V
SVP
FS
--------
A
FMSSGSMSSAN
----
F
N
YRT
A
H
N
N
Q
GRTTNMV
S
LN
G
S
AL
E
E
N
-
RL
S
W
N
I
S
--------
ENW
S
NSSRNYQ
R
D
ENF
SAGVS
Y
D
---
SQY
A
R
L
YG
GY
GR
-TS
Q
SNTYNYGAS
G
S
--
L
LA
H
PG
G
V
I
V
S
R
Q
N
I
---
G
N
A
AA
L
V
H
V
P
DV
P
G
AR
V
M
-
N
GRDIH
TD
NK
G
FA
L
VPYV
A
I
Y
EK
N
N
IT
I
D
P
V
SLS
-
D
G
I
E
LSE
T
SKAT
Y
PT
K
GA
I
V
S
V
E
Y
K
V
HS
G
QQA
L
IN
L
TH-D
GK
PV
P
L
GA
FVT
I
-
-
-
---GDQ
VF
I
V
G
H
S
G
Q
VY
V
S
G
VPES
G
R
L
K
V
K
WG
DKESY-
-
VAN
Y
K
L
NAKS
fig|344601.3.peg.2589
Escherichia coli B171 (18-857/866)
TFCA--LLYCNSAFCAEPVE
Y
D
H
T
FL
-MGQNAS
N
--
I
DL
S
R
Y
SEG
N
PAI
-
P
G
M
Y
D
V
S
VY
V
N
DQPIIN
Q
-
S
I
T
F
IE
IEG
KKNAQ-----------
A
C
I
TLKN
--
L
LQF
H
I
NSPDINNE
KA
V
LL
A
RD
ETLG
N
--
C
L
-
NLTEI
I
PQASVRY
D
VNEQR
L
D
I
D
V
PQ
A
W
V
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSETPGR
-
-
---------------------
RN
D
SIYAA
F
NG
G
M
N
L
G
A
WRLR
ASGNY
N
WMTD
--------------
-
--
SGSN
Y
DFKN
R
YIQ
R
D
I
AS
L
R
S
Q
L
I
L
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
T
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VTI
T
Q
G
G
YK
IY
ET
T
VP
P
GAF
V
I
D
DL
SPS
G
YG
S
DL
I
VT
V
E
E
S
DG
SK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SG
G
QV
L
KD
D
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
I
TDNN
Y
T
A
G
L
L
GLG
L
N
-
TSV
GA
F
S
F
DVTH
S
NVR
I
P-DDK
----
TYQ
G
Q
S
Y
R
V
S
W
N
K
LFEETS
T
SLN
I
A
A
YRYST
QN
Y
LGLN
D
A
L
T
-----
LID
EVKHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNRSNYSI
GYS
NSASWG
S
---
Y
S
V
S
-
A
QRSWNED
---------
GDT
D
DSVY
L
S
F
TI
P
IE
--------
K
LLGTEQRTSGF
----
Q
S
IDT
Q
M
S
S
D
FKGNNQL
N
VS
S
S
GY
S
D
N
A
RV
SY
S
V
N
--------
TGY
T
MNKASKD
L
S
YV
-
GGYAS
Y
E
---
SPW
G
S
L
AG
S
V
S
A
NSD
N
SRQVSLSTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
V
Q
APGA
Q
G
AR
I
N
-
Y
GN-ST
I
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
SAVA
VP
R
QG
SV
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
TRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQGNVI
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQQ
G
N
I
S
I
K
W
L
EESKPV
S
CL-
-
-
-
AHYQQSSEAEKIA
Q
S
II
fig|344601.5.peg.2694
Escherichia coli B171 (18-857/866)
TFCA--LLYCNSAFCAEPVE
Y
D
H
T
FL
-MGQNAS
N
--
I
DL
S
R
Y
SEG
N
PAI
-
P
G
M
Y
D
V
S
VY
V
N
DQPIIN
Q
-
S
I
T
F
IE
IEG
KKNAQ-----------
A
C
I
TLKN
--
L
LQF
H
I
NSPDINNE
KA
V
LL
A
RD
ETLG
N
--
C
L
-
NLTEI
I
PQASVRY
D
VNEQR
L
D
I
D
V
PQ
A
W
V
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSETPGR
-
-
---------------------
RN
D
SIYAA
F
NG
G
M
N
L
G
A
WRLR
ASGNY
N
WMTD
--------------
-
--
SGSN
Y
DFKN
R
YIQ
R
D
I
AS
L
R
S
Q
L
I
L
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
T
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VTI
T
Q
G
G
YK
IY
ET
T
VP
P
GAF
V
I
D
DL
SPS
G
YG
S
DL
I
VT
V
E
E
S
DG
SK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SG
G
QV
L
KD
D
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
I
TDNN
Y
T
A
G
L
L
GLG
L
N
-
TSV
GA
F
S
F
DVTH
S
NVR
I
P-DDK
----
TYQ
G
Q
S
Y
R
V
S
W
N
K
LFEETS
T
SLN
I
A
A
YRYST
QN
Y
LGLN
D
A
L
T
-----
LID
EVKHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNRSNYSI
GYS
NSASWG
S
---
Y
S
V
S
-
A
QRSWNED
---------
GDT
D
DSVY
L
S
F
TI
P
IE
--------
K
LLGTEQRTSGF
----
Q
S
IDT
Q
M
S
S
D
FKGNNQL
N
VS
S
S
GY
S
D
N
A
RV
SY
S
V
N
--------
TGY
T
MNKASKD
L
S
YV
-
GGYAS
Y
E
---
SPW
G
S
L
AG
S
V
S
A
NSD
N
SRQVSLSTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
V
Q
APGA
Q
G
AR
I
N
-
Y
GN-ST
I
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
SAVA
VP
R
QG
SV
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
TRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQGNVI
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQQ
G
N
I
S
I
K
W
L
EESKPV
S
CL-
-
-
-
AHYQQSSEAEKIA
Q
S
II
fig|340185.3.peg.1618
Escherichia coli E22 (18-857/866)
TFCA--LLYCNSAFCAEPVE
Y
D
H
T
FL
-MGQNAS
N
--
I
DL
S
R
Y
SEG
N
PAI
-
P
G
M
Y
D
V
S
VY
V
N
DQPIIN
Q
-
S
I
T
F
IE
IEG
KKNAQ-----------
A
C
I
TLKN
--
L
LQF
H
I
NSPDINNE
KA
V
LL
A
RD
ETLG
N
--
C
L
-
NLTEI
I
PQASVRY
D
VNEQR
L
D
I
D
V
PQ
A
W
V
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSETPGR
-
-
---------------------
RN
D
SIYAA
F
NG
G
M
N
L
G
A
WRLR
ASGNY
N
WMTD
--------------
-
--
SGSN
Y
DFKN
R
YIQ
R
D
I
AS
L
R
S
Q
L
I
L
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
T
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VTI
T
Q
G
G
YK
IY
ET
T
VP
P
GAF
V
I
D
DL
SPS
G
YG
S
DL
I
VT
V
E
E
S
DG
SK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SG
G
QV
L
KD
D
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
I
TDNN
Y
T
A
G
L
L
GLG
L
N
-
TSV
GA
F
S
F
DVTH
S
NVR
I
P-DDK
----
TYQ
G
Q
S
Y
R
V
S
W
N
K
LFEETS
T
SLN
I
A
A
YRYST
QN
Y
LGLN
D
A
L
T
-----
LID
EVKHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNRSNYSI
GYS
NSASWG
S
---
Y
S
V
S
-
A
QRSWNED
---------
GDT
D
DSVY
L
S
F
TI
P
IE
--------
K
LLGTEQRTSGF
----
Q
S
IDT
Q
M
S
S
D
FKGNNQL
N
VS
S
S
GY
S
D
N
A
RV
SY
S
V
N
--------
TGY
T
MNKASKD
L
S
YV
-
GGYAS
Y
E
---
SPW
G
S
L
AG
S
V
S
A
NSD
N
SRQVSLSTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
V
Q
APGA
Q
G
AR
I
N
-
Y
GN-ST
I
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
SAVA
VP
R
QG
SV
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
TRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQGNVI
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQQ
G
N
I
S
I
K
W
L
EESKPV
S
CL-
-
-
-
AHYQQSSEAEKIA
Q
S
II
fig|340185.4.peg.1706
Escherichia coli E22 (18-857/866)
TFCA--LLYCNSAFCAEPVE
Y
D
H
T
FL
-MGQNAS
N
--
I
DL
S
R
Y
SEG
N
PAI
-
P
G
M
Y
D
V
S
VY
V
N
DQPIIN
Q
-
S
I
T
F
IE
IEG
KKNAQ-----------
A
C
I
TLKN
--
L
LQF
H
I
NSPDINNE
KA
V
LL
A
RD
ETLG
N
--
C
L
-
NLTEI
I
PQASVRY
D
VNEQR
L
D
I
D
V
PQ
A
W
V
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSETPGR
-
-
---------------------
RN
D
SIYAA
F
NG
G
M
N
L
G
A
WRLR
ASGNY
N
WMTD
--------------
-
--
SGSN
Y
DFKN
R
YIQ
R
D
I
AS
L
R
S
Q
L
I
L
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
T
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VTI
T
Q
G
G
YK
IY
ET
T
VP
P
GAF
V
I
D
DL
SPS
G
YG
S
DL
I
VT
V
E
E
S
DG
SK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SG
G
QV
L
KD
D
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
I
TDNN
Y
T
A
G
L
L
GLG
L
N
-
TSV
GA
F
S
F
DVTH
S
NVR
I
P-DDK
----
TYQ
G
Q
S
Y
R
V
S
W
N
K
LFEETS
T
SLN
I
A
A
YRYST
QN
Y
LGLN
D
A
L
T
-----
LID
EVKHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNRSNYSI
GYS
NSASWG
S
---
Y
S
V
S
-
A
QRSWNED
---------
GDT
D
DSVY
L
S
F
TI
P
IE
--------
K
LLGTEQRTSGF
----
Q
S
IDT
Q
M
S
S
D
FKGNNQL
N
VS
S
S
GY
S
D
N
A
RV
SY
S
V
N
--------
TGY
T
MNKASKD
L
S
YV
-
GGYAS
Y
E
---
SPW
G
S
L
AG
S
V
S
A
NSD
N
SRQVSLSTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
V
Q
APGA
Q
G
AR
I
N
-
Y
GN-ST
I
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
SAVA
VP
R
QG
SV
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
TRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQGNVI
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQQ
G
N
I
S
I
K
W
L
EESKPV
S
CL-
-
-
-
AHYQQSSEAEKIA
Q
S
II
fig|585395.4.peg.141
Escherichia coli O103:H2 str. 12009 (18-857/866)
TFCA--LLYCNSAFCAEPVE
Y
D
H
T
FL
-MGQNAS
N
--
I
DL
S
R
Y
SEG
N
PAI
-
P
G
M
Y
D
V
S
VY
V
N
DQPIIN
Q
-
S
I
T
F
IE
IEG
KKNAQ-----------
A
C
I
TLKN
--
L
LQF
H
I
NSPDINNE
KA
V
LL
A
RD
ETLG
N
--
C
L
-
NLTEI
I
PQASVRY
D
VNEQR
L
D
I
D
V
PQ
A
W
V
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSETPGR
-
-
---------------------
RN
D
SIYAA
F
NG
G
M
N
L
G
A
WRLR
ASGNY
N
WMTD
--------------
-
--
SGSN
Y
DFKN
R
YIQ
R
D
I
AS
L
R
S
Q
L
I
L
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
T
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VTI
T
Q
G
G
YK
IY
ET
T
VP
P
GAF
V
I
D
DL
SPS
G
YG
S
DL
I
VT
V
E
E
S
DG
SK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SG
G
QV
L
KD
D
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
I
TDNN
Y
T
A
G
L
L
GLG
L
N
-
TSV
GA
F
S
F
DVTH
S
NVR
I
P-DDK
----
TYQ
G
Q
S
Y
R
V
S
W
N
K
LFEETS
T
SLN
I
A
A
YRYST
QN
Y
LGLN
D
A
L
T
-----
LID
EVKHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNRSNYSI
GYS
NSASWG
S
---
Y
S
V
S
-
A
QRSWNED
---------
GDT
D
DSVY
L
S
F
TI
P
IE
--------
K
LLGTEQRTSGF
----
Q
S
IDT
Q
M
S
S
D
FKGNNQL
N
VS
S
S
GY
S
D
N
A
RV
SY
S
V
N
--------
TGY
T
MNKASKD
L
S
YV
-
GGYAS
Y
E
---
SPW
G
S
L
AG
S
V
S
A
NSD
N
SRQVSLSTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
V
Q
APGA
Q
G
AR
I
N
-
Y
GN-ST
I
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
SAVA
VP
R
QG
SV
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
TRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQGNVI
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQQ
G
N
I
S
I
K
W
L
EESKPV
S
CL-
-
-
-
AHYQQSSEAEKIA
Q
S
II
fig|585397.7.peg.149
Escherichia coli ED1a (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
A
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
EDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QELEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IGQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|585397.9.peg.149
Escherichia coli ED1a (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
A
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
EDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QELEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IGQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|340197.3.peg.2915
Escherichia coli F11 (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
A
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
EDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QELEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IGQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|199310.1.peg.164
Escherichia coli CFT073 (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
A
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
DDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|749527.3.peg.3245
Escherichia coli MS 21-1 (18-857/866)
TFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
T
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSSDKNSE
KAILL
K
RD
EDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QELEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
LN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|405955.9.peg.120
Escherichia coli APEC O1 (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
A
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
DDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WITN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VRI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IGQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|656393.3.peg.757
Escherichia coli H299 (18-857/866)
TFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
T
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
EDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
VP
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IGQE
G
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|562.376.peg.1161
Escherichia coli WV_060327 (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGKDAS
N
--
I
DL
S
R
Y
TDG
N
PTL
-
P
G
I
Y
D
V
S
VY
V
N
DQPIMS
Q
-
S
I
A
F
A
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSE
KAILL
K
RD
DDLG
D
--
C
L
-
NLAEM
I
PQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
LH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
H
LLGTEHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGYAS
Y
E
---
SPW
G
T
L
SG
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
APGA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
G
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
V
LR
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
A
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
R
G
IEQE
G
N
I
R
I
T
W
I
EDGKPV
S
CF-
-
-
-
AHYQQNTTSEKIA
Q
S
II
fig|216593.1.peg.3428
Escherichia coli E2348/69 (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGNDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
I
N
DQPIMS
Q
-
S
I
A
F
T
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSG
KAILL
K
RD
DDLG
D
--
C
L
-
NLAEM
I
SQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLDTDHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGHAS
Y
E
---
SPW
G
T
L
SC
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
A
L
GA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
S
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
I
P
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
E
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
H
G
IEQE
E
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTASEKIA
Q
S
II
fig|574521.7.peg.147
Escherichia coli O127:H6 str. E2348/69 (1-853/862)
M
TIKSTNHLTHIATFCA--LLYSNSALCAELVE
Y
D
H
T
FL
-MGNDAS
N
--
I
DL
S
R
Y
TEG
N
PTL
-
P
G
I
Y
D
V
S
VY
I
N
DQPIMS
Q
-
S
I
A
F
T
VIEG
KKNAQ-----------
A
C
I
TQKN
--
L
LQF
H
I
SSPDKNSG
KAILL
K
RD
DDLG
D
--
C
L
-
NLAEM
I
SQSSIRY
D
VNDQR
L
D
I
D
V
PQ
A
W
I
MKNYQN
Y
VD
P
SL
W
E
N
GI
N
A
A
ML
S
YN
L
N
G
YHSESPGR
-
-
---------------------
TN
D
SIYAA
F
NG
G
I
N
L
G
A
WRLR
ASGNY
N
WMTN
--------------
-
--
VHSD
Y
DFQN
R
YLQ
R
DL
AS
L
R
S
Q
L
V
I
G
E
SY
T
T
G
---
E
T
FD
S
VSI
RG
IR
L
Y
SD
S
R
MLP
P
V
L
A
S
FAP
I
I
H
G
V
A
N
T
N
A
K
VT
V
M
QNG
YK
IY
ET
T
VP
P
GAF
A
I
D
DL
SPS
G
YG
S
DL
I
VTI
E
E
A
DG
TK
R
T
F
SQ
P
F
SSV
V
Q
M
L
R
P
G
VGRWD
I
SA
G
QV
L
KD
S
IQ-
--
DE
P
NLF
Q
ASYYY
G
LNNYL
T
G
Y
T
G
I
Q
L
TDNN
Y
T
A
G
L
L
GLG
M
N
-
TPV
GA
F
S
V
DVTH
S
NVS
I
P-DDK
----
TYQ
G
Q
S
Y
R
I
S
W
N
K
LFENTS
T
SLN
I
A
A
YRYST
QH
Y
LGLN
D
A
L
T
-----
LID
EVEHPE
QDLEPKSMRN
--
--
------------
--
YSRM
K
NQVTV
S
I
N
Q
P
L
KFE
KKDY
GS
F
YLS
G
S
WSD
YW
ASGQNSTNYSI
GYS
NSASWG
S
---
Y
S
I
S
-
A
QRSLNED
---------
GQT
D
DSIY
L
S
F
TI
P
IE
--------
N
LLDTDHRSSGF
----
Q
S
IDT
Q
L
N
S
D
FKGNNQL
N
IS
S
S
GY
S
D
TN
RI
SY
S
V
N
--------
TGY
M
MNKSSDD
L
S
YI
-
GGHAS
Y
E
---
SPW
G
T
L
SC
SAS
A
SSD
N
SRQFSLNTD
GG
--
F
VL
H
SG
G
L
T
F
S
-
N
D
SFSD
S
D
T
LA
V
I
Q
A
L
GA
K
G
AR
I
N
-
Y
GN-ST
V
D
RW
S
YG
V
TSAL
S
P
Y
HE
NR
IA
LD
I
N
DLE
-
N
D
V
E
LKS
T
STVA
I
P
R
QGA
V
V
F
A
D
F
E
T
VQ
G
QSA
I
MN
I
VRSD
GK
NI
PF
A
E
---
D
I
Y
D
EQNNII
-
G
N
V
G
Q
G
G
Q
A
F
V
H
G
IEQE
E
N
I
R
I
T
W
I
EEGKPV
S
CF-
-
-
-
AHYQQNTASEKIA
Q
S
II
fig|525281.3.peg.1568
Escherichia coli 83972 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
V
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSASINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYNSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
I
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFSAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|655817.3.peg.5064
Escherichia coli ABU 83972 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
V
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSASINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYNSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
I
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFSAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|749546.3.peg.3032
Escherichia coli MS 185-1 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
V
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSASINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYNSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
I
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFSAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|749528.3.peg.1477
Escherichia coli MS 45-1 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
V
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSASINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYNSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
I
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFSAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|362663.8.peg.3815
Escherichia coli 536 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSASINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
SQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
I
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGESVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
L
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AA
I
N
-
N
ASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|362663.9.peg.3829
Escherichia coli 536 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSASINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
SQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
I
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGESVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
L
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AA
I
N
-
N
ASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|340197.3.peg.849
Escherichia coli F11 (36-858/863)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSARINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
L
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|340197.5.peg.891
Escherichia coli F11 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSARINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
L
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|749550.3.peg.1517
Escherichia coli MS 200-1 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSARINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
L
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|869729.3.peg.4667
Escherichia coli UM146 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSARINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
L
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|364106.7.peg.4774
Escherichia coli UTI89 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSARINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
L
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|364106.8.peg.4773
Escherichia coli UTI89 (17-839/844)
LLFAALGLTVTNHSFAAEEAE
F
DS
E
FL
HLDKGIN
A
--
I
D
I
R
RF
SHG
N
PVP
-
E
G
R
Y
Y
S
D
IY
V
N
N-----
-
-
--
-
-
-
V
WK
G
KADLQYLRTANTGAPT
L
C
L
TPEL
--
L
---
--
--------
SL
I
D
LV
K
D
TMSG
N
TS
C
F
-
PASTG
L
SSARINF
D
LSTLR
L
N
I
E
IPQ
A
L
L
NTRPRG
Y
IS
P
AQ
W
Q
S
G
V
PA
A
FI
NY
D
A
NY
YQYSSSGT
-
-
---------------------
SN
E
QTYLG
L
KA
G
F
N
L
WG
W
A
LR
HRGSE
S
WNNS
--------------
-
--
YPAG
Y
QNIE
T
SIM
H
DL
AP
L
RA
Q
F
T
L
GD
FY
T
N
G
---
EL
M
D
S
LSL
RG
VR
L
A
SD
E
R
MLP
G
S
L
R
G
Y
AP
A
V
R
GIA
N
S
N
A
K
VTI
Y
QN
A
HI
L
Y
ET
T
VP
A
G
P
F
V
I
N
DL
YPS
G
YA
GDL
L
V
K
I
T
E
S
N
G
QT
R
M
F
TV
P
F
A
A
V
A
Q
L
I
R
P
G
FSRWQ
M
SV
G
KY
R
YA
N
KT-
--
YN
D
LIA
Q
GTYQY
G
LTNDI
T
L
NS
G
L
T
-
TASG
Y
T
A
G
L
A
GL
A
F
N
-
TPL
GA
I
A
S
D
I
T
L
S
RTA
F
RYSGV
----
TRK
G
Y
S
L
H
S
S
Y
S
I
NIPASN
T
NIT
L
A
A
YRYS
S
KD
F
YHLK
D
A
L
S
AN
HNA
FID
------
-DVSVKSTAF
--
--
------------
--
Y-RP
R
NQFQI
S
I
N
Q
E
L
---
GEKW
G
G
M
YL
T
G
T
TYN
YW
GHKGSRNEYQM
GYS
NFWKQL
G
---
Y
Q
I
G
L
S
QSRDNEQ
---------
QRR
D
DRFY
I
N
F
T
L
P
--
--------
-
-LGGSVQSPVF
----
S
T
V-L
N
Y
S
K
E
EKNSIQT
S
IS
GT
GG
E
D
N
-
QF
SY
G
I
S
GNSQENGP
SGY
A
MN-----
-
-
--
-
GG---
Y
R
---
SPY
V
N
I
TT
TVG
H
DTQ
N
NNQRSFGAS
G
A
--
V
VA
H
PY
G
V
TLS
-
N
D
L
---
S
D
T
FA
I
I
H
A
E
GA
Q
G
AV
I
N
-
N
ASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
Q
IS
I
D
P
S
NLD
-
L
N
V
E
LSA
T
EQEI
I
P
R
AN
S
AT
L
V
K
F
D
T
KT
G
RSL
L
FD
I
RMST
G
N
PP
P
M
AS
---
E
V
L
D
EHGQLA
-
G
Y
V
A
Q
A
G
K
V
F
T
R
G
LPEK
G
H
L
S
V
V
WG
PDNKDR
C
SFV
Y
H
V
AHNKDDMQSQLVP
V
L
C
IQHP
fig|444451.5.peg.657
Escherichia coli O157:H7 str. EC4196 (14-829/879)
CCVALAMSGSYVNAWAENEIQ
F
DS
R
FL
ELKGDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLTE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
AQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
L
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
G
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
L
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
TH
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
L
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
I
S
M
T
GYRYEYD
---------
NQA
D
KGVY
IS
L
S
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DNHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
L
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
YP
PFGA
---
E
V
K
N
DSAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
I
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|444447.5.peg.513
Escherichia coli O157:H7 str. EC4206 (14-829/879)
CCVALAMSGSYVNAWAENEIQ
F
DS
R
FL
ELKGDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLTE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
AQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
L
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
G
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
L
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
TH
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
L
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
I
S
M
T
GYRYEYD
---------
NQA
D
KGVY
IS
L
S
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DNHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
L
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
YP
PFGA
---
E
V
K
N
DSAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
I
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|83334.1.peg.3218
Escherichia coli O157:H7 (14-829/879)
CCVALAMSGSYVNAWAENEIQ
F
DS
R
FL
ELKGDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLTE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
AQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
L
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
G
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
L
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
TH
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
L
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
I
S
M
T
GYRYEYD
---------
NQA
D
KGVY
IS
L
S
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DNHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
L
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQG
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
YP
PFGA
---
E
V
K
N
DSAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
I
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|502346.5.peg.4151
Escherichia coli O157:H7 str. TW14588 (14-829/879)
CCVALAMSGSYVNAWAENEIQ
F
DS
R
FL
ELKGDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLTE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
AQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
L
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
G
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
L
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
TH
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
L
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
I
S
M
T
GYRYEYD
---------
NQA
D
KGVY
IS
L
S
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DNHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
L
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQG
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
YP
PFGA
---
E
V
K
N
DSAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
I
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|155864.1.peg.3219
Escherichia coli O157:H7 EDL933 (14-829/879)
CCVALAMSGSYVNAWAENEIQ
F
DS
R
FL
ELKGDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLTE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
AQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
X
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
G
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
L
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
TH
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
L
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
I
S
M
T
GYRYEYD
---------
NQA
D
KGVY
IS
L
X
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DNHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
L
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQG
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
YP
PFGA
---
E
V
K
N
DSAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
I
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|656419.3.peg.3114
Escherichia coli M718 (14-829/879)
CCIALAMSGSYVNAWAEDEIQ
F
DS
R
FL
ELKDDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLAE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
SQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
L
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
A
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
M
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
TH
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
V
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
I
S
M
T
GYRYEYD
---------
NQA
D
KGVY
IS
L
S
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DKHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
P
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
HP
PFGA
---
E
V
K
N
DNAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
T
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|749547.3.peg.844
Escherichia coli MS 187-1 (14-829/879)
CCIALAMSGSYVNAWAEDEIQ
F
DS
R
FL
ELKDDTK
---
I
DL
K
RF
SSQ
G
YVE
-
P
G
K
Y
N
L
Q
V
Q
L
N
KQPLAE
E
Y
DI
Y
W
-----
-----YASENDASKTY
A
C
L
TPEL
--
V
SQF
GL
KEDVAKNL
--------
QWIH
D
GK
C
L
-
-KPGQ
L
EGIDIKA
D
LSQSA
L
V
I
S
L
PQ
A
Y
L
EYTDIN
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
I
T
A
QTRHEENG
G
D
---------------------
DS
N
EISGN
G
TV
G
V
N
L
G
A
WRLR
ADWQT
D
YLHS
---
K
SNDD
--
D
VIN
G
DD
TQKN
W
EWSR
Y
YAW
RA
L
PS
LK
A
K
L
G
L
G
E
DY
L
N
S
---
DIFDG
FNY
V
G
GS
I
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
I
S
G
V
A
H
T
T
A
K
VT
V
S
Q
M
G
RV
IY
ET
Q
VP
A
G
P
F
R
I
Q
DL
-GD
S
VS
G
T
L
H
I
R
I
E
E
Q
N
G
QV
Q
E
Y
DI
N
T
A
S
M
P
F
L
T
R
P
G
QVRYK
L
MM
G
RP
Q
EW
G
HHV
--
EG
G
FFS
G
GEASW
G
IANGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
L
G
V
G
R
D
L
SVF
GAV
A
F
D
I
P
H
S
HTR
L
DKETA
YGKG
SLD
G
N
S
F
R
V
S
Y
S
K
DFDELN
S
RVT
F
A
GYR
F
S
E
EN
F
MTMS
E
Y
L
D
A
SD
S
E
MVR
------
----------
--
--
------------
--
TGND
K
EMYTA
T
Y
N
Q
N
F
---
RDAG
V
S
V
YL
N
Y
T
RHT
YW
D-RDEQTNYNV
M
L
S
HYFNLG
S
IRN
M
S
V
S
M
T
GYRYEYD
---------
NQT
D
KGVY
IS
L
S
M
P
WG
--------
-
-----------
----
D
S
STI
S
Y
N
G
N
YGSGSDS
S
QV
G
Y
FS
R
V
D
D
AT
H
Y
Q
L
N
--------
VGT
S
DNHSSV-
-
-
--
-
DGYYS
H
D
---
GSL
A
Q
V
DL
SA
NY
HEG
Q
YTSAGISLQ
GG
AT
L
TA
Q
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
V
A
G
VP
V
E
G
N
GAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
N
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
S
V
IS
G
QKA
M
AV
L
RLQD
G
S
HP
PFGA
---
E
V
K
N
DNAQNV
-
G
L
V
D
D
D
G
N
VYL
A
G
VKPG
E
H
M
I
V
S
WG
--GVAH
C
--D
I
H
L
PDP
fig|656417.3.peg.3017
Escherichia coli M605 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQSLSD
E
Y
DI
N
W
-----
-----YVSENDPTKTY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
E
DDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
H
GKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
A
S
QSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
V
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNT
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FN
R
I
D
D
AT
H
Y
Q
I
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GTL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
INAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|685038.3.peg.2402
Escherichia coli O83:H1 str. NRG 857C (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
SVP
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|431946.3.peg.2313
Escherichia coli SE15 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQSLSD
E
Y
DI
N
W
-----
-----YVSENDPTKTY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GI
S
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
E
DDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
A
S
QSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
V
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNT
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FN
R
I
D
D
AT
H
Y
Q
I
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GTL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GALVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
INAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|525281.3.peg.4146
Escherichia coli 83972 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|655817.3.peg.2790
Escherichia coli ABU 83972 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|199310.4.peg.2720
Escherichia coli CFT073 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|749546.3.peg.2772
Escherichia coli MS 185-1 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|749528.3.peg.2685
Escherichia coli MS 45-1 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|199310.1.peg.2808
Escherichia coli CFT073 (3-831/884)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|585057.4.peg.2590
Escherichia coli IAI39 (1-830/883)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKTY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
E
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
S
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
T
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|585057.6.peg.2593
Escherichia coli IAI39 (1-830/883)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKTY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
E
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
S
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
T
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|362663.8.peg.2404
Escherichia coli 536 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|362663.9.peg.2409
Escherichia coli 536 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|749550.3.peg.2880
Escherichia coli MS 200-1 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
EEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|405955.9.peg.2108
Escherichia coli APEC O1 (3-831/884)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|585397.7.peg.2809
Escherichia coli ED1a (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KHPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|585397.9.peg.2806
Escherichia coli ED1a (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KHPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAD
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|753642.3.peg.2745
Escherichia coli NC101 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KHPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|439855.10.peg.2662
Escherichia coli SMS-3-5 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKTY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
V
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
S
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
T
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|749527.3.peg.1680
Escherichia coli MS 21-1 (9-830/883)
LRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKTY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
E
DDDSS
N
ST
TSKN
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HRT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADEH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
S
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
T
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|340197.3.peg.3888
Escherichia coli F11 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
G
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|340197.5.peg.4063
Escherichia coli F11 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
G
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|340197.3.peg.4806
Escherichia coli F11 (3-831/884)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KQPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
VADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
G
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSV-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
fig|656440.3.peg.2410
Escherichia coli TA206 (1-829/882)
MP
NHSNFRLRGIACYIALAISGGSVNAWADDSIQ
F
D
P
R
FL
ELKGDTK
---
I
DL
G
K
F
SKK
G
YVD
-
AG
K
Y
N
L
R
VF
I
N
KHPLSD
E
Y
DI
N
W
-----
-----YVSENDPTKNY
A
C
L
TPEL
--
V
AAL
GL
KEGIAKSL
--------
QWTH
N
DE
C
L
-
-KPGQ
L
DGMEVEN
D
LSQSA
L
L
L
T
V
PQ
A
Y
L
EYTSSD
W
DP
P
SR
WD
D
GIP
G
L
IA
D
Y
S
L
N
A
QTRHQEQG
G
E
---------------------
DS
H
DISGN
G
TV
G
A
N
L
G
A
WR
F
R
ADWQS
D
YQHT
---
RSNDD
-
DDDSS
N
ST
TSKH
W
DWSR
Y
YAW
RA
L
PS
LK
A
K
L
S
L
G
E
DY
L
N
S
---
DIFDG
FNY
I
G
SS
V
S
T
DD
Q
MLP
P
NL
R
G
Y
AP
D
V
S
G
V
A
H
S
S
A
K
VTI
S
Q
M
G
RV
L
Y
ET
Q
VP
A
G
P
F
R
I
Q
D
I
-GD
S
VS
G
T
L
H
V
RV
E
E
Q
N
G
QV
Q
E
Y
DV
T
T
A
S
M
P
F
L
T
R
Q
G
QVRYK
V
MM
G
RP
E
DW
N
HKT
--
EG
G
FFS
G
GEASW
G
GADGW
S
L
YGG
A
L
-
ADKH
Y
Q
S
A
A
M
G
V
G
R
D
L
AQF
GA
L
A
F
DVTH
S
HVN
L
DHDSA
YGKG
KLD
G
N
S
F
R
V
S
YAK
DFDELN
S
RVT
F
A
GYR
F
S
E
KN
F
MTMS
E
Y
L
D
ANQSD
MAR
------
----------
--
--
------------
--
TGND
K
EMYTI
T
Y
N
Q
N
F
---
AAAG
V
S
I
YL
N
Y
S
HRT
YW
D-RPEQTNYNL
MF
S
HYFNMG
S
IRN
M
S
I
S
V
T
GYRYEYD
---------
DNA
D
KGMY
L
S
M
S
I
P
WS
--------
-
-----------
----
D
S
STV
T
Y
N
G
S
YGSGSDS
S
QV
G
Y
FK
R
V
D
D
AT
H
Y
Q
V
N
--------
VGT
S
EQHGSA-
-
-
--
-
DGYLS
H
D
---
GSL
A
K
V
DL
SA
NY
HEG
E
YRSAGIALQ
GG
AT
L
TA
H
GG
A
L
HRT
-
Q
N
M
---
G
G
T
RL
LI
D
A
D
G
I
A
N
VP
V
E
S
N
GAPVY
T
N
MF
G
KA
V
VADI
N
N
Y
YR
N
Q
AY
I
D
L
N
NLP
-
E
D
A
E
ATQ
S
VVQA
TL
T
E
GA
I
G
Y
R
K
F
K
V
IS
G
QKA
M
AV
L
RLRD
G
S
YP
PFGA
---
E
V
K
N
DEQQQV
-
G
I
V
D
D
E
G
N
VYL
A
G
VNAG
E
H
M
M
V
F
W
E
--GSAQ
C
--E
I
V
L
PK
Consen1
Primary consensus
MKIPTTTDIPQRyTwcl
Fnp
fL
---
Dl
rF
n
-
aG
Y
v
lw
N
q
-
di
f
-----
pCf
--
l
gl
--------
n
Ci
-
v
D
L
i
iPQi
l
y
P
Wd
GIpal
nYn
tg
g
---------------------
s
l
G
NiGpWRlR
n
--------------
--
w
t
Rai
Lksel
mGd
t
s
---
diFDg
rG
l
sdd
MlP
sq
GfAP
v
GiA
taA
vtI
QnG
iYqs
Vp
GaF
I
Dl
s
GdL
VtidE
dG
q
y
p
ssvp
l
R
G
l
G
r
n
--
p
q
G
t
YgG
q
-
Y
a
l
GlG
nl
GAvs
DvTha
l
----
G
S
R
lYaK
t
l
gYRySt
f
d
a
-----
------
--
dg
------------
nl
k
n
sQ
l
---
gslYls
t
YW
gys
s
---
sls
s
---------
e
ln
svP
--------
r
----
s
n
n
n
g
Gt
e
h
-
sy
v
--------
s
g
tg
-
w
---
g
l
gyny
d
GG
--
v
H
gitls
-
q
l
---
g
T
li
ApGa
g
ie
-
N
td
G
V
t
Y
Nr
ld
N
-
s
d
n
vptqGAlv
a
F
t
G
l
v
gk
PFga
---
v
e
-
g
V
d
G
vyl
G
g
l
v
Wg
C
y
l
a
cthpgS
Consen2
Secondary consensus
n
qmp
ds
l
n
i
g
p
l
y
v
w
vieg
a
l
ktv
h
kaillvrd
l
v
a
m
w
e
nga
d
a
-
q
g
l
a
f
ng
rsndd
dddss
st
y
y
dl
ra
i
e
l
g
el
s
i
v
ts
y
nl
y
i
v
s
li
m
l
et
s
p
i
g
t
rv
n
r
f
t
at
a
m
d
g
g
s
t
l
s
a
d
-
a
i
qs
f
ygkg
s
s
f
a
f
e
y
e
l
anqsd
dy
--
ehrdepiivnyh
--
r
f
v
i
s
mft
nirn
g
t
gtgnnhasd
d
is
ti
vltkrryt
-
t
s
s
y
r
d
hw
l
-
--
a
sash
at
a
hrt
n
s
iv
d
i
n
v
s
vn
n
q
is
k
e
tlr
g
r
v
m
ns
asia
i
s
q
lf
e
l
e
i
-
v
iiqhp
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character