fig|585055.6.peg.1329
Escherichia coli 55989 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|585055.8.peg.1328
Escherichia coli 55989 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|562.375.peg.4097
Escherichia coli EC4100B (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|344601.3.peg.17
Escherichia coli B171 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|344601.5.peg.17
Escherichia coli B171 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|413997.3.peg.1241
Escherichia coli B str. REL606 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|469008.4.peg.2494
Escherichia coli BL21(DE3) (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749547.3.peg.596
Escherichia coli MS 187-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|358709.5.peg.1530
Escherichia coli 101-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749540.3.peg.4436
Escherichia coli MS 146-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|1040638.4.peg.4517
Escherichia coli O104:H4 str. LB226692
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|679207.4.peg.1902
Escherichia coli MS 107-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|340186.3.peg.4389
Escherichia coli E110019 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESL
P
GRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|340186.5.peg.4614
Escherichia coli E110019 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESL
P
GRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|656414.3.peg.1433
Escherichia coli H736 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-ISDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|595496.3.peg.1130
Escherichia coli BW2952 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|536056.3.peg.2577
Escherichia coli DH1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|316401.4.peg.1510
Escherichia coli ETEC H10407 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|83333.1.peg.1188
Escherichia coli K12 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749544.3.peg.1401
Escherichia coli MS 175-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749548.3.peg.4961
Escherichia coli MS 196-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|316407.3.peg.1163
Escherichia coli W3110 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|316385.5.peg.1256
Escherichia coli str. K-12 substr. DH10B (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|316385.7.peg.1282
Escherichia coli str. K-12 substr. DH10B (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|511145.12.peg.1249
Escherichia coli str. K-12 substr. MG1655 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|511145.6.peg.1238
Escherichia coli str. K-12 substr. MG1655 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749545.3.peg.110
Escherichia coli MS 182-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NHSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749532.3.peg.1901
Escherichia coli MS 78-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NHSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|585034.4.peg.1213
Escherichia coli IAI1
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|585034.5.peg.1209
Escherichia coli IAI1
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|481805.3.peg.2601
Escherichia coli ATCC 8739 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEP
K
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|481805.6.peg.2588
Escherichia coli ATCC 8739 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEP
K
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749537.3.peg.3852
Escherichia coli MS 115-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VT
I
NSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKVS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749538.3.peg.4718
Escherichia coli MS 116-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAT
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|670888.3.peg.1882
Escherichia coli 1827-70 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
S
SSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|344610.3.peg.4280
Escherichia coli 53638 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
G
K
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|344610.7.peg.4874
Escherichia coli 53638 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
G
K
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|331112.3.peg.1219
Escherichia coli HS (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TSV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|331112.6.peg.1274
Escherichia coli HS (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TSV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|6666666.5357.peg.1682
Escherichia coli TY-2482 (240-954/954)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
S-
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|679204.3.peg.1436
Escherichia coli MS 145-7 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|550676.3.peg.683
Escherichia coli B185 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VA
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
V
V
M
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
G
K
SQ
L
NI
Y
VK
T
G
A
IH
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|409438.11.peg.1387
Escherichia coli SE11 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|511693.5.peg.1272
Escherichia coli BL21 (161-876/876)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
I
IN
FTGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPE------PAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQRF
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
T
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|216592.1.peg.1626
Escherichia coli 042 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHSG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
D
N
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|216592.3.peg.1304
Escherichia coli 042 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHSG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
D
N
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|550677.3.peg.2657
Escherichia coli B354 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
V
T
M
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
S
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YI
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAQ
NG
F
Y
S
DL
VI
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SQ
E
A
GQRF
N
L
SP
TG
Y
G
F
Y
L
EPQ
TQ
L
T
Y
SHQNEM
V
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|585396.4.peg.1591
Escherichia coli O111:H- str. 11128 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|573235.3.peg.1742
Escherichia coli O26:H11 str. 11368 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|656408.3.peg.1271
Escherichia coli H591 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|595495.4.peg.1742
Escherichia coli KO11 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|679206.4.peg.801
Escherichia coli MS 119-7 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|656443.3.peg.1528
Escherichia coli TA271 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|566546.3.peg.306
Escherichia coli W (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|566546.4.peg.1289
Escherichia coli W (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|331111.12.peg.1638
Escherichia coli E24377A (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
N
A
T
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|331111.3.peg.3810
Escherichia coli E24377A (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
N
A
T
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|340184.3.peg.3621
Escherichia coli B7A (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGTASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|340184.6.peg.3785
Escherichia coli B7A (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTFAT
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGTASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
T
D
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
S
N
LF
D
QKQVNGGYRFSF
fig|656379.3.peg.3292
Escherichia coli FVEC1302 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
VANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
DSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
V
T
M
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
S
G
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
AL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|656380.3.peg.2135
Escherichia coli FVEC1412 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
VANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
DSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
V
T
M
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
S
G
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
AL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749549.3.peg.874
Escherichia coli MS 198-1 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
VANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
DSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
V
T
M
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
S
G
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
AL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|585056.7.peg.1696
Escherichia coli UMN026 (240-955/955)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
VANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
DSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
V
T
M
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPN------PTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
S
G
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
AL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|656419.3.peg.1537
Escherichia coli M718 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VN
D
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHAS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPTPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
G
K
SQ
L
NI
Y
VK
T
G
A
IH
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|656437.3.peg.1344
Escherichia coli TA143 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPAPAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
S
G
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
AL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|562.373.peg.2844
Escherichia coli 1125A (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
T
S
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YQ
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|478007.5.peg.4699
Escherichia coli O157:H7 str. EC508 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
T
S
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YQ
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|701177.3.peg.1460
Escherichia coli O55:H7 str. CB9615 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
T
S
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YQ
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|656444.3.peg.1912
Escherichia coli TA280 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPAPTPKPTTTAD
A
GGNYL
NV
S
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
S
G
F
Y
S
DL
VV
I
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
AL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|749531.3.peg.41
Escherichia coli MS 69-1 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
STIK
TN
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
S
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
S
GS
Y
G
AS
AQ
T
----------
A
TAV
VNM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHTS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTIP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPAPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDE
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
MS
I
SL
E
A
GQR
L
N
L
SP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NV
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|550672.3.peg.1503
Escherichia coli B088 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VI
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VN
D
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHAS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPTPTPTPKPTTTAD
A
GGNYL
N
I
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
RDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
G
K
SQ
L
NM
Y
VK
T
G
A
IH
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
T
Q
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGYRFSF
fig|679205.4.peg.4040
Escherichia coli MS 124-1 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VA
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHSG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VT
N
NSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHAS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPTPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
RDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|749533.3.peg.4591
Escherichia coli MS 84-1 (240-961/961)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VA
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHSG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VT
N
NSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHAS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPTPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
RDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
TG
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NM
Y
VK
T
G
A
IR
EFS
GD
TE
YL
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|478008.5.peg.2916
Escherichia coli O157:H7 str. EC869 (161-882/882)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
T
S
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YQ
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|637388.3.peg.2046
Escherichia coli O157:H7 str. FRIK2000 (161-882/882)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
T
S
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YQ
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|570506.3.peg.3094
Escherichia coli O157:H7 str. FRIK966 (161-882/882)
MG
IN
V
Q--
KN
S
VVD
LG
T
N
----
S
S
IK
TS
GDNAHGLWSFG
QVSA
N
A
L
T
VD
VT
G
A
---
AANG
V
EVRG
GT
TTIGA
DS
HI
---
S
S
AQ
GG
G
L
-VT
SGS
D
A
T
IN
FSGTA
AQ
R
N
SI
FS
G
GS
Y
G
AS
AQ
T
----------
A
TAV
I
NM
Q
N
T
DI
TVDR
N
GSLAL
GLW
ALSG
G
RI
TG
---
--
--
--
DS
LA
I
T
GA
AGARG
IYA
MTNS
QI
DLT
S
------------
DL
V
I
DM
S
T
PD
Q---
-
--
MA
----
I
A
TQH
D
DGYAA
----------
S
R
IN
AS
-
GRMLIN
GS
VLS------KG
G
LINLDMHPG
SV
---------------
W
T
GSSLSD
N
VNG
-
GKL
D
VAM
N
N
-
S
V
W
N
VTSNSN
LD
TL
A
L
SH
S
T
V
-
D
FA
SHGS
T
A
GTF
T
T
LNVE
NLS
G
N
-----------------
STFIMRAD
V
VGEGNGVNNK
G
D
------------
L
L
N
ISG
-------------------
SSAGN
HVLAIR
N
QG
S
EATTG
N
EVL
T
VVK
---
TT
DGAASFS
A
SSQV
E
L
G
G
Y
L
YD
VRK
NGT
NW
E
L
Y
ASGTVP
E
PTPN
P
E
PTPA
P
A
QPP
IVNPD
PTPEPDPTPNPTPTPKPTTTAD
A
GGNYL
NV
G
Y
LLNYV
E
NR
TL
MQ
RMGDLR
NQSK
D
-
GN
I
WLR
SY
GG
SLDSF
A
S
G
KL
SGFD
MG
YSGIQ
F
GGD
K
R
-LSDV
M
P
L
YV
GL
YIG
ST
HAS
PDYS
G
GD
GT---
ARSD
YM
G
----
M
YAS
YMAH
NG
F
Y
S
DL
VV
K
AS
R
QK
N
SFH
V
LDS
Q
NN
G
-VN
AN
GT
ANG
LS
I
SL
E
A
GQRF
N
L
TP
T
S
Y
G
F
YIEPQ
TQ
L
T
Y
SHQNEM
A
MKAS
----
NGLNIH
L-
-
N
HYESLLGRAS
MIL
GYDIT
-
A
GN
SQ
L
NI
Y
VK
T
G
A
IR
EFS
GD
TE
YQ
L
N
N
SREKYS
F
K
G
-
--
NGWNN
GVGV
S
A
QYN
KQ
HTF
YLE
A
DYT
Q
-
G
N
LF
D
QKQVNGGY
H
FSF
fig|199310.1.peg.407
Escherichia coli CFT073 (104-760/776)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|405955.9.peg.257
Escherichia coli APEC O1 (104-760/776)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|431946.3.peg.292
Escherichia coli SE15 (75-747/763)
GHDI
T
A
T
S
T
VDQG
F
V
E
G
VK
V
SGD
KN
V
V
I
N
-A
T
G
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
S
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
S
S
N
GS
T
A
IF
AQ
K
G
SV
L
N
GF
N
GD
A
TDN
IT
L
A
G
S
N
I
I---
N
GGIET
---
IVIA
K
EN
K
G
T
HT
VN
LN
IK
D
G
SI
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SAGTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTLNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
A
S
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LSVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|749546.3.peg.71
Escherichia coli MS 185-1 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
V
NLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|525281.3.peg.2587
Escherichia coli 83972 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|655817.3.peg.431
Escherichia coli ABU 83972 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|199310.4.peg.398
Escherichia coli CFT073 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|405955.13.peg.316
Escherichia coli APEC O1 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|714962.3.peg.304
Escherichia coli IHE3034 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|585035.6.peg.308
Escherichia coli S88 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|869729.3.peg.3378
Escherichia coli UM146 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|364106.7.peg.445
Escherichia coli UTI89 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|364106.8.peg.443
Escherichia coli UTI89 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
Y
S
N
GS
T
A
IF
AQ
K
GNLLQGFDGD
A
TDN
IT
L
A
D
S
N
I
I---
N
GGIET
---
IVTA
G
NK
TG
IHT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SASTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|656417.3.peg.448
Escherichia coli M605 (100-749/765)
KN
V
V
I
N
-A
T
G
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
S
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KN
S
S
N
GS
T
A
IF
AQ
K
G
SV
L
N
GF
N
GD
A
TDN
IT
L
A
G
S
N
I
I---
N
GGIET
---
IVIA
K
EN
K
G
T
HT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SAGTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTLNSGDKAG
S
DLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
DA
S
V
S
L
Q
N
G
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LSVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
DV
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|550676.3.peg.1094
Escherichia coli B185 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
G
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
S
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
S
S
AT
G
I
S
L
EAT
T
G
N
N
L
T
L
N
GTTIN
AQ
G
N
KS
S
S
N
GS
T
A
IF
AQ
K
G
SV
L
N
GF
N
GD
A
TDN
IT
L
A
G
S
N
I
I---
N
GRIET
---
IVIA
K
EN
TG
T
HT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
AGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DI
H
A
----
L
S
ASE
N
SAGTT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
NLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
N
A
S
V
S
L
Q
N
E
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
N
V
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AT
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|656419.3.peg.491
Escherichia coli M718 (93-749/765)
G
VK
V
SGN
KN
V
V
I
N
-A
T
D
----
STI
T
AQ
G
E
--------
G
TYVR
T
A
M
V
I
D
ST
G
-
---
---D
V
VVNG
G
N
FV--A
K
N
EK
---
G
S
AT
G
I
S
L
EGT
K
G
N
N
V
T
L
N
GTTIN
AQ
G
N
KS
S
S
N
A
S
T
A
IF
AQ
K
G
S
LL
N
GF
N
GD
A
TDN
IT
L
A
G
S
N
I
I---
N
GRIET
---
ILIA
Q
EN
K
G
T
HT
VN
LN
IK
D
G
SV
I
G
A
A
NNKQT
IYA
SASA
Q
G
TGS
A
------------
TQ
N
L
NL
S
V
A
D
STIY
S
DV
L
A
----
L
S
ESE
N
SAATT
----------
T
N
V
N
MN
V
ARSYWE
G
N
AYTFNSGDKAG
S
NLDINLSDS
SV
---------------
W
K
G
-----
K
V
S
G
A
G
N
A
S
V
S
L
Q
N
E
S
V
W
N
VT
GS
S
T
VD
A
L
A
V
KD
S
T
V
-
N
I
T
KATV
N
T
GTFA
-
----
--
S
Q
N
-----------------
G
T
L
I
VD
A
-
-
--------
SS
E
N
------------
T
L
D
ISG
-------------------
KA
S
G
D
--LRVY
S
AG
S
LDLIN
E
Q--
T
AFI
---
S
T
GKDSTLK
A
TGTT
E
G
G
L
Y
Q
YD
LTQ
-
G
A
DG
N
F
Y
FVKNTH
K
ASNA
S
S
VIQ
A
M
A
AAP
-----
----------------------
-
----A
NV
A
N
L----
Q
AD
TL
SA
R
QDAV
R
LSEN
D
K
G
G
V
W
I
Q
YF
GG
KQKHT
T
A
G
NA
S
-
Y
D
LD
V
N
G
V
M
L
GGD
T
R
FMTED
G
S
W
LA
G
V
AMS
S
-
---
---
A
K
GD
MTTMQ
S
KG
D
TE
G
YSFH
A
Y
L
S
RQYN
NG
I
F
I
D
T
AA
Q
FG
H
YS
N
TAD
V
RLM
N
GG
G
TIK
A
D
FN
T
NG
FG
A
MV
K
G
G
YT
W
K
-
--
D
G
N
G
L
F
I
Q
P
-
--
-
-
Y
AKLSAL
T
LEGV
DYQL
NG
V
N
V
H
S-
-
D
S
Y
N
S
V
LG
E
A
G
TRV
GYD
F
A
-
V
GN
AS
V
KP
Y
LN
L
A
A
LN
EFS
DG
N
K
VR
L
G
D
ESVNAS
I
D
G
-
--
AAFRV
G
A
GV
Q
A
DIT
K
N
MGA
Y
AS
L
DYT
K
-
G
D
fig|340184.3.peg.4473
Escherichia coli B7A (668-1366/1366)
QG
HPIIHA
G
TT
------
TS
AS
Q
S
DWE
T
R
Q
F
TL
G
K
L
K
L
DA
AT
F
YLS
R
N
G
Q
M
H
GD
I
N
A
V
N
G
ST
V
I
LG
S
D
---------
----
-
H
V
F
T
D
KN
D
G
---
TGNS
V
SSVE
GT
ATATT
TV
D-
---
Q
S
DY
R
G
N
L
TLE
N
K
S
S
L
Q
I
-
-----
--
-
R
EK
F
T
G
G
I
E
A
--
--
-
----------
Y
DSS
V
SV
N
S
Q
NV
IFDR
V
GSFVN
---
----
-
--
--
---
SS
L
S
LE
K
W
AS
L
T
A
Q
SG---
I
F
S
TGVV
D
L
KGN
A
------------
S
L
T
L
-T
G
I
P
S
AEKH
S
YY
S
PV
VS
-
I
I
EGI
N
L-GSQ
----------
S
S
L
T
VE
-
NMGYLN
SD
IMAENEAMVNL
G
--DSGAETG
KTDSPLFI
SL
MK
GY
NAVL
S
G
-----
N
I
T
G
-
A
K
S
I
V
N
M
N
N
-
S
L
W
C
I
T
G
NS
T
TG
M
L
N
A
RN
S
R
V
-
E
VG
NG-K
N
F
AN
L
QV
KELV
--
A
D
N
-----------------
STF
L
M
H
T
N
N
--------
S
Q
A
D
------------
H
L
N
VT
D
-------------------
K
L
S
G
S
RNTILV
N
FL
N
NPANG
-
-MN
V
TLI
---
T
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
G
S
NE
K
M
F
QAG--T
Q
QIGF
S
N
V
TP
V
I
S
AEK
-----
TDSSTKWVLTGYQTVSDVRTSK
I
ATDFM
AS
G
Y
KSFLR
E
VN
N
L
NK
RMGDLR
DTQG
D
-
T
G
V
W
G
R
IM
N
G
-----
R
G
S
AN
G
G
Y
S
DN
Y
T
H
V
Q
I
G
A
D
R
N
HELDN
M
D
L
FT
G
V
LLT
Y
T
DSD
---
A
S
SH
VFRGK
T
K
S
V
GG
G
----
L
YAS
ALFN
S
G
A
Y
F
DL
IG
K
YL
H
DD
N
QYT
-
--A
D
FA
S
LGA
K
N
YS
S
HS
WY
A
GA
E
V
G
Y
R
Y
H
L
SE
EA
-
-
-
W
V
EPQ
IE
L
V
Y
GAVSGK
S
FKWE
D
---
R
G
M
E
L
G
MK
D
R
D
Y
NP
L
I
GR
S
G
VDM
G
K
V
F
S
-
G
G
D
WK
I
TA
R
AG
L
G
Y
QF
DL
L
TN
G
E
TV
L
R
D
ISGEKR
F
N
G
E
KD
SRMLV
S
VG
L
N
A
EIK
N
N
MRF
G
LE
L
D
K
S
A
F
G
K
YN
V
D
N
AI
N
ANI
R
Y
Y
F
fig|340184.6.peg.4686
Escherichia coli B7A (668-1366/1366)
QG
HPIIHA
G
TT
------
TS
AS
Q
S
DWE
T
R
Q
F
TL
G
K
L
K
L
DA
AT
F
YLS
R
N
G
Q
M
H
GD
I
N
A
V
N
G
ST
V
I
LG
S
D
---------
----
-
H
V
F
T
D
KN
D
G
---
TGNS
V
SSVE
GT
ATATT
TV
D-
---
Q
S
DY
R
G
N
L
TLE
N
K
S
S
L
Q
I
-
-----
--
-
R
EK
F
T
G
G
I
E
A
--
--
-
----------
Y
DSS
V
SV
N
S
Q
NV
IFDR
V
GSFVN
---
----
-
--
--
---
SS
L
S
LE
K
W
AS
L
T
A
Q
SG---
I
F
S
TGVV
D
L
KGN
A
------------
S
L
T
L
-T
G
I
P
S
AEKH
S
YY
S
PV
VS
-
I
I
EGI
N
L-GSQ
----------
S
S
L
T
VE
-
NMGYLN
SD
IMAENEAMVNL
G
--DSGAETG
KTDSPLFI
SL
MK
GY
NAVL
S
G
-----
N
I
T
G
-
A
K
S
I
V
N
M
N
N
-
S
L
W
C
I
T
G
NS
T
TG
M
L
N
A
RN
S
R
V
-
E
VG
NG-K
N
F
AN
L
QV
KELV
--
A
D
N
-----------------
STF
L
M
H
T
N
N
--------
S
Q
A
D
------------
H
L
N
VT
D
-------------------
K
L
S
G
S
RNTILV
N
FL
N
NPANG
-
-MN
V
TLI
---
T
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
G
S
NE
K
M
F
QAG--T
Q
QIGF
S
N
V
TP
V
I
S
AEK
-----
TDSSTKWVLTGYQTVSDVRTSK
I
ATDFM
AS
G
Y
KSFLR
E
VN
N
L
NK
RMGDLR
DTQG
D
-
T
G
V
W
G
R
IM
N
G
-----
R
G
S
AN
G
G
Y
S
DN
Y
T
H
V
Q
I
G
A
D
R
N
HELDN
M
D
L
FT
G
V
LLT
Y
T
DSD
---
A
S
SH
VFRGK
T
K
S
V
GG
G
----
L
YAS
ALFN
S
G
A
Y
F
DL
IG
K
YL
H
DD
N
QYT
-
--A
D
FA
S
LGA
K
N
YS
S
HS
WY
A
GA
E
V
G
Y
R
Y
H
L
SE
EA
-
-
-
W
V
EPQ
IE
L
V
Y
GAVSGK
S
FKWE
D
---
R
G
M
E
L
G
MK
D
R
D
Y
NP
L
I
GR
S
G
VDM
G
K
V
F
S
-
G
G
D
WK
I
TA
R
AG
L
G
Y
QF
DL
L
TN
G
E
TV
L
R
D
ISGEKR
F
N
G
E
KD
SRMLV
S
VG
L
N
A
EIK
N
N
MRF
G
LE
L
D
K
S
A
F
G
K
YN
V
D
N
AI
N
ANI
R
Y
Y
F
fig|550672.3.peg.4213
Escherichia coli B088 (635-1281/1281)
N
AA
F
DLA
R
N
A
S
L
S
TN
I
N
A
NH
-
ST
VT
LG
S
E
---------
----
-
D
L
Y
I
D
IN
D
G
---
NGVQ
T
TPSR
G
Q
SKATT
ET
D-
---
Q
S
RF
N
G
R
V
TLK
N
GS
T
L
T
I
-
-----
--
-
N
EH
F
T
G
G
I
E
S
--
--
-
----------
T
DST
T
T
V
T
S
G
D
A
TLNV
F
SSFTR
---
----
-
--
--
---
SA
L
A
LA
D
G
AN
L
T
A
T
SG---
L
I
S
DGEV
TA
GAG
S
------------
T
L
S
M
-L
S
G
Q-
---Y
-
--
--
----
T
A
GRW
S
FIGQG
----------
S
T
L
N
VG
-
AGSIMA
G
N
IQANDAASLSF
G
VTNQADQNL
-FTA
-------------
Y
G
G
-----
N
LSA
-
P
L
A
R
V
M
M
T
N
-
T
L
W
R
AE
G
Q
S
V
VK
S
L
E
L
KG
A
Q
V
-
S
F
S
NV-G
A
A
G
A
LT
V
DTLT
--
A
N
N
-----------------
S
M
FI
I
N
T
N
G
--------
KT
A
D
------------
M
V
T
V
N
Q
-------------------
S
LN
G
K
NNSLVV
V
PT
V
SAARE
A
TSS
L
PLV
---
N
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
A
T
ES
D
V
F
MLNPIT
Q
RTGF
H
T
Y
TP
Q
L
S
MVE
-----
TENSKQWRLEGFDVQQDKAAIQ
A
GKSVM
D
V
G
Y
KSFLT
E
MN
N
L
NY
RMGDLR
NTHG
D
-
T
G
T
W
A
R
IY
S
G
-----
T
G
S
AD
A
G
Y
S
NS
WT
H
L
Q
I
G
A
D
R
K
QSFNG
G
D
L
FT
G
V
TAT
F
T
NSN
---
S
H
G
T
GWSGQ
T
K
S
V
GI
G
----
L
YAS
TMFD
S
G
L
Y
V
D
V
IG
K
YV
H
HD
N
HYS
-
--A
T
EV
S
MSE
QD
YS
S
R
S
WY
L
GT
E
T
G
W
RF
S
L
PG
EA
-
-
-
F
IEPQ
TE
L
V
Y
GAVSGN
R
FAWQ
S
---
E
G
Y
D
I
S
MQ
R
K
QE
NP
L
I
GR
TG
VEI
G
KTF
T
-
G
G
D
YK
L
TA
L
AG
V
H
Y
QY
DL
F
NP
E
K
TV
V
R
D
LAGETF
I
R
N
G
KD
SRVNF
N
L
GV
N
A
EIK
E
N
TRI
S
L
D
V
ERS
A
S
G
H
Y
D
I
D
K
AI
N
AN
V
R
Y
SF
fig|340185.3.peg.3793
Escherichia coli E22 (640-1309/1309)
R
N
S
VV
E
GD
I
V
A
S
N
-
ST
L
K
LG
GD
---------
----
V
P
V
F
I
D
MY
D
G
INI
TGNG
F
GFRQ
DV
REGRS
AD
DG
---
S
S
SY
T
G
N
I
TLQ
K
GS
T
L
D
I
-
-----
--
-
N
NR
F
T
G
G
I
E
A
--
--
-
----------
H
DSQ
VN
V
T
S
P
D
A
LLQN
S
GVFMN
---
----
-
--
--
---
ST
L
S
VR
D
G
GH
L
T
A
Q
KG---
L
Y
S
DGRV
QI
GKN
G
------------
T
L
S
L
-S
G
T
P
E
NGAD
N
TW
M
PV
LTY
M
T
EGY
D
LTGDN
----------
A
T
LD
IS
-
QQAHVS
G
D
VHATSSSTIRI
G
SENPGSVSS
SV
SPVLAAGV
F
S
GY
NAA
Y
Y
G
-----
A
I
T
G
-
GK
G
N
V
S
M
N
N
-
G
L
W
Q
L
T
G
D
S
D
IN
S
L
T
T
RN
S
R
V
-
Q
SE
EK-G
A
F
R
T
LT
V
NTLD
--
A
T
G
-----------------
S
D
F
V
L
R
T
D
L
--------
KG
A
D
------------
K
I
S
I
T
E
-------------------
KA
S
G
S
DNTLNV
S
FM
K
NPSPG
Q
SLN
I
PLV
---
SA
PA-----
-
----
-
-
-
-
-
-
--
---
-
GT
SG
D
I
F
KAG--T
R
VTGF
S
R
V
TP
T
L
H
VDT
-----
TGGSTKWILDGFRTEADKAAAA
K
ADSFM
N
A
G
Y
KNFMT
E
VN
N
L
NK
RMG
E
LR
DTNG
D
-
AG
A
W
A
R
IM
N
G
-----
A
G
S
AD
G
G
Y
S
DN
Y
T
H
V
Q
V
G
F
D
K
K
HVLDG
V
D
L
FT
G
I
TMT
Y
T
DSS
---
A
D
S
D
AFSGK
T
K
S
V
GG
G
----
L
YAS
ALFN
S
G
A
Y
I
DL
IG
K
YI
H
HN
N
DYT
-
--G
N
FA
G
LGT
K
H
YG
T
HS
WY
A
GA
E
T
G
Y
R
Y
H
L
TE
DT
-
-
-
F
IEPQ
AE
L
V
Y
GAVSGK
T
FRWK
D
---
GE
M
D
L
S
MK
N
K
D
F
S
P
L
I
GR
TG
IEL
G
KTF
S
-
G
KD
WS
V
TA
R
AG
T
S
W
QF
DL
L
NN
G
E
TV
L
R
D
ASGEKR
I
K
G
E
KD
SRMLF
N
VG
M
N
A
QIK
DN
MRF
G
LE
F
E
K
S
A
F
G
K
YN
V
D
N
AI
N
AN
F
R
Y
M
F
fig|340185.4.peg.3988
Escherichia coli E22 (585-1254/1254)
R
N
S
VV
E
GD
I
V
A
S
N
-
ST
L
K
LG
GD
---------
----
V
P
V
F
I
D
MY
D
G
INI
TGNG
F
GFRQ
DV
REGRS
AD
DG
---
S
S
SY
T
G
N
I
TLQ
K
GS
T
L
D
I
-
-----
--
-
N
NR
F
T
G
G
I
E
A
--
--
-
----------
H
DSQ
VN
V
T
S
P
D
A
LLQN
S
GVFMN
---
----
-
--
--
---
ST
L
S
VR
D
G
GH
L
T
A
Q
KG---
L
Y
S
DGRV
QI
GKN
G
------------
T
L
S
L
-S
G
T
P
E
NGAD
N
TW
M
PV
LTY
M
T
EGY
D
LTGDN
----------
A
T
LD
IS
-
QQAHVS
G
D
VHATSSSTIRI
G
SENPGSVSS
SV
SPVLAAGV
F
S
GY
NAA
Y
Y
G
-----
A
I
T
G
-
GK
G
N
V
S
M
N
N
-
G
L
W
Q
L
T
G
D
S
D
IN
S
L
T
T
RN
S
R
V
-
Q
SE
EK-G
A
F
R
T
LT
V
NTLD
--
A
T
G
-----------------
S
D
F
V
L
R
T
D
L
--------
KG
A
D
------------
K
I
S
I
T
E
-------------------
KA
S
G
S
DNTLNV
S
FM
K
NPSPG
Q
SLN
I
PLV
---
SA
PA-----
-
----
-
-
-
-
-
-
--
---
-
GT
SG
D
I
F
KAG--T
R
VTGF
S
R
V
TP
T
L
H
VDT
-----
TGGSTKWILDGFRTEADKAAAA
K
ADSFM
N
A
G
Y
KNFMT
E
VN
N
L
NK
RMG
E
LR
DTNG
D
-
AG
A
W
A
R
IM
N
G
-----
A
G
S
AD
G
G
Y
S
DN
Y
T
H
V
Q
V
G
F
D
K
K
HVLDG
V
D
L
FT
G
I
TMT
Y
T
DSS
---
A
D
S
D
AFSGK
T
K
S
V
GG
G
----
L
YAS
ALFN
S
G
A
Y
I
DL
IG
K
YI
H
HN
N
DYT
-
--G
N
FA
G
LGT
K
H
YG
T
HS
WY
A
GA
E
T
G
Y
R
Y
H
L
TE
DT
-
-
-
F
IEPQ
AE
L
V
Y
GAVSGK
T
FRWK
D
---
GE
M
D
L
S
MK
N
K
D
F
S
P
L
I
GR
TG
IEL
G
KTF
S
-
G
KD
WS
V
TA
R
AG
T
S
W
QF
DL
L
NN
G
E
TV
L
R
D
ASGEKR
I
K
G
E
KD
SRMLF
N
VG
M
N
A
QIK
DN
MRF
G
LE
F
E
K
S
A
F
G
K
YN
V
D
N
AI
N
AN
F
R
Y
M
F
fig|656417.3.peg.5223
Escherichia coli M605 (585-1254/1254)
R
N
S
VV
E
GD
I
V
A
S
N
-
ST
L
K
LG
GD
---------
----
V
P
V
F
I
D
MY
D
G
INI
TGNG
F
GFRQ
DV
REGRS
AD
DG
---
S
S
SY
T
G
K
I
TLQ
K
GS
T
L
D
I
-
-----
--
-
N
NR
F
I
G
G
I
E
A
--
--
-
----------
H
DSK
VN
V
T
S
P
D
A
LLQN
S
GVFVN
---
----
-
--
--
---
ST
L
S
VR
D
G
GH
L
T
A
Q
KG---
L
Y
S
DGRV
QI
GKK
G
------------
T
L
S
L
-S
G
T
P
E
NGAD
N
TW
M
PV
LTY
M
T
EGY
D
LTGDN
----------
A
T
L
N
IS
-
QQAHVS
G
D
VHATSSSSIRI
G
SENPGSVSS
SV
SPVLAAG
LF
S
GY
NAA
Y
Y
G
-----
A
I
T
G
-
GK
G
N
V
S
M
N
N
-
G
L
W
Q
L
T
G
D
S
D
IN
S
L
T
T
RN
S
R
V
-
Q
SE
EN-G
A
F
R
T
LT
V
KTLD
--
A
T
G
-----------------
S
D
F
V
L
R
T
D
L
--------
KD
A
D
------------
K
I
S
I
T
E
-------------------
KA
S
G
S
DNTLNV
S
FM
K
NPSPG
Q
SLN
I
PLV
---
SA
PA-----
-
----
-
-
-
-
-
-
--
---
-
GT
SG
D
I
F
KAG--T
R
VTGF
S
R
V
TP
T
L
R
VDT
-----
TGGSTKWILDGFRTEADKAAAA
K
ANSFM
N
A
G
Y
KSFMT
E
VN
N
L
NK
RMG
E
LR
DTNG
D
-
AG
A
W
A
R
IM
N
G
-----
A
G
S
AD
G
G
Y
S
DN
Y
T
H
V
Q
V
G
F
D
K
K
HALDG
V
D
L
FT
G
V
TMT
Y
T
DSS
---
A
D
S
D
AFSGK
T
K
S
V
GG
G
----
L
YAS
ALFN
S
G
A
Y
I
DL
IG
K
YI
H
HD
N
DYT
-
--G
N
FA
G
LGT
K
H
YG
T
HS
WY
A
GA
E
T
G
Y
R
Y
H
L
TE
DT
-
-
-
F
IEPQ
AE
L
V
Y
GAVSGK
T
FRWK
D
---
GE
M
D
L
S
MK
N
K
D
F
S
P
L
I
GR
TG
IEL
G
KTF
R
-
G
KD
WS
V
TA
R
AG
T
S
W
QF
DL
L
NN
G
E
TV
L
R
D
ASGEKR
I
K
G
E
KD
SRMLF
N
VG
M
N
A
QIK
DN
MRF
G
LE
F
E
K
S
A
F
G
K
YN
V
D
N
AI
N
AN
F
R
Y
M
F
fig|431946.3.peg.4172
Escherichia coli SE15 (628-1278/1278)
L
N
L
QN
AE
F
NLA
R
N
A
S
L
N
TR
I
N
A
EH
-
ST
VT
LG
S
E
---------
----
-
D
L
Y
I
D
LN
D
G
---
NGVA
T
KPTL
G
K
SKATA
E
D
D-
---
Q
S
RF
N
G
H
V
QLQ
Q
GS
A
L
T
I
-
-----
--
-
N
EH
F
A
G
G
I
D
S
--
--
-
----------
A
DSA
T
T
I
T
S
T
D
T
TLNQ
L
SRFTQ
---
----
-
--
--
---
SS
L
S
LG
E
G
AK
L
T
A
T
AG---
L
L
S
DGTV
SS
NAG
A
------------
S
L
S
L
-L
S
D
QP
GTMY
-
--
--
----
S
A
QSW
E
LSGQD
----------
T
S
L
N
VG
-
AGGIIT
G
D
INANDAASISF
G
TTDINQS--
--T
N
-------------
Y
Y
G
-----
N
I
N
A
-
P
L
A
S
V
T
M
K
D
-
T
A
W
Q
V
N
KQ
S
V
AK
S
L
T
L
NG
S
T
L
-
S
F
N
RF-G
-
Q
G
G
LT
S
DTLE
--
A
T
N
-----------------
S
S
FI
I
N
AD
G
--------
KA
A
D
------------
T
V
T
V
N
Q
-------------------
A
LT
G
G
NNTLVV
I
PT
T
NSVKQ
G
GDP
V
SLV
---
T
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
N
T
QS
N
I
F
TLNPVS
I
NAGF
H
S
F
TP
Q
L
D
VLE
-----
TDVNKQWRLEGFYIQPDKAALR
T
GKSFM
DL
G
Y
KNFIT
E
IN
N
L
ND
RMGDLR
HTHG
E
-
T
G
A
W
A
R
LN
S
G
-----
S
G
S
AT
D
GF
T
GS
Y
T
H
L
Q
I
G
A
D
R
K
HIIEG
G
E
L
FT
G
V
TAT
F
T
SSN
---
N
R
G
T
GWSGR
T
K
S
T
GI
G
----
V
YAS
AMFD
S
G
L
Y
V
D
T
IG
K
YV
R
HD
N
HYS
-
--S
S
AL
G
MPE
QD
YG
S
HS
WY
L
GA
E
A
G
W
RF
S
L
PD
E
T
-
-
-
YI
Q
PQ
TE
L
I
Y
GTVSEN
Q
FAWQ
F
---
NG
GE
I
Y
MQ
R
K
QM
Q
P
L
I
GR
TG
IEF
G
KTF
R
-
G
KD
WE
M
TA
L
TG
I
N
Y
QY
DL
F
KP
T
V
TA
F
K
D
LAGDTY
I
N
N
G
KD
SRVVF
N
VGV
N
T
KIK
E
N
TRI
S
L
N
V
ERS
E
F
G
S
YN
I
D
K
L
I
N
ANI
R
YT
F
fig|431946.3.peg.4176
Escherichia coli SE15 (585-1254/1254)
R
N
S
VV
E
GD
I
V
A
S
N
-
ST
L
K
LG
GD
---------
----
V
P
V
F
I
D
MY
D
G
INI
TGNG
F
GFRQ
DV
REGRS
AD
DG
---
S
S
SY
T
G
K
I
TLQ
K
GS
T
L
D
I
-
-----
--
-
N
NR
F
I
G
G
I
E
A
--
--
-
----------
H
DSK
VN
V
T
S
P
D
A
LLQN
S
GVFVN
---
----
-
--
--
---
ST
L
S
VR
D
G
GH
L
T
A
Q
KG---
L
Y
S
DGRV
QI
GKN
G
------------
T
L
S
L
-S
G
T
P
E
NGAD
N
TW
M
PV
LTY
M
T
EGY
D
LTGDN
----------
A
T
L
N
IS
-
QQAHVS
G
D
VHATSSSSIRI
G
SENPGSVSS
SV
SPVLAAG
LF
S
GY
NAA
Y
Y
G
-----
A
I
T
G
-
GK
G
N
V
S
M
N
N
-
G
L
W
Q
L
T
G
D
S
D
IN
S
L
T
T
RN
S
R
V
-
Q
SE
EN-G
A
F
R
T
LT
V
KTLD
--
A
T
G
-----------------
S
D
F
V
L
R
T
D
L
--------
KD
A
D
------------
K
I
S
I
T
E
-------------------
KA
S
G
S
DNTLNV
S
FM
K
NPSPG
Q
SLN
I
PLV
---
SA
PA-----
-
----
-
-
-
-
-
-
--
---
-
GT
SG
D
I
F
KAG--T
R
VTGF
S
R
V
TP
T
L
R
VDT
-----
TGGSTKWILDGFRTEADKAAAA
K
ANSFM
N
A
G
Y
KSFMT
E
VN
N
L
NK
RMG
E
LR
DTNG
D
-
AG
A
W
A
R
IM
N
G
-----
A
G
S
AD
G
G
Y
S
DN
Y
T
H
V
Q
V
G
F
D
K
K
HALDG
V
D
L
FT
G
V
TMT
Y
T
DSS
---
A
D
S
D
AFSGK
T
K
S
V
GG
G
----
L
YAS
ALFN
S
G
A
Y
I
DL
IG
K
YI
H
HD
N
DYT
-
--G
N
FA
G
LGT
K
H
YG
T
HS
WY
A
GA
E
T
G
Y
R
Y
H
L
TE
DT
-
-
-
F
IEPQ
AE
L
V
Y
GAVSGK
T
FRWK
D
---
GE
M
D
L
S
MK
N
K
D
F
S
P
L
I
GR
TG
IEL
G
KTF
R
-
G
KD
WS
V
TA
R
AG
T
S
W
QF
DL
L
NN
G
E
TV
L
R
D
ASGEKR
I
K
G
E
KD
SRMLF
N
VG
M
N
A
QIK
DN
MRF
G
LE
F
E
K
S
A
F
G
K
YN
V
D
N
AI
N
AN
F
R
Y
M
F
fig|656417.3.peg.5218
Escherichia coli M605 (633-1283/1283)
L
N
L
QN
AE
F
NLA
R
N
A
S
L
N
TR
I
N
AD
H
-
ST
VT
LG
S
E
---------
----
-
D
L
Y
I
D
LN
D
G
---
NGVA
T
KPTL
G
K
SKATA
E
D
D-
---
Q
S
RF
N
G
H
V
QLQ
Q
GS
A
L
T
I
-
-----
--
-
N
EH
F
A
G
G
I
D
S
--
--
-
----------
A
DSA
T
T
I
T
S
T
D
T
TLNQ
L
SRFTQ
---
----
-
--
--
---
SS
L
S
LG
E
G
AK
L
T
A
T
AG---
L
L
S
DGTV
SS
NAG
A
------------
S
L
S
L
-L
S
D
QP
GTMY
-
--
--
----
S
A
QSW
E
LSGQD
----------
T
S
L
N
VG
-
AGGIIT
G
D
INANDAASISF
G
TTDINQS--
--T
N
-------------
Y
Y
G
-----
N
I
N
A
-
P
L
A
S
V
T
M
K
D
-
T
A
W
Q
V
N
KQ
S
V
AK
S
L
T
L
NG
S
T
L
-
S
F
N
RF-G
-
Q
G
G
LT
S
DTLE
--
A
T
N
-----------------
S
S
FI
I
N
AD
G
--------
KA
A
D
------------
T
V
T
V
N
Q
-------------------
A
LT
G
G
NNTLVV
I
PT
T
NSVKQ
G
GDP
V
SLV
---
T
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
N
T
QS
N
I
F
TLNPVS
I
NAGF
H
S
F
TP
Q
L
D
VLE
-----
TDVNKQWRLEGFYIQPDKAALR
T
GKSFM
DL
G
Y
KNFIT
E
IN
N
L
ND
RMGDLR
HTHG
E
-
T
G
A
W
A
R
LN
S
G
-----
S
G
S
AT
D
GF
T
GS
Y
T
H
L
Q
I
G
A
D
R
K
HIIEG
G
E
L
FT
G
V
TAT
F
T
SSN
---
N
R
G
T
GWSGR
T
K
S
T
GI
G
----
V
YAS
AMFD
S
G
L
Y
V
D
T
IG
K
YV
R
HD
N
HYS
-
--S
S
AL
G
MPE
QD
YG
S
HS
WY
L
GA
E
A
G
W
RF
S
L
PD
E
T
-
-
-
YI
Q
PQ
TE
L
I
Y
GTVSEN
Q
FAWQ
F
---
NG
GE
I
Y
MQ
R
K
QM
Q
P
L
I
GR
TG
IEF
G
KTF
R
-
G
KD
WE
M
TA
L
TG
I
N
Y
QY
DL
F
KP
T
V
TA
F
K
D
LAGDTY
I
N
N
G
KD
SRVVF
N
VGV
N
T
KIK
E
N
TRI
S
L
N
V
ERS
E
F
G
S
YN
I
D
K
L
I
N
ANI
R
YT
F
fig|340185.3.peg.3806
Escherichia coli E22 (643-1290/1290)
N
AD
F
SLA
R
N
A
S
L
S
TL
I
N
AD
H
-
ST
VT
LG
S
E
---------
----
-
N
L
Y
I
D
LN
D
G
---
NGAK
T
TPSF
G
Q
SKATN
A
A
D-
---
Q
S
RF
S
G
R
V
QLK
N
GS
T
L
N
I
-
-----
--
-
N
EH
F
V
G
G
I
D
S
--
--
-
----------
A
DSS
V
T
V
A
S
T
D
A
LFSQ
Y
SRFRH
---
----
-
--
--
---
SS
L
S
LA
D
G
AK
L
T
VT
SG---
L
A
S
DRGV
TA
GAG
S
------------
T
L
S
L
-L
S
G
Q-
---Y
-
--
--
----
A
A
ERW
S
LAGQG
----------
T
T
L
N
VA
-
AGATLA
G
N
IQADNVASINF
G
VTDQIGSVI
R
V
TG
-------------
Y
R
G
-----
N
I
N
A
-
P
L
A
D
V
T
M
T
N
-
I
G
W
Q
AD
S
G
S
T
VK
S
L
D
L
KG
S
Q
V
-
S
F
N
QT-G
G
P
G
S
LT
V
DDFV
--
A
S
N
-----------------
S
Q
FI
V
N
T
D
G
--------
KT
A
D
------------
T
V
T
V
KQ
-------------------
S
LT
G
K
NNVLTV
V
PT
A
LPVNN
E
VSS
V
PLV
---
T
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
S
T
SA
D
V
L
TLNPVT
Q
HAGF
H
T
F
TP
Q
V
G
IVE
-----
SEDSKQWLLEGFDVQQDKALLQ
S
GKSFM
DM
E
Y
KNFLT
E
MN
N
L
NY
RMGDLR
NTLG
E
-
S
G
T
W
A
R
IF
S
G
-----
T
G
S
AE
A
G
Y
S
DS
WT
H
L
Q
I
G
A
D
R
K
HTFDG
A
D
L
FT
G
V
TAT
F
T
NSN
---
R
H
GD
GWSGQ
T
K
S
T
GV
G
----
V
Y
T
S
VMFD
S
G
L
Y
V
D
A
IG
K
YV
R
HD
N
HYS
-
--V
S
EM
G
MPE
QD
YN
S
HS
WY
L
GA
E
T
G
W
RF
F
L
PG
E
T
-
-
-
F
I
Q
PQ
TE
L
V
Y
GKASGN
Q
FAWQ
S
---
A
G
S
D
I
R
MQ
R
D
QM
K
P
L
I
GR
TG
VES
G
KTF
R
-
G
KD
WE
L
MT
V
AG
V
H
Y
QY
DL
F
NP
SK
TV
V
H
D
FAGDTY
I
R
N
G
KD
NRVNF
S
L
GV
N
T
RIK
E
N
TRI
S
L
N
I
ERS
A
F
G
H
Y
D
I
D
K
AI
N
ANI
R
Y
SF
fig|340185.4.peg.4006
Escherichia coli E22 (597-1244/1244)
N
AD
F
SLA
R
N
A
S
L
S
TL
I
N
AD
H
-
ST
VT
LG
S
E
---------
----
-
N
L
Y
I
D
LN
D
G
---
NGAK
T
TPSF
G
Q
SKATN
A
A
D-
---
Q
S
RF
S
G
R
V
QLK
N
GS
T
L
N
I
-
-----
--
-
N
EH
F
V
G
G
I
D
S
--
--
-
----------
A
DSS
V
T
V
A
S
T
D
A
LFSQ
Y
SRFRH
---
----
-
--
--
---
SS
L
S
LA
D
G
AK
L
T
VT
SG---
L
A
S
DRGV
TA
GAG
S
------------
T
L
S
L
-L
S
G
Q-
---Y
-
--
--
----
A
A
ERW
S
LAGQG
----------
T
T
L
N
VA
-
AGATLA
G
N
IQADNVASINF
G
VTDQIGSVI
R
V
TG
-------------
Y
R
G
-----
N
I
N
A
-
P
L
A
D
V
T
M
T
N
-
I
G
W
Q
AD
S
G
S
T
VK
S
L
D
L
KG
S
Q
V
-
S
F
N
QT-G
G
P
G
S
LT
V
DDFV
--
A
S
N
-----------------
S
Q
FI
V
N
T
D
G
--------
KT
A
D
------------
T
V
T
V
KQ
-------------------
S
LT
G
K
NNVLTV
V
PT
A
LPVNN
E
VSS
V
PLV
---
T
A
PK-----
-
----
-
-
-
-
-
-
--
---
-
S
T
SA
D
V
L
TLNPVT
Q
HAGF
H
T
F
TP
Q
V
G
IVE
-----
SEDSKQWLLEGFDVQQDKALLQ
S
GKSFM
DM
E
Y
KNFLT
E
MN
N
L
NY
RMGDLR
NTLG
E
-
S
G
T
W
A
R
IF
S
G
-----
T
G
S
AE
A
G
Y
S
DS
WT
H
L
Q
I
G
A
D
R
K
HTFDG
A
D
L
FT
G
V
TAT
F
T
NSN
---
R
H
GD
GWSGQ
T
K
S
T
GV
G
----
V
Y
T
S
VMFD
S
G
L
Y
V
D
A
IG
K
YV
R
HD
N
HYS
-
--V
S
EM
G
MPE
QD
YN
S
HS
WY
L
GA
E
T
G
W
RF
F
L
PG
E
T
-
-
-
F
I
Q
PQ
TE
L
V
Y
GKASGN
Q
FAWQ
S
---
A
G
S
D
I
R
MQ
R
D
QM
K
P
L
I
GR
TG
VES
G
KTF
R
-
G
KD
WE
L
MT
V
AG
V
H
Y
QY
DL
F
NP
SK
TV
V
H
D
FAGDTY
I
R
N
G
KD
NRVNF
S
L
GV
N
T
RIK
E
N
TRI
S
L
N
I
ERS
A
F
G
H
Y
D
I
D
K
AI
N
ANI
R
Y
SF
fig|340197.3.peg.1397
Escherichia coli F11 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
DI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
ILGN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
A
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|656440.3.peg.153
Escherichia coli TA206 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
DI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
ILGN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
A
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|405955.9.peg.235
Escherichia coli APEC O1 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|199310.1.peg.374
Escherichia coli CFT073 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|749528.3.peg.1421
Escherichia coli MS 45-1 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|753642.3.peg.1353
Escherichia coli NC101 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|869729.3.peg.3412
Escherichia coli UM146 (672-1376/1376)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|364106.7.peg.411
Escherichia coli UTI89 (36-740/740)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|364106.8.peg.409
Escherichia coli UTI89 (36-740/740)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
EI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
VLEN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
V
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|362663.8.peg.342
Escherichia coli 536 (330-1034/1034)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
DI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
ILGN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
A
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|362663.9.peg.341
Escherichia coli 536 (330-1034/1034)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
DI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
ILGN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
A
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|340197.5.peg.1477
Escherichia coli F11 (330-1034/1034)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
DI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
ILGN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
A
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|749550.3.peg.957
Escherichia coli MS 200-1 (330-1034/1034)
NTVSSLGDNSVLTQPTSFTQDDWENRTFSFGSL
V
L
KD
TD
F
GLG
R
N
A
T
L
N
TT
I
Q
ADN
-
S
SVT
LG
-
D
---------
----
S
R
V
F
I
D
KK
D
G
---
QGTA
F
TLEE
GT
SVATK
D
A
D-
---
K
S
VF
N
G
T
V
NLD
N
Q
S
V
L
N
I
-
-----
--
-
N
DI
F
N
G
G
I
Q
A
--
--
-
----------
N
NST
VN
I
S
S
D
S
A
ILGN
S
-TLTS
---
----
-
--
--
---
TA
LN
LN
KG
AN
A
L
A
S
QS---
FVS
DGPV
N
I
-SD
A
------------
T
L
S
L
-N
S
R
PD
EVSH
-
TL
LPVYD
-
Y
A
GSW
N
LKGDD
----------
A
R
L
N
VG
-
PYSMLS
G
N
INVQDKGTVTL
G
GEGELSPDL
TLQNQMLYSLFNGYRNT
W
S
G
-----
S
L
N
A
-
PDA
T
V
S
M
T
D
-
T
Q
W
S
M
NG
NS
T
AG
N
M
K
L
NR
T
I
V
-
G
F
N
GGTS
S
F
T
T
LT
T
DNLD
--
A
V
Q
-----------------
S
A
F
V
MR
T
D
L
--------
NK
A
D
------------
K
L
V
I
NK
-------------------
S
A
T
G
H
DNSIWV
N
FL
K
KPSDK
D
TLD
I
PLV
---
SA
PE-----
-
----
-
-
-
-
-
-
--
---
-
A
T
AD
N
L
F
RAS--T
R
VVGF
S
D
V
TP
T
L
S
VRK
-----
EDGKKEWVLDGYQVARNDGQGK
A
AATFM
H
I
S
Y
NNFIT
E
VN
N
L
NK
RMGDLR
DING
E
-
AG
T
W
V
R
LL
N
G
-----
S
G
S
AD
G
GF
T
DH
Y
T
L
L
Q
M
G
A
D
R
K
HELGS
M
D
L
FT
G
V
MAT
Y
T
DTD
---
A
S
AG
LYSGK
T
K
S
W
GG
G
----
F
YAS
GLFR
S
G
A
Y
F
DL
IA
K
YI
H
NE
N
KYD
-
--L
N
FA
G
AGK
Q
N
FR
S
HS
LY
A
GA
E
V
G
Y
R
Y
H
L
TD
T
T
-
-
-
FV
EPQ
AE
L
V
W
GRLQGQ
T
FNWN
D
---
S
G
M
DVS
MR
R
N
SVNP
L
V
GR
TG
VVS
G
KTF
S
-
G
KD
WS
L
TA
R
AG
L
H
Y
EF
DLT
DS
A
D
VH
L
K
D
AAGEHQ
I
N
G
R
KD
GRMLY
GVG
L
N
A
RFG
DN
TRL
G
LE
V
ERS
A
F
G
K
YN
T
DDAI
N
ANI
R
Y
SF
fig|749527.3.peg.1275
Escherichia coli MS 21-1 (242-969/969)
I
N
VS
N
-
ST
VT
TG
S
N
TVLESSG
Y
G
HFGN
S
G
E
P
S
D
YA
G
P
---
GDVA
L
SFTD
S
T
SDYAM
K
N
NV
YFS
N
S
TL
M
G
D
V
AFT
S
TW
N
A
N
F
D
PSG--
--
-
H
DS
NG
D
G
V
K
D
TN
A
G
W
----------
V
DDS
L
N
V
D
E
L
N
I
TLDN
G
SKWVG
---
----
-
--
--
---
--
--
--
--
--
-
Q
A
T
FNAET
I
SP
DTMY
D
V
ATN
S
LTPGGTAEVNGW
N
R
I
I
DN
K
V
FQ
SGVF
N
VA
L
N
----
N
G
SEW
D
TTGDS
LVDTLTVNNG
S
Q
V
N
VS
-
DSSLTS
D
T
IDLTNGSSLNI
G
---------
--
---------------
E
D
G
YVD
T
D
H
L
TINSYS
T
VA
L
T
ES
T
G
W
G
ADY
N
LY
AN
T
I
T
V
TN
G
G
V
L
D
VN
VDQF
D
T
EA
F
R
T
DKLE
LT
S
G
N
IADNNGNVVSGVFDIHS
S
D
Y
V
L
N
AD
L
V
N
D
RTWDT
S
K
S
N
YGYGIVAMNSDG
H
L
T
I
N
G
NGDMNNGDELDNSSVDNVV
A
A
T
GN
YKVRID
N
AT
G
AGAIA
D
YKD
K
EII
YVNDV
NTNATFS
A
ANKA
D
L
G
A
Y
T
Y
Q
AEQ
R
G
-
NT
V
V
L
QQMELT
D
YANM
A
L
S
I
P
-
-
-
---
-----
SANTNIWNL-------------
-
-----
--
-
-
-----
E
QD
T
V
GT
R
L
T
N
S
R
HGLA
D
N
G
G
A
W
V
S
YF
GG
NFNGD
N
G
T
IN
-
-
Y
D
QD
V
N
GI
M
V
G
V
D
T
K
IDGNN
A
K
W
IV
G
A
AAG
F
-
---
---
A
K
GD
MNDRS
G
Q
V
D
QD
SQTAY
I
Y
S
S
AHFA
N
N
V
F
V
D
G
SL
S
YS
H
FN
N
DLS
A
T-M
S
NG
T
YVD
G
S
TN
SD
A
WG
F
GL
K
A
G
Y
D
F
K
L
GD
A
G
-
-
-
Y
V
T
P
-
--
-
-
Y
GSVSGL
F
QSGD
DYQLS
N-
N
M
K
VD
G
Q
S
Y
D
S
M
RY
E
L
G
VDA
GY
TF
T
Y
S
E
D
QA
L
TP
Y
FK
L
A
Y
VY
D
D
S
NN
H-
--
-
N
D
VNGDSI
D
N
G
T
EG
SAVRV
G
L
G
T
Q
F
SFT
K
N
FSA
Y
T
D
A
N
Y
L
G
G
G
D
V
D
Q
D
WSA
N
V
G
V
K
YT
W
fig|585057.4.peg.336
Escherichia coli IAI39 (242-969/969)
I
N
VS
N
-
ST
VT
TG
S
N
TVLESSG
Y
G
HFGN
S
G
E
P
S
D
YA
G
P
---
GDVA
L
SFTD
S
T
SDYAM
K
N
NV
YFS
N
S
TL
V
G
D
V
AFT
S
TW
N
A
N
F
D
PSG--
--
-
H
DS
NG
D
G
V
K
D
TN
A
G
W
----------
V
DDS
L
N
V
D
E
L
N
I
TLDN
G
SKWVG
---
----
-
--
--
---
--
--
--
--
--
-
Q
A
T
FNAET
I
SP
DTMY
D
V
ATN
S
LTPGGTAEANGW
N
R
I
I
DN
K
V
FQ
SGVF
N
VA
L
N
----
N
G
SEW
D
TTGRS
VVDTLTVNNA
S
Q
V
N
VS
-
ESKLTS
D
T
IDLTNGSSLNI
G
---------
--
---------------
E
D
G
YVD
T
D
H
L
TINSYS
T
VA
L
T
ES
T
G
W
G
ADY
N
LY
AN
T
I
T
V
TN
G
G
V
L
D
VN
VDQF
D
T
EA
F
R
T
DKLE
LT
S
G
N
IADNNGNVVSGVFDIHS
S
D
Y
V
L
N
AD
L
V
N
D
RTWDT
S
K
S
N
YGYGIVAMNSDG
H
L
T
I
N
G
NGDMNNGDELDNSSVDNVV
A
A
T
GN
YKVRID
N
AT
G
AGAIA
D
YKD
K
EII
YVNDV
NSNATFS
A
ANKA
D
L
G
A
Y
T
Y
Q
AEQ
R
G
-
NT
V
V
L
QQMELT
D
YANM
A
L
S
I
P
-
-
-
---
-----
SANTNIWNL-------------
-
-----
--
-
-
-----
E
QD
T
V
GT
R
L
T
N
S
R
HGLA
D
N
G
G
A
W
V
S
YF
GG
NFNGD
N
G
T
IN
-
-
Y
D
QD
V
N
GI
M
V
G
V
D
T
K
IDGNN
A
K
W
IV
G
A
AAG
F
-
---
---
A
K
GD
MNDRS
G
Q
V
D
QD
SQTAY
I
Y
S
S
AHFA
N
N
V
F
V
D
G
SL
S
YS
H
FN
N
DLS
A
T-M
S
NG
T
YVD
G
S
TN
SD
A
WG
F
GL
K
A
G
Y
D
F
K
L
GD
A
G
-
-
-
Y
V
T
P
-
--
-
-
Y
GSVSGL
F
QSGD
DYQLS
N-
N
M
K
VD
G
Q
S
Y
D
S
M
RY
E
L
G
VDA
GY
TF
T
Y
S
E
D
QA
L
TP
Y
FK
L
A
Y
VY
D
D
S
NN
H-
--
-
N
D
VNGDSI
D
N
G
T
EG
SAVRV
G
L
G
T
Q
F
SFT
K
N
FSA
Y
T
D
A
N
Y
L
G
G
G
D
V
D
Q
D
WSA
N
V
G
V
K
YT
W
fig|585057.6.peg.335
Escherichia coli IAI39 (242-969/969)
I
N
VS
N
-
ST
VT
TG
S
N
TVLESSG
Y
G
HFGN
S
G
E
P
S
D
YA
G
P
---
GDVA
L
SFTD
S
T
SDYAM
K
N
NV
YFS
N
S
TL
V
G
D
V
AFT
S
TW
N
A
N
F
D
PSG--
--
-
H
DS
NG
D
G
V
K
D
TN
A
G
W
----------
V
DDS
L
N
V
D
E
L
N
I
TLDN
G
SKWVG
---
----
-
--
--
---
--
--
--
--
--
-
Q
A
T
FNAET
I
SP
DTMY
D
V
ATN
S
LTPGGTAEANGW
N
R
I
I
DN
K
V
FQ
SGVF
N
VA
L
N
----
N
G
SEW
D
TTGRS
VVDTLTVNNA
S
Q
V
N
VS
-
ESKLTS
D
T
IDLTNGSSLNI
G
---------
--
---------------
E
D
G
YVD
T
D
H
L
TINSYS
T
VA
L
T
ES
T
G
W
G
ADY
N
LY
AN
T
I
T
V
TN
G
G
V
L
D
VN
VDQF
D
T
EA
F
R
T
DKLE
LT
S
G
N
IADNNGNVVSGVFDIHS
S
D
Y
V
L
N
AD
L
V
N
D
RTWDT
S
K
S
N
YGYGIVAMNSDG
H
L
T
I
N
G
NGDMNNGDELDNSSVDNVV
A
A
T
GN
YKVRID
N
AT
G
AGAIA
D
YKD
K
EII
YVNDV
NSNATFS
A
ANKA
D
L
G
A
Y
T
Y
Q
AEQ
R
G
-
NT
V
V
L
QQMELT
D
YANM
A
L
S
I
P
-
-
-
---
-----
SANTNIWNL-------------
-
-----
--
-
-
-----
E
QD
T
V
GT
R
L
T
N
S
R
HGLA
D
N
G
G
A
W
V
S
YF
GG
NFNGD
N
G
T
IN
-
-
Y
D
QD
V
N
GI
M
V
G
V
D
T
K
IDGNN
A
K
W
IV
G
A
AAG
F
-
---
---
A
K
GD
MNDRS
G
Q
V
D
QD
SQTAY
I
Y
S
S
AHFA
N
N
V
F
V
D
G
SL
S
YS
H
FN
N
DLS
A
T-M
S
NG
T
YVD
G
S
TN
SD
A
WG
F
GL
K
A
G
Y
D
F
K
L
GD
A
G
-
-
-
Y
V
T
P
-
--
-
-
Y
GSVSGL
F
QSGD
DYQLS
N-
N
M
K
VD
G
Q
S
Y
D
S
M
RY
E
L
G
VDA
GY
TF
T
Y
S
E
D
QA
L
TP
Y
FK
L
A
Y
VY
D
D
S
NN
H-
--
-
N
D
VNGDSI
D
N
G
T
EG
SAVRV
G
L
G
T
Q
F
SFT
K
N
FSA
Y
T
D
A
N
Y
L
G
G
G
D
V
D
Q
D
WSA
N
V
G
V
K
YT
W
Consen1
Primary consensus
QGntvsslGdnsvltqpTSftQdDWEnRtFsfGsL
Lmg
v
kN
vvd
t
----
Stik
gdnahglwsfg
n
l
vD
g
---
v
Gt
ds
---
S
gg
l
sgs
a
in
aq
N
fs
Gs
g
aq
----------
a
vnm
n
di
n
glw
g
tg
---
--
ds
i
ga
iya
qi
s
------------
dl
i
S
pd
-
ma
----
a
d
----------
s
iN
-
Gs
g
sv
---------------
w
Gsslsd
vng
-
gkl
Vam
n
-
s
W
vtsnSn
tl
l
s
V
-
fa
t
gTfat
nls
n
-----------------
stfimrad
vgegngvnnk
d
------------
L
Isg
-------------------
ssaGn
n
s
n
t
---
tt
a
e
g
y
yd
ngt
e
y
e
p
ptpa
a
ivnpd
a
nv
y
e
tL
RmgdlR
d
-
gn
Wlr
gG
a
g
sgfd
ysgiq
GgD
r
m
l
Gl
st
pdys
gd
arsd
G
----
YaS
nG
y
Dl
k
r
N
v
q
G
an
ang
i
e
Gqrf
l
tg
g
yiePq
l
y
t
----
nGlnih
-
hyesllGras
Gydit
-
gn
l
y
t
a
efs
te
L
n
f
G
--
GvGv
A
kq
yle
dyt
-
g
lf
qkqvNggyrfsF
Consen2
Secondary consensus
hpiiha
tt
------
as
s
t
q
tl
k
kd
f
r
t
n
i
adn
svt
-
e
---------
i
d
f
n
k
ni
v
qn
l
l
-
--
y
i
a
--
gnllqgfdgdn
it
a
s
---
-
--
iht
ln
kg
a
fvs
ng
a
tq
l
ae
s
lpvyd
s
n
v
n
s
tlqnqmlyslfngyrnty
-----
lsaapda
l
dgt
ngs
t
m
v
t
i
t
lt
-
--
a
q
g
lvvdt
-
--------
ss
n
nk
ka
s
k
i
sa
-
-
-
-
--
-
aa
f
s
viqt
s
-----
-
h
n
q
n
qdav
ekag
q
n
s
g
-
yt
v
l
m
a
k
g
w
v
y
-
---
a
ag
kgw
ysfh
l
s
f
t
q
h
-
n
qd
hs
a
k
yt
-
dt
-
fvq
-
-
w
a
dyqls
dvs
r
svnpv
etg
ktf
kd
v
r
l
y
dlt
d
i
kd
a
l
dn
gas
ers
fs
yn
ddai
anihyt
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character