fig|1040638.4.peg.4195
Escherichia coli O104:H4 str. LB226692
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
G
A
G
M
N
IAWRFE
fig|6666666.5357.peg.1824
Escherichia coli TY-2482
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
G
A
G
M
N
IAWRFE
fig|585055.6.peg.1612
Escherichia coli 55989
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
G
A
G
M
N
IAWRFE
fig|585055.8.peg.1615
Escherichia coli 55989
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
G
A
G
M
N
IAWRFE
fig|550672.3.peg.945
Escherichia coli B088
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|331111.12.peg.1915
Escherichia coli E24377A
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|331111.3.peg.4075
Escherichia coli E24377A
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656408.3.peg.1584
Escherichia coli H591
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|585034.4.peg.1436
Escherichia coli IAI1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|585034.5.peg.1432
Escherichia coli IAI1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|679206.4.peg.3003
Escherichia coli MS 119-7
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|679204.3.peg.1298
Escherichia coli MS 145-7
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656443.3.peg.1782
Escherichia coli TA271
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|340184.3.peg.114
Escherichia coli B7A
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|340184.6.peg.119
Escherichia coli B7A
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|409438.11.peg.1677
Escherichia coli SE11
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
K
Q
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|585396.4.peg.1916
Escherichia coli O111:H- str. 11128
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
S
T
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|749532.3.peg.988
Escherichia coli MS 78-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
P
V
GSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|679207.4.peg.1351
Escherichia coli MS 107-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GD
E
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|573235.3.peg.2085
Escherichia coli O26:H11 str. 11368
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
Q
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|340186.3.peg.193
Escherichia coli E110019
MKI
VS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|340186.5.peg.208
Escherichia coli E110019
MKI
VS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|344601.3.peg.222
Escherichia coli B171
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
T
S
RI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|344601.5.peg.219
Escherichia coli B171
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
T
S
RI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|340185.3.peg.1084
Escherichia coli E22
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
T
S
RI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|340185.4.peg.1139
Escherichia coli E22
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
T
S
RI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|585395.4.peg.1658
Escherichia coli O103:H2 str. 12009
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
T
S
RI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|679205.4.peg.4668
Escherichia coli MS 124-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|749533.3.peg.80
Escherichia coli MS 84-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|749527.3.peg.3747
Escherichia coli MS 21-1 (45-744/744)
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|595495.4.peg.4823
Escherichia coli KO11
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
Y
M
DG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|566546.3.peg.10
Escherichia coli W
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
Y
M
DG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|566546.4.peg.1569
Escherichia coli W
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
Y
M
DG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|595496.3.peg.1406
Escherichia coli BW2952
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|536056.3.peg.2305
Escherichia coli DH1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|316401.4.peg.1747
Escherichia coli ETEC H10407
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656414.3.peg.1738
Escherichia coli H736
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|83333.1.peg.1437
Escherichia coli K12
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|749548.3.peg.4696
Escherichia coli MS 196-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|316407.3.peg.1410
Escherichia coli W3110
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|316385.5.peg.1564
Escherichia coli str. K-12 substr. DH10B
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|316385.7.peg.1604
Escherichia coli str. K-12 substr. DH10B
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|511145.12.peg.1517
Escherichia coli str. K-12 substr. MG1655
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|511145.6.peg.1503
Escherichia coli str. K-12 substr. MG1655
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
L
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
S
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|753642.3.peg.1711
Escherichia coli NC101
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|685038.3.peg.1443
Escherichia coli O83:H1 str. NRG 857C
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|585397.7.peg.1618
Escherichia coli ED1a
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
N
G
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|585397.9.peg.1611
Escherichia coli ED1a
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
N
G
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|749546.3.peg.802
Escherichia coli MS 185-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
N
G
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|749528.3.peg.1200
Escherichia coli MS 45-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
N
G
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|525281.3.peg.19
Escherichia coli 83972
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
N
G
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VP
S
L
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|655817.3.peg.1763
Escherichia coli ABU 83972
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
N
G
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VP
S
L
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|481805.3.peg.2370
Escherichia coli ATCC 8739
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|481805.6.peg.2360
Escherichia coli ATCC 8739
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|331112.3.peg.1442
Escherichia coli HS
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|331112.6.peg.1502
Escherichia coli HS
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|562.376.peg.3052
Escherichia coli WV_060327
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELD
I
PAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|670888.3.peg.2120
Escherichia coli 1827-70
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
T
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|405955.13.peg.1576
Escherichia coli APEC O1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|405955.9.peg.1284
Escherichia coli APEC O1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|714962.3.peg.1643
Escherichia coli IHE3034
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|869729.3.peg.2074
Escherichia coli UM146
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|364106.7.peg.1713
Escherichia coli UTI89
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|364106.8.peg.1714
Escherichia coli UTI89
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|216592.1.peg.1989
Escherichia coli 042 (54-753/753)
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
T
PQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|216592.3.peg.1640
Escherichia coli 042 (54-753/753)
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
T
PQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656437.3.peg.1665
Escherichia coli TA143
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
T
PQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656379.3.peg.3487
Escherichia coli FVEC1302
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
A
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656380.3.peg.2841
Escherichia coli FVEC1412
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
A
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|585056.7.peg.1891
Escherichia coli UMN026 (54-753/753)
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
A
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|344610.3.peg.1050
Escherichia coli 53638
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
S
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|585035.6.peg.1535
Escherichia coli S88
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
Y
I
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|656393.3.peg.2185
Escherichia coli H299
MKI
FS
VRQTV
F
PALLVLS
---
P
T
VFAAD
EQTMI
V
SA
APQ
V
I
SELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
M
S
PQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|749549.3.peg.3868
Escherichia coli MS 198-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
R
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
A
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|656417.3.peg.1719
Escherichia coli M605
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTV
A
TT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
M
RK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
K
D
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|656444.3.peg.2235
Escherichia coli TA280
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
V
IT
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
M
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
L
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
E
QRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
K
G
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
M
N
IAWRFE
fig|562.371.peg.1762
Escherichia coli 1044A
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|562.373.peg.5108
Escherichia coli 1125A
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|562.372.peg.1247
Escherichia coli 1212A
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|562.374.peg.2338
Escherichia coli 536A
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|83334.1.peg.2085
Escherichia coli O157:H7
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444454.5.peg.959
Escherichia coli O157:H7 str. EC4024
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444449.5.peg.286
Escherichia coli O157:H7 str. EC4042
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444448.5.peg.4642
Escherichia coli O157:H7 str. EC4045
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444453.5.peg.2838
Escherichia coli O157:H7 str. EC4076
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444452.5.peg.1979
Escherichia coli O157:H7 str. EC4113
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444450.8.peg.2104
Escherichia coli O157:H7 str. EC4115
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444451.5.peg.1970
Escherichia coli O157:H7 str. EC4196
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|444447.5.peg.5553
Escherichia coli O157:H7 str. EC4206
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|478004.5.peg.2829
Escherichia coli O157:H7 str. EC4401
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|478005.5.peg.2991
Escherichia coli O157:H7 str. EC4486
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|478006.5.peg.1961
Escherichia coli O157:H7 str. EC4501
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|478007.5.peg.2165
Escherichia coli O157:H7 str. EC508
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|478008.5.peg.3685
Escherichia coli O157:H7 str. EC869
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|637388.3.peg.1475
Escherichia coli O157:H7 str. FRIK2000
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|570506.3.peg.3238
Escherichia coli O157:H7 str. FRIK966
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|386585.9.peg.2156
Escherichia coli O157:H7 str. Sakai
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|544404.4.peg.1966
Escherichia coli O157:H7 str. TW14359
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|502346.5.peg.5316
Escherichia coli O157:H7 str. TW14588
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|701177.3.peg.1800
Escherichia coli O55:H7 str. CB9615
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|431946.3.peg.1428
Escherichia coli SE15
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
G
D
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTV
A
TT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
M
RK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|362663.8.peg.1466
Escherichia coli 536
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWT
Q
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|362663.9.peg.1470
Escherichia coli 536
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWT
Q
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|340197.3.peg.2988
Escherichia coli F11
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWT
Q
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|340197.5.peg.3121
Escherichia coli F11
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWT
Q
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|749550.3.peg.575
Escherichia coli MS 200-1
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWT
Q
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHN
F
TV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|749537.3.peg.963
Escherichia coli MS 115-1
MKI
FS
VRQTVLPALL
A
LS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
E
G
MRLA
-
TPRI
NL
SES
L
T
S
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
E
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
T
I
GL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
I
T
P
S
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
L
I
G
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|670897.3.peg.4892
Escherichia coli 2362-75
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGD
F
DYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
T
L
--
--
-
-
---
-
S
TGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRT
I
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|216593.1.peg.262
Escherichia coli E2348/69
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGD
F
DYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
T
L
--
--
-
-
---
-
S
TGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRT
I
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|574521.7.peg.1627
Escherichia coli O127:H6 str. E2348/69
MKI
FS
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IPATMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGD
F
DYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
E
AEWKAN
P
QQAPR
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
AQ
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
GG
V
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
T
L
--
--
-
-
---
-
S
TGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
R
--
-----------
G
V
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
V
DP
YLQ
TQW
Q
L
T
D
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
NVYL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRT
I
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
K
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
A
F
A
S
IGY
I
P
E
D
G
W
YA
G
T
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
I
N
IAWRFE
fig|155864.1.peg.2018
Escherichia coli O157:H7 EDL933
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IP
X
TMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQA
X
R
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|155864.8.peg.1843
Escherichia coli O157:H7 EDL933
MKI
VF
VRQTVLPALLVLS
---
PVVFAAD
EQTMI
V
SA
APQ
VVSELDTPAA
V
S
------
VV
DG
EE
MRLA
-
TPRI
NL
SES
L
T
G
-
VPGL
Q
V
----
Q
N
RQNYAQ
D
----
LQLSI
RG
FGSRSTYGI
R
GIRL
YVDG
IP
X
TMP
---------
--
D
GQG
Q
TSN
ID
LSSVQN
VEV
LR
GP
FSA
LYG
N
-
A
S
GG
VM
N
VT
T
QT
GQ
Q
------
PPT
IEAS
SYY
GS
FG
S
W
R
YG
L
KAT
GA
T
GDGT
QPGDVDYTVSTT
R
FTTH
G
YRD
H
----------------------
SGA
Q
KN
L
A
N
A
------
------
--
K
L
GVR
ID
D
A
SK
L
S
LI
FNS
V
DIKA
DD
P
GGL
------
T
K
AEWKAN
P
QQA
X
R
--
-
-
--------------
-
AE
QYD
TRK
T
IK
Q
TQ
AG
L
RYER
SLS
SR
DDMSVMMYAGE
RETT
QYQSI
P
MAPQ
L
NPSHA
G
SV
-----
ITLQRHYQGID
S
RWTH
R
-------------
GEL
G
VP
V
TF
--
--
-
-
---
-
TTGL
NYENMSEN
R
K
G
Y
N
NFR
L
N
S
--
-----------
G
M
PEY
G
QKGE
-----
L
R
R
D
ERNLMW
N
I
DP
YLQ
TQW
Q
LS
E
KLS
L
DA
GVRY
SSVWFDSN
D
H
Y
-------------------
V
T
P
G
N
GDD
S
------
----
G
D
A
S
YH
K
WL
PAGSL
K
Y
-----------
AM
T
DAW
N
I
YL
AAGR
G
F
E
T
P
TI
N
ELS
Y
RAD
G
--
Q
S
GM
N
F
G
LKP
STN
-------------
DT
I
EIG
S
K
TRIG
D
----
G
LL
SL
ALF
Q
TD
T
D
D
E
I
V
-------
VDSSSGGRTT
Y
KNA
GK
T
R
RQ
G
A
E
L
A
-
---------
W
DQRFAGD
F
R
V
N
ASW
T
WLD
AT
YRSNVCNE
QD
CNGNRM
P
GIARNM
G
F
A
S
IGY
V
P
E
D
G
W
YA
G
M
E
ARY
M
GD
IMA
D
DENTAKA
P
S
-------
---
YT
LVG
L
FT
GY
KY
N
YHNLTV
DL
FGR
V
D
NLFD
KE
YV
G
S
VIVNES
N
GRYYEP
A
P
G
RN
Y
GV
G
V
N
IAWRFE
fig|362663.8.peg.826
Escherichia coli 536 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATA
Y
GGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TE
D
Y
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|362663.9.peg.826
Escherichia coli 536 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATA
Y
GGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TE
D
Y
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|550677.3.peg.2074
Escherichia coli B354 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
Q
A
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPD
N
SIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
T
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656444.3.peg.1356
Escherichia coli TA280 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
Q
A
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
T
V
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|679206.4.peg.4022
Escherichia coli MS 119-7 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
TTG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656443.3.peg.1079
Escherichia coli TA271 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
TTG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|670888.3.peg.1366
Escherichia coli 1827-70 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|481805.3.peg.3050
Escherichia coli ATCC 8739 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|481805.6.peg.3036
Escherichia coli ATCC 8739 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|595496.3.peg.728
Escherichia coli BW2952 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|536056.3.peg.2994
Escherichia coli DH1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|562.375.peg.4396
Escherichia coli EC4100B (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|316401.4.peg.1011
Escherichia coli ETEC H10407 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656414.3.peg.1004
Escherichia coli H736 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|331112.3.peg.796
Escherichia coli HS (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|331112.6.peg.830
Escherichia coli HS (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|83333.1.peg.791
Escherichia coli K12 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749537.3.peg.3550
Escherichia coli MS 115-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749538.3.peg.727
Escherichia coli MS 116-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|679205.4.peg.2075
Escherichia coli MS 124-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749540.3.peg.2741
Escherichia coli MS 146-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749544.3.peg.2689
Escherichia coli MS 175-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749533.3.peg.4816
Escherichia coli MS 84-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|316407.3.peg.772
Escherichia coli W3110 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|316385.7.peg.881
Escherichia coli str. K-12 substr. DH10B (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|511145.12.peg.832
Escherichia coli str. K-12 substr. MG1655 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|511145.6.peg.824
Escherichia coli str. K-12 substr. MG1655 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585057.6.peg.825
Escherichia coli IAI39 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749527.3.peg.4375
Escherichia coli MS 21-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|216592.1.peg.3331
Escherichia coli 042 (24-749/760)
S
I
TP
V
A
Q
A
L
A
-
A
EG
Q
A
N
A
D
D
T
L
V
V
E
A
S
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|562.376.peg.4800
Escherichia coli WV_060327 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDS
I
T
--
-
DTA
T
MR
F
E
H
DIN
DN
A
TI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656379.3.peg.1641
Escherichia coli FVEC1302 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656380.3.peg.1469
Escherichia coli FVEC1412 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749549.3.peg.2321
Escherichia coli MS 198-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656437.3.peg.886
Escherichia coli TA143 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585056.7.peg.1144
Escherichia coli UMN026 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340185.3.peg.2852
Escherichia coli E22 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNP
M
TLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749531.3.peg.1749
Escherichia coli MS 69-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
Q
A
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
I
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPD
N
SIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYR
F
HPG
E
P
fig|216593.1.peg.1400
Escherichia coli E2348/69 (25-749/760)
I
TP
V
A
Q
A
L
A
-
A
EG
Q
A
N
A
D
D
T
L
V
V
E
A
S
TPSL
-
-
-
YAPQ
K
SADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
N
K
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
S
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656419.3.peg.1088
Escherichia coli M718 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|1040638.4.peg.4914
Escherichia coli O104:H4 str. LB226692 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585055.6.peg.859
Escherichia coli 55989 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585055.8.peg.863
Escherichia coli 55989 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|344601.3.peg.2534
Escherichia coli B171 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340186.3.peg.2581
Escherichia coli E110019 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|331111.12.peg.1164
Escherichia coli E24377A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|331111.3.peg.3373
Escherichia coli E24377A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|595495.4.peg.3218
Escherichia coli KO11 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|679207.4.peg.1463
Escherichia coli MS 107-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749545.3.peg.1000
Escherichia coli MS 182-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749532.3.peg.2677
Escherichia coli MS 78-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|409438.11.peg.987
Escherichia coli SE11 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|566546.3.peg.3760
Escherichia coli W (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|566546.4.peg.854
Escherichia coli W (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|439855.10.peg.986
Escherichia coli SMS-3-5 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
G
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|316385.5.peg.869
Escherichia coli str. K-12 substr. DH10B (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
M
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340185.4.peg.3004
Escherichia coli E22 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNP
M
TLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|358709.5.peg.3083
Escherichia coli 101-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
V
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|413997.3.peg.815
Escherichia coli B str. REL606 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
V
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|511693.5.peg.843
Escherichia coli BL21 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
V
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|469008.4.peg.2918
Escherichia coli BL21(DE3) (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
V
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749547.3.peg.1647
Escherichia coli MS 187-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
V
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|637912.3.peg.1061
Escherichia coli OP50 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
V
G
V
AKGS
PVTTVD
TA
R
SGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|550672.3.peg.1043
Escherichia coli B088 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340186.5.peg.2668
Escherichia coli E110019 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|573235.3.peg.948
Escherichia coli O26:H11 str. 11368 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|216592.3.peg.922
Escherichia coli 042 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
M
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585397.7.peg.778
Escherichia coli ED1a (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDS
I
T
--
-
DTA
T
MR
F
E
H
DIN
DN
A
TI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585397.9.peg.778
Escherichia coli ED1a (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDS
I
T
--
-
DTA
T
MR
F
E
H
DIN
DN
A
TI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|753642.3.peg.860
Escherichia coli NC101 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDS
I
T
--
-
DTA
T
MR
F
E
H
DIN
DN
A
TI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|344601.5.peg.2637
Escherichia coli B171 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585395.4.peg.887
Escherichia coli O103:H2 str. 12009 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|679204.3.peg.5067
Escherichia coli MS 145-7 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DA
R
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|685038.3.peg.731
Escherichia coli O83:H1 str. NRG 857C (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
A
M
RLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDS
I
T
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|562.372.peg.1095
Escherichia coli 1212A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
V
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|562.374.peg.2629
Escherichia coli 536A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
V
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|714962.3.peg.812
Escherichia coli IHE3034 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|869729.3.peg.2908
Escherichia coli UM146 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|364106.7.peg.893
Escherichia coli UTI89 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|364106.8.peg.892
Escherichia coli UTI89 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|562.371.peg.2247
Escherichia coli 1044A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|562.373.peg.1230
Escherichia coli 1125A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|83334.1.peg.953
Escherichia coli O157:H7 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|155864.1.peg.877
Escherichia coli O157:H7 EDL933 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|155864.8.peg.903
Escherichia coli O157:H7 EDL933 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|444454.5.peg.5374
Escherichia coli O157:H7 str. EC4024 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|444449.5.peg.4308
Escherichia coli O157:H7 str. EC4042 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|444448.5.peg.3584
Escherichia coli O157:H7 str. EC4045 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|444453.5.peg.926
Escherichia coli O157:H7 str. EC4076 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|444450.8.peg.1041
Escherichia coli O157:H7 str. EC4115 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|444451.5.peg.4681
Escherichia coli O157:H7 str. EC4196 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|478004.5.peg.3303
Escherichia coli O157:H7 str. EC4401 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|478006.5.peg.4607
Escherichia coli O157:H7 str. EC4501 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|478007.5.peg.2629
Escherichia coli O157:H7 str. EC508 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|478008.5.peg.2275
Escherichia coli O157:H7 str. EC869 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|637388.3.peg.2575
Escherichia coli O157:H7 str. FRIK2000 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|570506.3.peg.4454
Escherichia coli O157:H7 str. FRIK966 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|386585.9.peg.997
Escherichia coli O157:H7 str. Sakai (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|544404.4.peg.907
Escherichia coli O157:H7 str. TW14359 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|502346.5.peg.308
Escherichia coli O157:H7 str. TW14588 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|701177.3.peg.1015
Escherichia coli O55:H7 str. CB9615 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585034.4.peg.829
Escherichia coli IAI1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
L
V
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585034.5.peg.825
Escherichia coli IAI1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
L
V
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340184.3.peg.1201
Escherichia coli B7A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNL
I
DALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340184.6.peg.1264
Escherichia coli B7A (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNL
I
DALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340197.3.peg.3834
Escherichia coli F11 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TE
D
Y
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|340197.5.peg.4010
Escherichia coli F11 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TE
D
Y
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749550.3.peg.3764
Escherichia coli MS 200-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TE
D
Y
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|431946.3.peg.766
Escherichia coli SE15 (25-749/760)
I
TP
V
A
Q
A
L
A
-
A
EG
Q
A
N
A
D
D
T
L
V
V
E
A
S
TPSL
-
-
-
YAPQ
K
SADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPG
F
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
T
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|478005.5.peg.1425
Escherichia coli O157:H7 str. EC4486 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
T
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
AS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|656417.3.peg.991
Escherichia coli M605 (25-749/760)
I
TP
V
A
Q
A
L
A
-
A
EG
Q
A
N
A
D
D
T
L
V
V
E
A
S
TPSL
-
-
-
YAPQ
K
SADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQP
A
SDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585396.4.peg.900
Escherichia coli O111:H- str. 11128 (23-722/733)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
I
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GS
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNG
V
NAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
T
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNS
V
N
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
QV
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|525281.3.peg.2148
Escherichia coli 83972 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|655817.3.peg.890
Escherichia coli ABU 83972 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749546.3.peg.4958
Escherichia coli MS 185-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|749528.3.peg.4543
Escherichia coli MS 45-1 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|199310.1.peg.863
Escherichia coli CFT073 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
V
FG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|199310.4.peg.844
Escherichia coli CFT073 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VPG
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
V
FG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
L
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGNVY
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|585035.6.peg.824
Escherichia coli S88 (50-749/760)
TPSL
-
-
-
YAPQQSADPKFSRPVA
DTTRTMT
VISE
Q
VIKD
-
QGATNLTDALK
N
-
VP
A
V
G
AFFAG
ENGNSTTGD
-----
AIYMRGADTSNS
---
-
---
IY
I
DG
--------
IRDIGSV
SRDTFN
-
---
--
---
TEQVEVIKGPSGT
D
YGRSAP
T
GSINMI
S
KQ
PR
N
------
DSGI
D
ASASIGSAW
F
RRGTLDVN
QV
I
GD
T
T
----------
AVRLNVMGEKTH
----------------------
DAG
R
DK
V
KNE
------
RYGVAP
--
S
V
AFG
LG
T
ANRLYL
N
YLHVTQHN
TP
DGG
I
------
P
T
IGLPGY
S
APSAG
TA
A
L
NHSGKVDTHNFYGT
-
DS
D
YDDSTT
--
-
DTA
T
MR
F
E
H
DIN
DN
TTI
--------
R
N
TTRWSRV
K
-
QDYLMTAIMGGA
-----
SNITQPTSDVNSWTWSR
TANTKDVSNKILT
NQT
N
LT
S
TFYT
GA
I
-
GHD
V
S
TG
V
EFTRETQT
N
YGVNPVTLP
A
VN
IYHPDSSIHPG
GLTRNGANAN
-----
G
Q
TDTFAI
--
-
-
--
Y
AF
DTLQIT
R
DFELNGG
I
R
L
--------
DNY
HTEYDSATACGGSGRGAIT
C
P
A
G
V
AKGS
PVTTVD
TAKSGN
L
V
NWK
--
-
AG
A
L
-
Y
-----------
HLTENGN
I
Y
I
NYAV
S
Q
Q
PPGGNNFA
L
AQSGSGNSAN
R
T
D
F
KPQKA
-------------
N
TSEIGTKWQVLD
KR
--
LLLTAALFRTDI
E
NEV
E
-------
QNDDGT
----
YSQYGKKRVEGYEI
S
V
---------
-
AGNITPA
W
Q
M
I
GGYTQQKATIKNGKDVAQD
-
GSSSLPYTPEHA
F
T
L
W
SQ
Y
Q
A
TDD
I
SVG
A
GARYIG
S
MHK
G
SDGAVGTPA
F
------
TEGY
W
VAD
A
KLGYRVNRN
---
LD
F
QLNVYNLFDTDYVASI
-----
NK
-----
-
SGYRYHPG
E
P
fig|525281.3.peg.2355
Escherichia coli 83972 (14-725/746)
L
G
I
YG
V
A
Q
AQE
P
TDTP
V
S
H
D
D
T
I
V
V
T
A
A
EQ
N
L
-
-
-
Q
AP
G
V
S
----------
------
T
I
T
AD
E
I
R
K
N
-
PV
A
R
DV
SE
I
IR
T
-
M
PG
V
N
L
----
-T
GNST
S
G
QRGNNR
Q
I
D
I
RG
M
G
PE
N
T
L
--
-
---
I
L
I
DG
K
P
VSSRNSV
R
QGWRGE
RDT
R
G
D
TS
W
V
PPEMI
E
RI
EV
LR
GP
A
A
A
R
YG
N
G
A
A
GG
V
V
N
I
ITK
K
G
SGEWHGSW
D
A
YFN
A
PEHKEEGA
TK
R
TNFS
L
T
G
P
L
GD
EF
----------
S
F
RL
YGNL
D
KT
QADAWDINQGHQSARAGTHATTLP
AG
R
E
G
V
I
N
K
DINGVV
R
W
DF
AP
LQ
SL
ELEA
G
Y
S
R
Q
GN
L
-
Y
AGD
TQ
N
T
N
S
D
---
------
-
-
----
A
Y
TRSKY
G
--
-
-
D
E
T
N
RL
YRQ
N
Y
SL
T
W
N
GG
W
D
N
GV
T
--
-
T
S
NW
VQ
YE
H
---
--
---
--------
---
TR
N
SR
I
P
-
E
-----G
L
A
GG
TEGKFNEKAA
Q
DFV
D
I
-------
-----
D
LDDVM
L
H
S
E
V
N
L
P
I
D
F
L
-
--
V
-
NQT
L
T
L
G
T
E
W
N
QQ
RMKDLSS
N
TQA
L
T
G
T
N
T
----------
G
GAI
D
G
V
S
A
TDRSPYS
K
A
E
I
F
S
L
--
-
-
--
F
A
EN
N
ME
L
T
D
STI
V
TP
G
L
R
F
--------
D
H
H
-------
SIV
G
NNWSP
A
LN
I
S
Q
G
L
GDD
F
------
T
L
K
M
G
I
A
R
A
Y
K
--
-
A
P
SL
-
Y
-----------
QTNP
N
---
Y
I
L
Y
S
K
GQ
G-------C
YA
S
A
G
GCYLQG
N
D
DLK
A
E
T
S
-------------
INK
EIG
L
EF
----
KR
DGW
L
AGVTW
FR
N
D
YR
N
K
IE
AGYVAVG
QN
AV
GT
DLYQ
W
D
N
VP
K
AV
VEG
L
E
G
S
L
NVPVSETVM
W
TN
NIT
YML
K
-
-
---
SE
N
K
T
T
----------
-
-
G
D
R
L
SI
I
PE
Y
T
L
N
S
T
L
S
W
Q
A
R
E
D
L
S
M
Q
T
TFT
W
Y
G
KQQPKKY
N
YK
G
Q
PA
VGPETKEISP
Y
SI
VG
L
SAT
W
D
V
T
K
N
---
V
S
L
T
G
G
V
D
NLFD
K
RLWR
A
G
-----
N
AQTTGD
L
A
G
A
N
Y
IA
G
fig|655817.3.peg.676
Escherichia coli ABU 83972 (14-725/746)
L
G
I
YG
V
A
Q
AQE
P
TDTP
V
S
H
D
D
T
I
V
V
T
A
A
EQ
N
L
-
-
-
Q
AP
G
V
S
----------
------
T
I
T
AD
E
I
R
K
N
-
PV
A
R
DV
SE
I
IR
T
-
M
PG
V
N
L
----
-T
GNST
S
G
QRGNNR
Q
I
D
I
RG
M
G
PE
N
T
L
--
-
---
I
L
I
DG
K
P
VSSRNSV
R
QGWRGE
RDT
R
G
D
TS
W
V
PPEMI
E
RI
EV
LR
GP
A
A
A
R
YG
N
G
A
A
GG
V
V
N
I
ITK
K
G
SGEWHGSW
D
A
YFN
A
PEHKEEGA
TK
R
TNFS
L
T
G
P
L
GD
EF
----------
S
F
RL
YGNL
D
KT
QADAWDINQGHQSARAGTHATTLP
AG
R
E
G
V
I
N
K
DINGVV
R
W
DF
AP
LQ
SL
ELEA
G
Y
S
R
Q
GN
L
-
Y
AGD
TQ
N
T
N
S
D
---
------
-
-
----
A
Y
TRSKY
G
--
-
-
D
E
T
N
RL
YRQ
N
Y
SL
T
W
N
GG
W
D
N
GV
T
--
-
T
S
NW
VQ
YE
H
---
--
---
--------
---
TR
N
SR
I
P
-
E
-----G
L
A
GG
TEGKFNEKAA
Q
DFV
D
I
-------
-----
D
LDDVM
L
H
S
E
V
N
L
P
I
D
F
L
-
--
V
-
NQT
L
T
L
G
T
E
W
N
QQ
RMKDLSS
N
TQA
L
T
G
T
N
T
----------
G
GAI
D
G
V
S
A
TDRSPYS
K
A
E
I
F
S
L
--
-
-
--
F
A
EN
N
ME
L
T
D
STI
V
TP
G
L
R
F
--------
D
H
H
-------
SIV
G
NNWSP
A
LN
I
S
Q
G
L
GDD
F
------
T
L
K
M
G
I
A
R
A
Y
K
--
-
A
P
SL
-
Y
-----------
QTNP
N
---
Y
I
L
Y
S
K
GQ
G-------C
YA
S
A
G
GCYLQG
N
D
DLK
A
E
T
S
-------------
INK
EIG
L
EF
----
KR
DGW
L
AGVTW
FR
N
D
YR
N
K
IE
AGYVAVG
QN
AV
GT
DLYQ
W
D
N
VP
K
AV
VEG
L
E
G
S
L
NVPVSETVM
W
TN
NIT
YML
K
-
-
---
SE
N
K
T
T
----------
-
-
G
D
R
L
SI
I
PE
Y
T
L
N
S
T
L
S
W
Q
A
R
E
D
L
S
M
Q
T
TFT
W
Y
G
KQQPKKY
N
YK
G
Q
PA
VGPETKEISP
Y
SI
VG
L
SAT
W
D
V
T
K
N
---
V
S
L
T
G
G
V
D
NLFD
K
RLWR
A
G
-----
N
AQTTGD
L
A
G
A
N
Y
IA
G
fig|749528.3.peg.4269
Escherichia coli MS 45-1 (13-710/732)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SDYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|685038.3.peg.4659
Escherichia coli O83:H1 str. NRG 857C (16-710/732)
L
I
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AV
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|679206.4.peg.4873
Escherichia coli MS 119-7 (13-710/732)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|679205.4.peg.5114
Escherichia coli MS 124-1 (13-710/732)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AV
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|525281.3.peg.3270
Escherichia coli 83972 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SDYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|655817.3.peg.3422
Escherichia coli ABU 83972 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SDYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|439855.10.peg.3338
Escherichia coli SMS-3-5 (13-716/732)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
ADL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
GP
A
SL
Y
G
fig|585057.4.peg.3547
Escherichia coli IAI39 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|585057.6.peg.3556
Escherichia coli IAI39 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|749548.3.peg.1829
Escherichia coli MS 196-1 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|656393.3.peg.5368
Escherichia coli H299 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
Y
T
SQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|405955.13.peg.5174
Escherichia coli APEC O1 (13-710/732)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
ADL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|405955.9.peg.4324
Escherichia coli APEC O1 (14-711/733)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
ADL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|656419.3.peg.1941
Escherichia coli M718 (13-710/732)
PL
L
L
T
MMA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
ADL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|749533.3.peg.3393
Escherichia coli MS 84-1 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AV
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
V
DL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
fig|585035.6.peg.4993
Escherichia coli S88 (1-692/714)
MA
PA
VAQQTDDETF
V
VS
A
N
RS
N
RT
V
AEMA
Q
T
T
W---------
------
V
I
E
N
A
E
L
E
Q
QI
QG
GKE
L
K
DAL
A
Q
L
I
PGL
D
V
----
------
S
SRSRTNYG
M
N
V
RG
------
---
R
PLV
V
L
VDG
VRLNS
----------
SR
TDSR
Q
LDS
ID
PFNI
D
H
I
EVI
S
G
-
A
T
S
LYG
GG
S
T
GG
L
IN
IV
TK
K
GQ
PET
----
MMEF
EA
GTKS
G
F
S
S
S
K
DHDER
I
A
GA
V
SG
G
NEHI
-------
S
G
RL
S
V
AY
Q
K
FG
----------
GWFDGNGDATLL
D
NT
Q
TG
L
Q
YSD
-----
R
LD
I
MG
--
T
GTLN
ID
E
S
R
Q
L
Q
LI
TQ
Y
YKSQG
DDD
Y
GL
NLGKGFS
A
I
RGTST
P
FV
S
N
G
--
-
-
L
N
S
D
RI
PGTER
H
LI
-
SL
QY
S
DS
AFLG
Q
E
LV
G
QV
Y
Y
RD
--
--
---
--------
-
E
S
L
R
F
YPF
P
T------------
-----
V
N
AN
K
QV
T
AF
S
S
SQQDT
-------------
D
Q
Y
G
M
K
L
T
LN
S
KP
M
D
G
WQ
I
T
W
GL
D
ADH
E
---
R
F
TS
N
Q
M
FFD
L
AQ
-----------
ASASG
G
L
N
NKKIYTT
GR
YP
SY
D
I
-
T
N
L
AA
F
LQ
SGYD
I
N
N
L
F
T
LNGGVRY
QYTENKID
D
FI
-------------------
-
-
-
G
Y
A
Q
QR
------
QIA
A
G
K
A
T
S
ADAI
P
G
GS
V
D
Y
DNFLFNAGLLM
H
I
TE
RQQA
W
LN
FS
Q
G
V
E
L
P
DPG--K
Y
YGR
G
IYG
A
A
V
N
G
H
L
PLT
K
S
VNVSDSKLEGVKV
D
S
Y
E
L
G
W
RF
TGN
N
----
L
RTQI
A
A
Y
Y
S
IS
D
KS
VV
-------
A
N
K
D
L
T
ISVVDD---
K
R
R
I
Y
G
V
E
G
AV
DYLIPDTD
-
W
S
TG
V
N--
F
N
V
L
KTE
SK
VNG
T
W
Q
--------
-
-KYD
V
KTASPSK
A
TA
Y
IG
W
A
P
-
D
P
WS
L
R
V
Q
S
TTSF
D
V
-S
D
A
Q
G
YK----
-------
V
D
GYT
T
ADL
LGS
Y
QL
PVG
---
-T
L
SF
SI
E
NLFD
R
DY
TTVW
-----
--
---
GQ
R
A
PLY
Y
S
PG
Y
G
Consen1
Primary consensus
MKI
VrQtvlPAllvls
---
pvVfaAdtpsl
-
v
-
yAPQqsadpkfsrpVa
------
vVisEevikd
-
qgatNLtdaLk
-
VPGlgv
----
eNgnsttgD
-----
aiymRGadtsns
---
r
---
iYvDG
---------------
srDtfnq
---
id
---
teqVEVikGPsgtlYGrsApgGsiNmitkqgqn
------
dsgIeASasiGSawsrRgtLdvnga
GDgT
----------
avRlnvmGektH
----------------------
dagqdklkNe
------
rygvap
--
slafgid
AnrLyLiylhVtqhndddGGl
------
p
iglpgypapsag
--
-
nhsgkvdthnfygt
-
dsqYDdstT
--
qdtAgmRyErdin
tti
--------
ReTTrwsrvp
-
qdyLmtaimGga
-----
snitqptsdvnSwtwsR
-------------
nqtglt
TFyt
i
-
ghd
tTGleftretqtryGvNpvtLp
vn
-----------
GltrnGanan
-----
grtDtfai
--
n
--
YlqdtlQit
dfeLngGvRy
--------
DnY
-------------------
p
G
akgS
------
taksGna
nwK
--
pAGsL
-
Y
-----------
hlTengNvYlnyavgqepPggNnfayaqsGsgnSann
dlKPqka
-------------
dTsEIGtKwqvlD
----
lLLtaALFrTDidnEvv
-------
qnddgt
----
YsqyGKkRveGyEiav
---------
wagnitpafqv
ggyTqqkATikngkdvaQD
-
gssslPytpeha
tawigY
ptDdwsvG
gARYiGdmhkdsdgavgtPa
-------
tegYtvadlklGYrvNrn
---
lDlqlnVyNLFDtdYVaSi
-----
Nk
-----
sGyrYhpG
pIAWRFE
Consen2
Secondary consensus
l
mma
vaqqtddetf
vs
neqtmi
-
sa
vvseldtpaa
sdttrtmt
dg
qmrla
tpri
ses
t
vqaffagq
rqnyaq
lqlsi
fgsrstygi
-
girl
i
ipatmp
irdigsv
--
gqg
-
tsn
--
lssvqn
lr
fsad
n
-
st
vm
vtsqtprq
ppt
d
syy
fgfw
yg
katqv
t
qpgdvdytvstt
ftth
yrd
sgarknva
a
------
k
gvrlg
sk
s
nfns
dikatpp
i
t
aewkansqqaprta
l
--------------
aed
trk
ik
-
tq
tl
f
hsls
ddmsvmmyage
n
qyqsikmapq
npsha
sv
itlqrhyqgid
rwth
tantkdvsnkiltgelnvp
--
-
---
s
vnyenmsennk
y
nfr
n
--
iyhpdssihpg
pey
qkge
lqr
ernlmw
-
dp
aftqw
ls
kls
da
i
lssvwfdsn
h
hteydsatacggsgrgait
t
gdd
pvttvd
----
dl
yh
wl
-
a
k
am
daw
i
iaagrsfqt
ti
elslrad
--
q
gmr
gf
stn
n
i
s
trig
kr
g
sl
q
ted
ie
vdsssggrtt
kna
t
rq
a
ls
-
-
dqrfagdwrm
asw
wld
yrsnvcne
cngnrm
giarnm
flssq
ae
giya
e
m
simagdentaka
sf
---
wlvgaft
ky
yhnltv
ffgr
d
ke
g
vivnes
gryyep
p
rn
gv
n
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative difference
Other character