fig|6666666.5357.peg.4212
Escherichia coli TY-2482 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
FSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDN
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|1040638.4.peg.2197
Escherichia coli O104:H4 str. LB226692
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
FSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDN
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|585055.6.peg.3494
Escherichia coli 55989
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
FSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDN
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|585055.8.peg.3497
Escherichia coli 55989
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
FSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDN
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|573235.3.peg.4243
Escherichia coli O26:H11 str. 11368 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|595495.4.peg.886
Escherichia coli KO11 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
G
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|566546.3.peg.562
Escherichia coli W (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
G
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|566546.3.peg.561
Escherichia coli W
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
G
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|566546.4.peg.3274
Escherichia coli W
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
G
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|340184.6.peg.4106
Escherichia coli B7A
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|562.375.peg.681
Escherichia coli EC4100B
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|656408.3.peg.3455
Escherichia coli H591 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|585034.4.peg.3127
Escherichia coli IAI1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|585034.5.peg.3125
Escherichia coli IAI1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|679207.4.peg.3758
Escherichia coli MS 107-1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|679206.4.peg.1409
Escherichia coli MS 119-7
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|679204.3.peg.768
Escherichia coli MS 145-7
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|585395.4.peg.3888
Escherichia coli O103:H2 str. 12009
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|656443.3.peg.3991
Escherichia coli TA271
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|585396.4.peg.4003
Escherichia coli O111:H- str. 11128 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
C
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|749532.3.peg.624
Escherichia coli MS 78-1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
I
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|340186.5.peg.514
Escherichia coli E110019 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
GASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|409438.11.peg.3494
Escherichia coli SE11
MYKK
F
KLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|344601.5.peg.431
Escherichia coli B171 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
D
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|550672.3.peg.3288
Escherichia coli B088
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
G
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|340185.4.peg.449
Escherichia coli E22
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QT
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|749545.3.peg.1807
Escherichia coli MS 182-1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
I
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|331111.12.peg.3781
Escherichia coli E24377A
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
V
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|749547.3.peg.3987
Escherichia coli MS 187-1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNLTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQTLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|340184.3.peg.3927
Escherichia coli B7A
M
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|344601.3.peg.433
Escherichia coli B171
M
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
D
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|340186.3.peg.493
Escherichia coli E110019
M
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
GASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|340185.3.peg.409
Escherichia coli E22
M
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QT
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|481805.6.peg.693
Escherichia coli ATCC 8739 (4-835/835)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNLTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPKSSNLTTGTV
ILPC
IS
fig|331111.3.peg.1197
Escherichia coli E24377A
M
IKN
----
I
Y--CSLSV
L
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
NAL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
VAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
V
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NKISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
K
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTV
ILPC
IS
QN
fig|358709.5.peg.1197
Escherichia coli 101-1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|656414.3.peg.3507
Escherichia coli H736 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|749538.3.peg.3863
Escherichia coli MS 116-1 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|316401.4.peg.3765
Escherichia coli ETEC H10407 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
ISLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|413997.3.peg.3053
Escherichia coli B str. REL606
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTNVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|511693.5.peg.3062
Escherichia coli BL21 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTNVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|469008.4.peg.712
Escherichia coli BL21(DE3) (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTNVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|637912.3.peg.374
Escherichia coli OP50 (4-837/837)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
NH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQATNLDFPRI
YL
F
R
P
I
PAMN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTNVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
A
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
DKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
T
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWEE-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NEISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMKAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
TEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDNKLA
L
R
W
GD-K
-
--S
C
F
I
Q-PPNSSNLTTGTA
ILPC
IS
QN
fig|344610.7.peg.3203
Escherichia coli 53638 (4-835/835)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNLTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
K
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIS
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPKSSNLTTGTV
ILPC
IS
fig|749537.3.peg.668
Escherichia coli MS 115-1
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNLTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
KTQQTLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
A
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLP
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPKSSNLTTGTV
ILPC
IS
fig|331112.6.peg.3148
Escherichia coli HS (4-836/836)
MYKKLKLTTISE
L
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PEIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIQ
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGI
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKIT
L
H
W
GD-K
-
--S
C
F
I
QPPPNSSNLTTGTV
ILPC
IS
fig|481805.3.peg.695
Escherichia coli ATCC 8739
M
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNLTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPKSSNLTTGTV
ILPC
IS
fig|585057.4.peg.3661
Escherichia coli IAI39
M
IKN
----
I
Y--CSLSV
I
VIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEFIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGRMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SXANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIP
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIIGV
I
RLP
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPKSSNLTTGTV
ILPC
IS
QN
fig|344610.3.peg.4737
Escherichia coli 53638
M
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNLTA----
-------
-V
C
V
T
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PAIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
VS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
K
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIS
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGV
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKLT
L
R
W
GD-K
-
--S
C
F
I
Q-PPKSSNLTTGTV
ILPC
IS
fig|331112.3.peg.3014
Escherichia coli HS
M
IKN
----
I
Y--CSLSV
I
IIGCASAY
------
A
-----V
EFN
KDL
I
E
----
AEDREN
VN
LSQFETD
G
QLP
-
V
G
K
Y
S
L
STL
I
N
NKRTPIHLDLQ
W
VLIDNQTA----
-------
-V
CLT
PEQ
L
TLL
G
F-TDEIIE
---------
EAQQNLIDG
C
YPIE-KEKQITTYL
D
KGKMQ
L
S
IS
A
PQA
W
L
KYKDAN
W
T
PP
E
L
W
DH
GIAG
AFL
DYN
LY
A
S
--
HYAPHQGD
-
N
S
QNI
S
---------------------
SY
G
QA
G
V
N
L
GAWRLR
T
D
YQ--YDQSFNNGK
--------
SQANNLDFPRI
YL
F
R
P
I
PEIN
A
K
LT
I
G
Q
YDTE
S
S
IFDS
F
H
FS
G
IS
L
K
SD
E
N
MLP
PD
LRGYAP
Q
I
T
G
V
A
Q
TNA
K
V
T
V
S
Q
N
N
R
I
IYQ
EN
VPPGPF
S
I
T
N
L
FN
T
LQ-
-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQ
W
Q
V
AS
N
S
I
P
Y
L
T
R
K
G
QI
RY
TTAM
GK
PTSVGGDSLQQPF
F
WTG
E
FS
WG
WL
N
NV
SLYGG
SVL
T
NRD
Y
Q
S
L
A
T
G
V
G
F
N
LNSL
G
S
LS
F
DVT
R
S
DAQLHNQ
---
NKET--
G
Y
SYR
A
NYSK
R
F
ESTG
S
Q
L
TFAG
-
YRFS
D
K
N
F
V
S
MNE
Y
I
--------
N
------DTNHYTN
------------------------
YQNE
---------
K
ESYI
V
TFN
Q
Y
------------
L
----
ESLRL
N
T
Y
VSLARNT
YW
DASSNVNY
S
LSLSRDFDIGPLKN
VS
T
S
L
T
FSR
------
INWED-DNQDQL
YL
NI
S
I
P
WG
----------------
TSRTLS
Y
GMQRNQ
D
NNISHTA
S
WY
D
SS-
-
D
RN
-
N
SW
S
VS
A
S
GDNDEFKDMEAS
--
LRASYQ
H
NTEN
G
RLYL
S
G
T
SQ
--
RDS
Y
Y
S
LNASWN
GS
F
T
A
T
RH
G
A
AFH
DYSGS
-
ADSRF
M
I
D
A
DG
AEDIQ
L
-
-NNKRAV
TN
RY
G
I
G
V
I
PS
V
S
S
Y
IT
T
SLS
VD
TRN
LP
EN
V
DIEN
S
VITTTL
TEGAIG
YAK
L
DTRK
G
YQIMGI
I
RLA
DG
SH
PPLG
I
---
S
V
KD
E
TSH
---
KEL
G
L
V
A
D
G
G
F
V
YL
N
G
I
QDDSKIT
L
H
W
GD-K
-
--S
C
F
I
QPPPNSSNLTTGTV
ILPC
IS
fig|656444.3.peg.4291
Escherichia coli TA280 (20-836/840)
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
V
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DA
GIAG
FIL
DYN
LY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNHDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSDNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGVE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|550677.3.peg.3604
Escherichia coli B354 (20-836/840)
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DA
GIAG
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNHDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTV
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSDNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGEFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGVE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|753642.3.peg.4921
Escherichia coli NC101 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GI
T
G
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSTSNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSTLH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|655817.3.peg.3617
Escherichia coli ABU 83972 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GI
T
G
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|199310.1.peg.3706
Escherichia coli CFT073 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GI
T
G
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|749546.3.peg.1881
Escherichia coli MS 185-1 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GI
T
G
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|749528.3.peg.471
Escherichia coli MS 45-1 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GI
T
G
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|685038.3.peg.3095
Escherichia coli O83:H1 str. NRG 857C (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GI
T
G
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|656417.3.peg.3879
Escherichia coli M605 (11-836/840)
T
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PAH
------
A
-----V
EFN
VDM
I
D
----
VEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
A
RVQ
I
N
KNMLPQALILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DA
GIAG
FIL
DYN
VY
A
S
--
QYAPHHGY
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFTDGR
--------
SVNYDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQKQ
---
NSDNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYA
-
DNARM
MV
D
TDG
ISGVE
I
-
-NSNRTV
TN
GL
G
I
AVV
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|405955.9.peg.2880
Escherichia coli APEC O1 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GIAG
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
L
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
I
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSTLH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|585035.6.peg.3371
Escherichia coli S88 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GIAG
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
L
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
I
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSTLH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|869729.3.peg.230
Escherichia coli UM146 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIE
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GIAG
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
L
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
I
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSTLH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|216592.1.peg.3963
Escherichia coli 042 (20-836/840)
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
VEDREN
ID
ISRFEKK
G
YIT
-
P
G
K
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TSF
G
L-NTEFIE
---------
SLQTIAGSE
C
LNLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DA
GIAG
FIL
DYN
MY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNHDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSDNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGVE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
Q
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|216592.3.peg.3468
Escherichia coli 042 (20-836/840)
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
VEDREN
ID
ISRFEKK
G
YIT
-
P
G
K
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TSF
G
L-NTEFIE
---------
SLQTIAGSE
C
LNLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DA
GIAG
FIL
DYN
MY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNHDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSDNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGVE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
Q
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|749531.3.peg.2413
Escherichia coli MS 69-1 (20-836/840)
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
VEDREN
ID
ISRFEKK
G
YIT
-
P
G
K
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TSF
G
L-NTEFIE
---------
SLQTIAGSE
C
LNLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DA
GIAG
FIL
DYN
MY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
SVNHDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSDNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
V
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGVE
I
-
-NSNRTV
TN
GL
G
I
AV
I
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSILH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|562.376.peg.3910
Escherichia coli WV_060327 (12-836/840)
L
S
H
A
IKN
A
LSG
V
V--CSLLF
V
L----PVH
------
A
-----V
EFN
VDM
I
D
----
AEDREN
ID
ISRFEKK
G
YIP
-
P
G
R
Y
L
V
RVQ
I
N
KNMLPQTLILE
W
VKADNESGS---
-------
LL
CLT
KEN
L
TNF
G
L-NTEFIG
---------
SLQNIAGSE
C
LDLS-QRQELTTRL
D
KATMI
L
S
L
S
V
PQA
W
L
KYQATN
W
T
PP
E
F
W
DT
GIAG
FIL
DYN
VY
A
S
--
QYAPHHGD
-
S
T
QNV
S
---------------------
SY
G
TL
G
F
N
L
GAWRLRSD
YQ--YNQNFADGR
--------
PVNRDSEFART
YL
F
R
P
I
PSWS
S
K
F
T
M
G
Q
YDLS
S
N
LY
D
T
F
H
FT
G
AS
L
E
SD
E
S
MLP
PD
L
Q
GYAP
Q
I
T
GIA
Q
TNA
K
V
T
V
A
Q
N
GRV
L
YQ
TT
V
A
PGPF
T
I
S
DL
GQ
S
FQ-
-
G
Q
L
D
V
T
V
E
E
E
DG
RTST
F
Q
V
GS
AS
I
P
Y
L
T
R
K
G
QV
RY
KTSL
GK
PTSVGHNDINNPF
F
WTA
E
AS
WG
WL
N
NV
SLYGG
GMF
T
ADD
Y
Q
A
I
T
T
G
I
G
F
N
LNQF
G
S
LS
F
DVT
G
A
DASLQQQ
---
NSGNLR
G
Y
SYR
F
NY
A
K
H
F
ESTG
S
Q
ITFAG
-
YRFS
D
K
D
Y
V
S
MSE
YL
--------
S
------SRNGDES
------------------------
IDNE
---------
K
ESYV
I
SLN
Q
Y
------------
F
----
ETLEL
N
S
Y
LNVTRNT
YW
DSASNTNY
S
VSVSKNFDIGDFKG
I
S
A
S
L
A
VSR
------
IRWDD-DEENQY
Y
F
SF
SLP
LQ
----------------
QNRNIS
Y
SMQRTG
S
SNTSQMI
S
WY
D
SS-
-
D
RN
-
N
I
W
N
I
S
A
S
ATDDNIRDGEPT
--
LRGSYQ
H
YSPW
G
RLNI
N
G
S
VQ
--
PNQ
Y
N
S
VTAGWY
GS
L
T
A
T
RH
G
V
A
L
H
DYSYG
-
DNARM
MV
D
TDG
ISGIE
I
-
-NSNRTV
TN
GL
G
I
AV
M
PS
L
S
N
Y
TT
S
MLR
V
N
NND
LP
EG
V
DVEN
S
VIRTTL
T
Q
GAIG
YAK
L
NATT
G
YQIVGV
I
RQE
N
G
RF
PPLG
V
---
N
V
TD
K
ATG
---
KDV
G
L
V
A
E
D
G
F
V
YLSG
I
QENSTLH
L
T
W
GD-N
-
--T
C
E
V
T-PPNQSNISESAI
ILPC
fig|656419.3.peg.991
Escherichia coli M718 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
A
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VNFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
DSHFDIQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TPLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITAGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|550676.3.peg.174
Escherichia coli B185 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TPLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
V
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTNRL
ILPC
fig|701177.3.peg.870
Escherichia coli O55:H7 str. CB9615 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
TQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|562.373.peg.3015
Escherichia coli 1125A (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|562.372.peg.1711
Escherichia coli 1212A (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|562.374.peg.5266
Escherichia coli 536A (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444454.5.peg.5236
Escherichia coli O157:H7 str. EC4024 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444449.5.peg.5572
Escherichia coli O157:H7 str. EC4042 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444448.5.peg.3447
Escherichia coli O157:H7 str. EC4045 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444453.5.peg.790
Escherichia coli O157:H7 str. EC4076 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444452.5.peg.3574
Escherichia coli O157:H7 str. EC4113 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444450.8.peg.906
Escherichia coli O157:H7 str. EC4115 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|444451.5.peg.4518
Escherichia coli O157:H7 str. EC4196 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|478005.5.peg.1294
Escherichia coli O157:H7 str. EC4486 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|478007.5.peg.3992
Escherichia coli O157:H7 str. EC508 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|478008.5.peg.2136
Escherichia coli O157:H7 str. EC869 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|637388.3.peg.1495
Escherichia coli O157:H7 str. FRIK2000 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|570506.3.peg.422
Escherichia coli O157:H7 str. FRIK966 (5-818/819)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|544404.4.peg.771
Escherichia coli O157:H7 str. TW14359 (2-815/816)
N
----
I
YRLSVLSC
L
AMVTPPAL
------
T
-----A
EFN
LNV
L
D
----
KSIRDS
V
D
ISLLNQK
G
VVA
-
P
G
D
Y
F
V
SVT
VN
NNKISNGQQIR
W
QKSGDKII----
-------
-P
C
IN
ESL
I
ELF
G
L-KSDFRK
---------
KLPA--IKE
C
VDFS-VFPEIIFTF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
NN
GI
P
G
FLM
DYN
LF
A
S
--
TYRPQSGS
-
SS
NNL
N
---------------------
AY
G
TT
G
L
N
A
GAWRLRSD
YQ--LSQS-DSGD
--------
NREQSGAISRT
YL
F
R
P
L
PQIG
S
R
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
S
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
R
PRSSMSHHTEDET
F
ISH
E
VS
WG
ML
S
NT
SLYGG
MLL
A
GDD
Y
R
S
G
A
L
G
I
G
Q
N
MLWM
GALS
F
DVT
W
A
DSHFDTQ
---
QDEQ--
G
Y
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
Y
I
--------
D
------HKYNDAD
------------------------
AQDE
---------
K
QTIS
L
SFG
Q
P
------------
I
----
TLLNL
N
L
Y
ANILHQS
W
W
NADTSTTA
N
ITVGFNVDIGDWKD
I
S
V
S
T
S
FNT
------
THYEDKDRDNQI
Y
F
SI
SLP
IG
----------------
ESGRLG
Y
DMQ-NN
S
NTTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
IQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDL
T
G
T
YA
--
AND
Y
T
S
ASASWS
GS
F
T
A
T
QH
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VGDIP
I
-
-QGNIDY
TN
RF
G
I
AVV
PF
V
S
S
Y
QP
T
TVA
V
N
MND
LP
DG
V
TVSE
N
VVKETW
TEGAIG
FKS
L
ASRA
G
KDLNVI
I
SDA
N
G
HF
PPLG
A
---
D
V
RQ
A
EGG
---
VSV
GMV
G
E
N
G
H
A
W
LSGV
DENQQFT
V
H
W
GDQK
-
--T
C
A
I
H-LPEHLEDVTKRL
ILPC
fig|749531.3.peg.1848
Escherichia coli MS 69-1 (2-815/816)
N
----
I
YRLSFISC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNF
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
N
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVS
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
V
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|656437.3.peg.789
Escherichia coli TA143 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKGK
G
GIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNF
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
N
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
V
W
GDSQ
-
--H
C
S
L
Y-LPEHMENTANRL
ILPC
fig|409438.11.peg.903
Escherichia coli SE11 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
L
N
GV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|550672.3.peg.961
Escherichia coli B088 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PV
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
K
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|585034.4.peg.688
Escherichia coli IAI1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PV
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
K
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|585034.5.peg.687
Escherichia coli IAI1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PV
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
K
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|562.375.peg.1438
Escherichia coli EC4100B (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|656408.3.peg.660
Escherichia coli H591 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|679207.4.peg.1372
Escherichia coli MS 107-1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|679206.4.peg.2773
Escherichia coli MS 119-7 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|585396.4.peg.766
Escherichia coli O111:H- str. 11128 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|656443.3.peg.983
Escherichia coli TA271 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|566546.3.peg.4582
Escherichia coli W (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|566546.4.peg.772
Escherichia coli W (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|340186.3.peg.3855
Escherichia coli E110019 (5-818/819)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|331111.3.peg.3259
Escherichia coli E24377A (5-818/819)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|340186.5.peg.4048
Escherichia coli E110019 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|331111.12.peg.1040
Escherichia coli E24377A (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|656379.3.peg.1492
Escherichia coli FVEC1302 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKGK
G
GIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRTSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDN
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
N
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNLI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
I
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|656380.3.peg.1320
Escherichia coli FVEC1412 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKGK
G
GIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRTSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDN
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
N
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNLI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
I
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|749549.3.peg.4481
Escherichia coli MS 198-1 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKGK
G
GIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRTSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDN
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
N
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNLI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
I
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|585056.7.peg.1000
Escherichia coli UMN026 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKGK
G
GIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRTSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDN
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
N
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNLI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
I
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|679205.4.peg.3671
Escherichia coli MS 124-1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|749533.3.peg.4665
Escherichia coli MS 84-1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|585055.6.peg.714
Escherichia coli 55989 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
K
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
I
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|585055.8.peg.716
Escherichia coli 55989 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
K
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
I
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|573235.3.peg.793
Escherichia coli O26:H11 str. 11368 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
D
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|316401.4.peg.846
Escherichia coli ETEC H10407 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
R
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QY
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|481805.3.peg.3149
Escherichia coli ATCC 8739 (5-818/819)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVTE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|481805.6.peg.3134
Escherichia coli ATCC 8739 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVTE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|331112.3.peg.712
Escherichia coli HS (5-818/819)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVTE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|331112.6.peg.743
Escherichia coli HS (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVTE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|340184.3.peg.2282
Escherichia coli B7A (5-818/819)
N
----
I
YRLSLVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQNFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|340184.6.peg.2396
Escherichia coli B7A (2-815/816)
N
----
I
YRLSLVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQNFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|679204.3.peg.4494
Escherichia coli MS 145-7 (2-815/816)
N
----
I
YRLSLVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
I
T
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
T
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
I
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQNFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|749532.3.peg.2590
Escherichia coli MS 78-1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
G
T
LS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
GSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|550677.3.peg.1123
Escherichia coli B354 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
QSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSTT
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
NSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQMLT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|656393.3.peg.1403
Escherichia coli H299 (2-815/816)
N
----
I
YRLSFISC
L
VMAIPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
GIA
-
P
G
E
Y
F
V
SVT
VN
NNQISNGQKID
W
KKNGDQTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPR--LNQ
C
VDFS-SRPEILFIF
D
QASQQ
L
N
I
T
I
PQA
W
L
AWHSDN
W
T
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
N
S
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
V
S
H
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RANN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRTSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATD
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNNSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGVQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
NSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQMLT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|749547.3.peg.1737
Escherichia coli MS 187-1 (2-815/816)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TSLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|216592.1.peg.465
Escherichia coli 042 (19-832/833)
N
----
I
YRLSFISC
L
VMAMPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFDS
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLTA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDN
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
G
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
I
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|216592.3.peg.769
Escherichia coli 042 (2-815/816)
N
----
I
YRLSFISC
L
VMAMPSAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGEISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFDS
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLTA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
GDN
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSQFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDND
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGNWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
G
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
T
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAD
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
E
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
I
W
GDSQ
-
--R
C
S
I
H-LPEHMEDTANRL
ILPC
fig|749545.3.peg.3930
Escherichia coli MS 182-1 (2-815/816)
N
----
I
YRLSFVSC
L
VVAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
HKNDDKTI----
-------
-P
C
IN
DLL
V
DKF
G
L-KPEVRQ
---------
SLPL--INQ
C
VDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
T
PPS
T
W
KE
G
V
AG
ILM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
GAWRLRSD
YQ--LNQT-DSDD
--------
NHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SD
E
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
G
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
GSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQQFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|670888.3.peg.1279
Escherichia coli 1827-70 (2-815/816)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|413997.3.peg.720
Escherichia coli B str. REL606 (2-815/816)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|511693.5.peg.717
Escherichia coli BL21 (2-815/816)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|469008.4.peg.3039
Escherichia coli BL21(DE3) (2-815/816)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|358709.5.peg.3965
Escherichia coli 101-1 (2-815/816)
N
----
I
YRLSFVSC
L
VMAMPCAL
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNQISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
NYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TT
G
I
N
A
G
S
WRLRSD
YQ--LNNT-DSED
--------
SHEQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLL
S
GDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTTHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
AND
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
MADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQKFT
V
V
W
GDSQ
-
--H
C
S
L
H-LPEHMEDTANRL
ILPC
fig|749548.3.peg.3226
Escherichia coli MS 196-1 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDH-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|595496.3.peg.644
Escherichia coli BW2952 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|536056.3.peg.3079
Escherichia coli DH1 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|656414.3.peg.899
Escherichia coli H736 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|749538.3.peg.653
Escherichia coli MS 116-1 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|749544.3.peg.3267
Escherichia coli MS 175-1 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|316407.3.peg.692
Escherichia coli W3110 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|316385.5.peg.781
Escherichia coli str. K-12 substr. DH10B (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|316385.7.peg.793
Escherichia coli str. K-12 substr. DH10B (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|511145.12.peg.748
Escherichia coli str. K-12 substr. MG1655 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|511145.6.peg.739
Escherichia coli str. K-12 substr. MG1655 (2-814/815)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|83333.1.peg.710
Escherichia coli K12 (5-817/818)
N
----
I
YRLSFVSC
L
VMAMPCAM
------
A
-----V
EFN
LNV
L
D
----
KSMRDR
ID
ISLLKEK
G
VIA
-
P
G
E
Y
F
V
SVA
VN
NNKISNGQKIN
W
QKKGDKTI----
-------
-P
C
IN
DSL
V
DKF
G
L-KPDIRQ
---------
SLPQ--IDR
C
IDFS-SRPEMLFNF
D
QANQQ
L
N
IS
I
PQA
W
L
AWHSEN
W
A
PPS
T
W
KE
G
V
AG
VLM
DYN
LF
A
S
--
SYRPQDGS
-
SS
TNL
N
---------------------
AY
G
TA
G
I
N
A
GAWRLRSD
YQ--LNKT-DSED
--------
NHDQSGGISRT
YL
F
R
P
L
PQLG
S
K
LTLGE
TDFS
S
N
IFD
G
F
S
YT
G
AA
L
A
SDD
R
MLP
WE
LRGYAP
Q
I
S
GIA
Q
TNA
T
V
T
I
S
Q
S
GRVIYQ
KK
VPPGPF
I
I
D
DL
NQ
S
VQ-
-
G
T
L
D
V
K
V
T
E
E
DG
RVNN
F
Q
V
SA
AS
T
P
F
L
T
R
Q
G
QV
RY
KLAA
G
Q
PRPSMSHQTENET
F
FSN
E
VS
WG
ML
S
NT
SLYGG
LLI
S
DDD
Y
H
S
A
A
M
G
I
G
Q
N
MLWL
GALS
F
DVT
W
A
SSHFDTQ
---
QDER--
G
L
SYR
F
NYSK
Q
V
DATN
S
T
I
S
L
A
A
-
YRFS
D
R
H
F
H
S
YAN
YL
--------
D
------HKYNDSD
------------------------
AQDE
---------
K
QTIS
L
SVG
Q
P
------------
I
----
TPLNL
N
L
Y
ANLLHQT
W
W
NADASTTA
N
ITAGFNVDIGDWRD
I
S
I
S
T
S
FNT
------
THYEDKDRDNQI
YL
SI
SLP
FG
----------------
NGGRVG
Y
DMQ-NS
S
HSTIHRM
S
WN
D
TL-
-
D
ER
-
N
SW
G
M
S
A
G
LQSDR-PDNGAQ
--
VSGNYQ
H
LSSA
G
EWDI
S
G
T
YA
--
ASD
Y
S
S
VSSSWS
GS
F
T
A
T
QY
G
A
AFH
RRSST
-
NEPRL
MV
S
TDG
VADIP
V
-
-QGNLDY
TN
HF
G
I
AVV
PL
IS
S
Y
QP
S
TVA
V
N
MND
LP
DG
V
TVAE
N
VIKETW
I
EGAIG
YKS
L
ASRS
G
KDVNVI
I
RNA
S
G
QF
PPLG
A
---
D
I
RQ
D
DSG
---
ISV
GMV
G
E
E
G
H
A
W
LSGV
AENQLFT
V
V
W
GE-Q
-
--S
C
I
I
H-LPERLEDTTKRL
ILPC
fig|525281.3.peg.1576
Escherichia coli 83972 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TIS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|655817.3.peg.5051
Escherichia coli ABU 83972 (16-842/843)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TIS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|749531.3.peg.1662
Escherichia coli MS 69-1 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
IARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|362663.8.peg.4568
Escherichia coli 536 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|362663.9.peg.4584
Escherichia coli 536 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|656379.3.peg.4380
Escherichia coli FVEC1302 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|656380.3.peg.4291
Escherichia coli FVEC1412 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|585057.4.peg.4730
Escherichia coli IAI39 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|585057.6.peg.4738
Escherichia coli IAI39 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|749549.3.peg.3329
Escherichia coli MS 198-1 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|585056.7.peg.3517
Escherichia coli UMN026 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|869729.3.peg.4655
Escherichia coli UM146 (16-842/843)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGMP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|364106.7.peg.4764
Escherichia coli UTI89 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGMP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|364106.8.peg.4764
Escherichia coli UTI89 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGMP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|749533.3.peg.482
Escherichia coli MS 84-1 (16-842/843)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
ITS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DEKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|656393.3.peg.4789
Escherichia coli H299 (25-851/852)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
ITS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DEKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|679206.4.peg.4768
Escherichia coli MS 119-7 (16-842/843)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
A
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|585035.6.peg.3202
Escherichia coli S88 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
A
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|199310.1.peg.5077
Escherichia coli CFT073 (16-842/843)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
MF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|199310.4.peg.3378
Escherichia coli CFT073 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
MF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|199310.4.peg.4864
Escherichia coli CFT073 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
MF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|199310.1.peg.3508
Escherichia coli CFT073 (13-839/840)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
MF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|340197.3.peg.857
Escherichia coli F11 (13-839/840)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVM
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|340197.5.peg.902
Escherichia coli F11 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVM
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|749550.3.peg.649
Escherichia coli MS 200-1 (16-842/843)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVM
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|749527.3.peg.1942
Escherichia coli MS 21-1 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPQSSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
GPF
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
S
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
D
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|405955.13.peg.3304
Escherichia coli APEC O1 (9-835/836)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
G
L
F
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
A
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|405955.9.peg.2742
Escherichia coli APEC O1 (13-839/840)
V
NNITCVIL
L
SLFCNAAS
------
A
-----V
EFN
TDV
L
D
----
AADKKN
ID
FTRFSEA
G
YVL
-
P
G
Q
Y
L
L
DVI
VN
GQSISPA---S
L
QISFVEPALSGD
KAEKKLP
QA
CLT
SDM
V
RLM
G
LTAESLDK
---------
VVYW-HDGQ
C
ADFH-GLPGVDIRP
D
TGAGV
L
R
I
N
M
PQA
W
L
EYSDAT
W
L
PPS
R
W
DD
GI
P
G
LML
DYN
LN
G
T
--
VSRNYQGG
-
D
S
HQF
S
---------------------
YN
G
TV
G
G
N
L
G
P
WRLR
A
D
YQGSQEQSRYNGE
-------
K
TTNRNFTWSRF
YL
F
R
A
I
PRWR
A
N
LTLGE
NNIN
S
D
IF
R
S
W
S
YT
G
AS
L
E
SDD
R
MLP
PR
LRGYAP
Q
I
T
GIA
E
TNA
R
V
V
V
S
Q
Q
GRV
L
Y
D
SM
VP
A
G
L
F
S
I
Q
DL
DS
S
VR-
-
G
R
L
D
V
E
V
I
E
Q
N
G
RKKT
F
Q
V
DT
AS
V
P
Y
L
T
R
P
G
QV
RY
KLVS
G
R
SRG-YGHETEGPV
F
ATG
E
AS
WG
LS
N
QW
SLYGG
AVL
A
G-D
Y
N
A
L
A
A
G
A
G
W
D
LGVP
G
T
LS
A
D
I
T
Q
S
VARIE--
---
GERTFQ
G
K
S
W
R
L
S
YSK
R
F
DNAD
A
D
ITFAG
-
YRFS
E
R
N
Y
M
T
MEQ
YL
--------
N
------ARYRNDY
------------------------
SSRE
---------
K
EMYT
V
TLN
K
N
------------
V
----
ADWNT
S
F
N
LQYSRQT
YW
DIRKTDYY
T
VSVNRYFNVFGLQG
V
A
V
G
L
A
ASR
------
SKYLG-RDNDSA
YL
RI
S
V
P
LG
----------------
-TGTAS
Y
SGS-MS
N
DRYVNMA
G
YT
D
TF-
-
N
DG
L
D
S
Y
S
L
N
A
G
LNSGGGLTSQRQ
--
INAYYS
H
RSPL
A
--NL
S
A
N
IA
SL
QKG
Y
T
S
FGVSAS
G
G
A
T
I
T
GK
G
A
A
L
H
AGGMS
-
GGTRL
L
V
D
TDG
VGGVP
V
-
-DGGQVV
TN
RW
G
T
G
VV
TD
IS
S
Y
YR
N
TTS
VD
LKR
LP
DD
V
EATR
S
VVESAL
TEGAIG
YRK
F
SVLK
G
KRLFAI
L
RLA
DG
SQ
PP
F
G
A
---
S
V
TS
E
K-G
---
REL
GMV
A
D
E
G
L
A
W
LSGV
TPGETLS
V
N
W
DGKI
-
--Q
C
Q
V
NVPETAISD--QQL
L
LPC
TP
Q
fig|656417.3.peg.3017
Escherichia coli M605 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQSLSDEYDIN
W
YVSENDPTKT--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
S
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
EDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
E-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAH
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ASQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
V
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NTDKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
N
RI-
-
D
DA
-
T
HY
Q
I
N
V
G
-------TSEQH
GS
VDGYLS
H
DGTL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
G
I
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFRGL
L
LPC
fig|431946.3.peg.2313
Escherichia coli SE15 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQSLSDEYDIN
W
YVSENDPTKT--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
S
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
EDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
E-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ASQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
V
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NTDKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
N
RI-
-
D
DA
-
T
HY
Q
I
N
V
G
-------TSEQH
GS
VDGYLS
H
DGTL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGALVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
G
I
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFRGL
L
LPC
fig|439855.10.peg.2662
Escherichia coli SMS-3-5 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKT--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
E-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
VS
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQSM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMT
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|585057.4.peg.2590
Escherichia coli IAI39 (9-844/883)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKT--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
EDDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
E-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQSM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMT
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|585057.6.peg.2593
Escherichia coli IAI39 (9-844/883)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKT--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
EDDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
E-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQSM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMT
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|749527.3.peg.1680
Escherichia coli MS 21-1 (9-844/883)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKT--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
EDDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHRTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
E-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQSM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMT
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|199310.1.peg.2808
Escherichia coli CFT073 (11-845/884)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|525281.3.peg.4146
Escherichia coli 83972 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|655817.3.peg.2790
Escherichia coli ABU 83972 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|199310.4.peg.2720
Escherichia coli CFT073 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|749546.3.peg.2772
Escherichia coli MS 185-1 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|749528.3.peg.2685
Escherichia coli MS 45-1 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|405955.9.peg.2108
Escherichia coli APEC O1 (11-845/884)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKNWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|585397.7.peg.2809
Escherichia coli ED1a (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KHPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|585397.9.peg.2806
Escherichia coli ED1a (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KHPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NADEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|685038.3.peg.2402
Escherichia coli O83:H1 str. NRG 857C (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
V
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|362663.8.peg.2404
Escherichia coli 536 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|362663.9.peg.2409
Escherichia coli 536 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|749550.3.peg.2880
Escherichia coli MS 200-1 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-EEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|753642.3.peg.2745
Escherichia coli NC101 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KHPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|656440.3.peg.2410
Escherichia coli TA206 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KHPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
GA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
V
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
ADGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|340197.3.peg.3888
Escherichia coli F11 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
G
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|340197.5.peg.4063
Escherichia coli F11 (9-843/882)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
G
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|340197.3.peg.4806
Escherichia coli F11 (11-845/884)
L
RGIACYIA
L
AISGGSVN
------
A
WADDSI
Q
F
D
PRF
L
E
----
LKGDTK
ID
LGKFSKK
G
YVD
-
A
G
K
Y
N
L
RVF
I
N
KQPLSDEYDIN
W
YVSENDPTKN--
-------
YA
CLT
PEL
V
AAL
G
L-KEGIAK
---------
SLQWTHNDE
C
LKPG-QLDGMEVEN
D
LSQSA
L
L
L
T
V
PQA
Y
L
EYTSSD
W
D
PPS
R
W
DD
GI
P
G
LIA
DY
S
LN
A
Q
--
TRHQEQGG
ED
S
HDI
S
---------------------
GN
G
TV
G
A
N
L
GAWR
F
R
A
D
WQSDYQHTRSNDD
-
DDDSSNS
TTSKHWDWSRY
Y
A
W
R
A
L
PSLK
A
K
L
S
LGE
DYLN
S
D
IFD
G
F
N
YI
G
SS
V
S
T
DD
Q
MLP
PN
LRGYAP
D
V
S
G
V
A
H
SS
A
K
V
T
I
S
Q
M
GRV
L
Y
E
TQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
-
G
T
L
H
V
R
V
E
E
Q
N
G
QVQE
Y
D
V
TT
AS
M
P
F
L
T
R
Q
G
QV
RY
KVMM
G
R
PED-WNHKTEGGF
F
SGG
E
AS
WG
VA
D
GW
SLYGG
ALA
D
K-H
Y
Q
S
A
A
M
G
G
G
R
D
LAQF
GAL
A
F
DVT
H
S
HVNLDHD
SAY
GKGKLD
G
N
S
F
R
V
S
Y
A
K
D
F
DELN
S
R
V
TFAG
-
YRFS
E
K
N
F
M
T
MSE
YL
--------
D
ANQSDMARTGND-
------------------------
----
---------
K
EMYT
I
TYN
Q
N
------------
F
----
AAAGV
S
I
Y
LNYSHRT
YW
DRPEQTNY
N
LMFSHYFNMGSIRN
M
S
I
S
V
T
GYR
------
YEYDD-NADKGM
YL
SM
S
I
P
WS
----------------
DSSTVT
Y
NGS-YG
S
GSDSSQV
G
YF
K
RV-
-
D
DA
-
T
HY
Q
V
N
V
G
-------TSEQH
GS
VDGYLS
H
DGSL
A
KVDL
S
A
N
YH
--
EGE
Y
R
S
AGIALQ
G
G
A
T
L
T
AH
G
G
A
L
H
RTQNM
-
GGTRL
L
I
D
A
DG
IANVP
V
E
SNGAPVY
TN
MF
G
K
AVV
AD
I
N
N
Y
YR
N
QAY
I
D
LNN
LP
ED
A
EATQ
S
VVQATL
TEGAIG
YRK
F
KVIS
G
QKAMAV
L
RLR
DG
SY
PP
F
G
A
---
E
V
KN
D
E-Q
---
QQV
G
I
V
D
D
E
G
N
V
YL
A
GV
NAGEHMM
V
F
W
EGSA
-
--Q
C
E
I
VLPKPLPADLFSGL
L
LPC
fig|216592.1.peg.4266
Escherichia coli 042 (33-883/891)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|216592.3.peg.149
Escherichia coli 042 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|656379.3.peg.381
Escherichia coli FVEC1302 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|656380.3.peg.318
Escherichia coli FVEC1412 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|749549.3.peg.4678
Escherichia coli MS 198-1 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|656437.3.peg.205
Escherichia coli TA143 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|585056.7.peg.329
Escherichia coli UMN026 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGSKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|550677.3.peg.2920
Escherichia coli B354 (1-851/871)
MHQVL
L
LPRFARLT
I
ALSLATAV
------
F
PVDAEY
Y
FN
PRF
L
S
N
---
DLAES-
V
D
LSAFTKG
R
EAP
-
P
G
T
Y
R
V
DIY
L
N
DEFMTSR---D
I
TFIADDN-----
-
NADLI
-
-P
CL
S
TDL
L
VSL
G
IKKSALLD
---
N
K
ENSA
EKHVPDNSA
C
TPLQDRLADASTEF
D
VGQQH
L
S
L
S
V
PQ
I
Y
V
GRMARG
Y
V
S
P
D
L
W
EE
GI
N
A
GLL
N
Y
S
FN
G
N
SI
NNRSNHNA
-
GK
SNY
A
--------------------
Y
LN
L
QS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGSSNSS
--------
-DSNKWQHINT
SA
E
R
D
I
IPLR
S
R
LT
V
G
D
SYTD
G
D
IFDS
V
N
FR
G
LK
I
N
S
T
E
A
MLP
DS
Q
H
G
F
AP
V
I
H
GIA
R
GT
A
Q
V
S
V
K
Q
N
G
Y
D
IYQ
TT
VPPGPF
N
I
D
D
I
NS
A
ANG
-
G
D
L
Q
V
T
I
K
E
A
DG
SIQT
L
Y
V
PY
S
S
V
P
V
L
Q
R
A
G
YT
RY
ALAM
G
E
YRS-GNNLQSSPK
F
IQG
S
LM
H
G
LE
G
NW
T
P
YGG
MQI
A
ED-
Y
Q
A
F
N
L
G
I
G
K
D
LGLF
GA
F
S
F
D
I
T
Q
A
NTTL-AD
---
GTRH-S
G
Q
S
I
K
S
V
YSK
S
F
YQTG
T
N
I
QV
AG
-
YR
Y
S
T
Q
G
F
Y
N
L--
--
--------
-
----SDSAYSRMS
GYTVKPPTGDSNEQTQFIDYFNLF
YSKR
---------
G
QEQI
S
-IS
Q
Q
------------
L
----
GNYG-
T
T
F
FSASRQS
YW
NTSRSDQ-
Q
ISFGLN---VPFGD
IT
T
S
L
N
YSY
-----
S
NNIWQNDRDHLL
A
F
TL
NV
P
FS
----
H
-
W
MRTDSQ
S
AF
RNSNAS
Y
SMSNDL
K
GGMTNLS
G
VY
G
TLL
P
D
NN
-
L
N
Y
S
V
Q
V
G
NTHGGNTSSATS
--
GYSSLN
Y
RGAY
G
NTNV
G
Y
S
RS
--
GD-
S
S
Q
IYYGMS
G
G
I
I
A
H
AD
G
I
T
F
G
QPL--
-
GDTMV
L
V
K
AP
G
ADNVK
I
-
ENQTGIH
T
D
WR
G
Y
A
IL
PF
A
T
E
Y
RE
N
RVA
L
N
ANS
L
A
DN
V
ELDE
T
VVTVIP
T
H
GAI
A
RAT
F
NAQI
G
GKVLMT
L
KYG
N
K
S-
V
P
F
G
A
---
I
V
TH
G
ENK
---
-NG
S
I
V
A
E
N
G
Q
V
YL
T
G
L
PQSGKLQ
V
S
W
GKDK
-
NSN
C
I
V
E
fig|550677.3.peg.528
Escherichia coli B354 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVSY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
ATL
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSNVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGRM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGGKSRS-
N
YSVGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
A
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|656444.3.peg.479
Escherichia coli TA280 (15-865/873)
K
RHNLTRIA
M
YCSIMYSG
-----
AT
IGAESV
E
Y
D
PTF
L
M
----
GGNASS
ID
VSRYSDG
N
PTL
-
P
G
V
Y
D
V
SIY
VN
EQPVANL---E
I
PFIAIPDKKNAQ
-------
-A
C
I
T
LKN
L
LQL
H
IKTPPADE
ENTILLPRD
E----TLGN
C
LDLSLAIPKSSVNY
D
PSEQR
L
D
I
N
V
PQA
W
V
MKNYQN
Y
V
D
PS
L
W
EN
GI
N
A
AML
S
YN
IN
A
Y
--
------RS
-
E
N
SNY
T
----------------
NDS
V
Y
TS
F
NG
G
V
N
L
GAWRLRS
S
GNYSWRND-----
--------
-AGSDVEFMNR
Y
V
Q
R
D
I
TAIR
S
Q
L
I
M
GE
SYTT
G
E
T
FDS
V
S
IR
G
IR
L
Y
SD
S
R
MLP
PV
L
AN
F
AP
T
I
R
G
V
A
N
TNA
K
V
S
I
T
Q
S
G
Y
K
IY
E
TT
VPPGPF
V
I
D
DL
SP
S
GYG
-
S
D
L
I
V
T
I
E
E
A
DG
SKRT
F
S
Q
PF
S
S
V
I
Q
M
L
R
P
G
VS
R
W
DISG
G
Q
INK--DDLHHEPN
L
LQA
T
YY
R
G
LS
N
LF
T
G
Y
T
G
FQV
T
DNH
Y
A
A
G
L
L
G
I
G
M
N
T-SV
GA
I
S
F
DVT
H
S
SVDI-PD
---
DKRY-Q
G
Q
SYR
I
S
WN
K
F
F
DLTD
T
S
L
NI
A
A
-
YR
Y
S
T
Q
D
Y
L
G
LND
A
L
TL
-----
I
D
EVEHPTQDLDPKT
MRN
---------------------
YGHM
---------
K
NQFT
V
SIN
Q
P
------------
L
RFGK
DDYG-
S
F
Y
TSGSWSD
YW
GGGKSRS-
N
YSIGYS---NSASW
G
S
Y
S
I
S
AQR
------
SWDEYSQTENSI
YL
SF
S
I
P
IE
----
KLMGTEHRD
S
GF
QS--ID
T
QLSTNM
D
GNNQFNM
S
SS
G
YSN
-
D
NR
-
V
S
Y
S
V
N
A
G
YGMNKTGKDLSN
--
IGGYAS
Y
ESPW
G
TLAG
S
V
S
AT
--
SDN
N
R
Q
YSVNTD
G
G
F
V
L
H
SG
G
L
T
F
S
NDSFS
D
NDTIA
L
V
K
AP
G
AKGAR
I
-
-NYGNNT
VD
RW
G
Y
G
V
T
SA
L
S
P
Y
QE
N
KIA
L
D
TEN
L
E
ND
I
EMKS
T
STVAVP
R
Q
G
S
V
I
FAG
F
ETNQ
G
QSAIMN
I
KRS
DG
KG
I
P
F
A
A
---
D
I
HD
E
TNA
---
-II
G
N
V
G
Q
G
G
Q
A
FV
R
G
I
QQQGTIK
I
T
W
LEAS
M
PKT
C
I
A
QYQQPNVTSEKIQQ
T
I
IL
fig|701177.3.peg.4309
Escherichia coli O55:H7 str. CB9615 (14-850/857)
SISV
V
AVAVASTF
------
S
AHAGK-
-
FN
PKF
L
E
DV
-
Q
GVGQH-
V
D
LTMFEKG
Q
EQQ
L
P
G
I
Y
R
V
SVY
VN
EQRMETR---T
L
EFKEATEAQRKA
MG
E
SLV
-
-P
CL
S
RTQ
L
AEM
G
VRVESFPA
---
L
NLVSA
E-------A
C
VPFDEIIPLASSHF
D
FSEQK
L
V
L
S
F
PQA
A
M
HQVARG
T
V
P
E
S
L
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NSEYDSTG
-
SS
SSY
VDDNGTVHHDDGKDTLKS
DSYY
LN
L
RS
G
L
N
L
GAWRLR
N
Y
STWSHSGG-----
--------
--KAQWDNIGT
S
L
S
R
A
I
IPFK
A
Q
LT
M
G
D
TATA
G
D
IFDS
V
Q
MR
G
AM
L
A
SD
E
E
MLP
DS
Q
RG
F
AP
I
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TY
V
Q
PG
A
F
E
I
N
DL
YP
T
ANS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
S
V
P
I
F
Q
R
E
G
HL
K
Y
SFAA
G
E
YQA-GNYDSASPR
F
GQL
D
LI
Y
G
LP
W
GM
TA
YGG
VLI
S
NN-
Y
N
A
F
A
L
G
I
G
K
N
FGYI
GA
I
S
I
DVT
Q
A
KSEL-NN
---
DRDS-Q
G
Q
SYR
F
L
YSK
S
F
-ESG
T
D
FR
L
AG
-
YR
Y
S
T
S
G
F
Y
T
FQE
A
T
--------
D
VRSDADSDYNR--
------------------------
YHKR
---------
S
EIQG
N
-LT
Q
Q
------------
L
----
GAYG-
S
V
Y
LNLTQQD
YW
NDAGKQN-
T
VSAGYN---GRIGK
VS
Y
S
I
A
YSW
-----
N
KSPEWDESDRLW
SF
NI
S
V
P
LG
----
R
AW
---------
----SN
Y
RVTTDQ
D
GRTNQQV
G
VS
G
TLL
E
D
RN
-
L
S
Y
S
V
Q
E
G
YASNG---VGNS
--
GNANVG
Y
QGGS
G
NVNV
G
Y
S
YG
--
KD-
Y
R
Q
LNYSVR
G
G
V
I
V
H
SE
G
V
TLS
QPL--
-
GETMT
L
I
S
V
P
G
ARNAR
V
-
VNNGGVQ
VD
WM
G
N
A
I
V
PY
AM
P
Y
RE
N
EIS
L
R
SDS
L
G
DD
V
DVEN
A
FQKVVP
T
R
GAI
V
RAR
F
DTRV
G
YRVLMT
L
LRS
A
G
SP
V
P
F
G
A
TAT
L
I
TD
K
QNE
---
-VS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGRVL
I
K
W
GNDA
-
SQQ
C
V
A
PYKLSLELKQGGII
fig|155864.1.peg.4437
Escherichia coli O157:H7 EDL933 (14-850/857)
SISV
V
AVAVASTF
------
S
AHAGK-
-
FN
PKF
L
E
DV
-
Q
GVGQH-
V
D
LTMFEKG
Q
EQQ
L
P
G
I
Y
R
V
SVY
VN
EQRMETR---T
L
EFKEATEAQRKA
MG
E
SLV
-
-P
CL
S
RTQ
L
AEM
G
VRVESFPA
---
L
NLVSA
E-------A
C
VPFDEIIPLASSHF
D
FSEQK
L
V
L
S
F
PQA
A
M
HQVARG
T
V
P
E
S
L
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NSEYDSTG
-
SS
SSY
VDDNGTVHHDDGKDTLKS
DSYY
LN
L
RS
G
L
N
L
GAWRLR
N
Y
STWSHSGG-----
--------
--KAQWDNIGT
S
L
S
R
A
I
IPFK
A
Q
LT
M
G
D
TATA
G
D
IFDS
V
Q
MR
G
AM
L
A
SD
E
E
MLP
DS
Q
RG
F
AP
I
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TY
V
Q
PG
A
F
E
I
N
DL
YP
T
ANS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
X
PF
S
S
V
P
I
F
Q
R
E
G
HL
K
Y
SFAA
G
E
YQA-GNYDSASPR
F
GQL
D
LI
Y
G
LP
W
GM
TA
YGG
VLI
S
NN-
Y
N
A
F
T
L
G
I
G
K
N
FGYI
GA
I
S
I
DVT
Q
A
KSEL-NN
---
DRDS-Q
G
Q
SYR
F
L
YSK
S
F
-ESG
T
D
FR
L
AG
-
YR
Y
S
T
S
G
F
Y
T
FQE
A
T
--------
D
VRSDADSDYNR--
------------------------
YHKR
---------
S
EIQG
N
-LT
Q
Q
------------
L
----
GAYG-
S
V
Y
LNLTQQD
YW
NDAGKQN-
T
VSAGYN---GRIGK
VS
Y
S
I
A
YSW
-----
N
KSPEWDESDRLW
SF
NI
S
V
P
LG
----
R
AW
---------
----SN
Y
RVTTDQ
D
GRTNQQV
G
VS
G
TLL
E
D
RN
-
L
S
Y
S
V
Q
E
G
YASNG---VGNS
--
GNANVG
Y
QGGS
G
NVNV
G
Y
S
YG
--
KD-
Y
R
Q
LNYSVR
G
G
V
I
V
H
SE
G
V
TLS
QPL--
-
GETMT
L
I
S
V
P
G
ARNAR
V
-
VNNGGVQ
VD
WM
G
N
A
I
V
PY
AM
P
Y
RE
N
EIS
L
R
SDS
L
G
DD
V
DVEN
A
FQKVVP
T
R
GAI
V
RAR
F
DTRV
G
YRVLMT
L
LRS
A
G
SP
V
P
F
G
A
TAT
L
I
TD
K
QNE
---
-VS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGRVL
I
K
W
GNDA
-
SQQ
C
V
A
PYKLSLELKQGGII
fig|362663.8.peg.3815
Escherichia coli 536 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSASINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
PS
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
I
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
E
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
L
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAA
I
-
NNASGSR
L
D
FW
G
N
G
I
V
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|362663.9.peg.3829
Escherichia coli 536 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSASINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
PS
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
I
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
E
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
L
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAA
I
-
NNASGSR
L
D
FW
G
N
G
I
V
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|525281.3.peg.1568
Escherichia coli 83972 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINV
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSASINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
N
S
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
I
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFSAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
I
V
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|655817.3.peg.5064
Escherichia coli ABU 83972 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINV
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSASINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
N
S
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
I
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFSAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
I
V
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|749546.3.peg.3032
Escherichia coli MS 185-1 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINV
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSASINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
N
S
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
I
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFSAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
I
V
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|749528.3.peg.1477
Escherichia coli MS 45-1 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINV
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSASINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
N
S
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
I
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFSAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
I
V
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|340197.3.peg.849
Escherichia coli F11 (29-857/863)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSARINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
L
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
VV
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|340197.5.peg.891
Escherichia coli F11 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSARINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
L
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
VV
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|749550.3.peg.1517
Escherichia coli MS 200-1 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSARINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
L
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
VV
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|869729.3.peg.4667
Escherichia coli UM146 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSARINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
L
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
VV
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|364106.7.peg.4774
Escherichia coli UTI89 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSARINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
L
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
VV
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|364106.8.peg.4773
Escherichia coli UTI89 (10-838/844)
I
YCRCSLLL
F
AALGLTVT
----
N
H
S
FAAEEA
EF
D
SEF
L
H
L
---
DKGINA
ID
IRRFSHG
N
PVP
-
E
G
R
Y
Y
S
DIY
VN
NVWKGKA---D
L
QYLRTANTGAPT
-------
-L
CLT
PEL
L
SLI
D
LVKDTMSG
---------
------NTS
C
FPASTGLSSARINF
D
LSTLR
L
N
I
E
I
PQA
L
L
NTRPRG
Y
I
S
P
A
Q
W
QS
G
VPA
AFI
N
Y
D
AN
Y
Y
--
------QY
-
SS
SGT
S
----------------
N
EQT
Y
LG
L
KA
G
F
N
L
WG
W
A
LR
HR
GSESWNNS-----
--------
-YPAGYQNIET
S
I
M
H
D
L
APLR
A
Q
F
TLG
D
FYTN
G
E
L
M
DS
L
S
LR
G
VR
L
A
SD
E
R
MLP
GS
LRGYAP
A
V
R
GIA
N
S
NA
K
V
T
I
Y
Q
N
AH
IL
Y
E
TT
VP
A
GPF
V
I
N
DL
YP
S
GYA
-
G
D
L
L
V
K
I
T
E
S
N
G
QTRM
F
T
V
PF
A
A
V
A
Q
L
I
R
P
G
FS
R
W
QMSV
GK
YRY-ANKTYND-L
I
AQG
T
YQ
Y
G
LT
N
DI
T
L
N
S
G
LTT
A
SG-
Y
T
A
G
L
A
G
L
A
F
N
T-PL
GA
IA
S
D
I
T
L
S
RTAFRYS
---
GVTR-K
G
Y
S
LH
S
S
YS
I
N
I
PASN
T
N
IT
L
A
A
-
YR
Y
S
S
K
D
F
Y
H
LKD
A
L
SANHNAFI
D
DVSVKSTAF----
------------------------
YRPR
---------
-
NQFQ
I
SIN
Q
E
------------
L
---
G
EKWG-
G
M
Y
LTGTTYN
YW
GHKGSRN-
E
YQMGYS---NFWKQ
L
G
Y
Q
I
G
LSQ
-----
S
RDNEQQRRDDRF
Y
I
NF
T
LP
LG
-------------
G
S
V
QSPVFS
T
VLNYSK
E
EKNSIQT
S
IS
G
TGG
E
D
NQ
-
F
S
Y
G
I
S
G
N
SQENGPSGYAMN
--
G----G
Y
RSPY
V
NITT
T
V
G
HD
--
TQN
N
N
Q
RSFGAS
G
A
V
V
A
H
PY
G
V
TLS
NDL--
-
SDTFA
I
I
H
A
E
G
AQGAV
I
-
NNASGSR
L
D
FW
G
N
G
VV
PY
VT
P
Y
EK
N
QIS
I
D
PSN
L
D
LN
V
ELSA
T
EQEIIP
RAN
S
AT
LVK
F
DTKT
G
RSLLFD
I
RMS
T
G
NP
PP
M
A
S
---
E
V
LD
E
HGQ
---
-LA
G
Y
V
A
Q
A
G
K
V
F
TR
G
L
PEKGHLS
V
V
W
GPDN
-
KDR
C
S
F
VYHVAHNKDDMQSQ
LV
P
V
LC
IQ
H
fig|340185.3.peg.49
Escherichia coli E22 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|340185.4.peg.52
Escherichia coli E22 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|585034.4.peg.3629
Escherichia coli IAI1 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|585034.5.peg.3626
Escherichia coli IAI1 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|585395.4.peg.4914
Escherichia coli O103:H2 str. 12009 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|585396.4.peg.4519
Escherichia coli O111:H- str. 11128 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|573235.3.peg.5210
Escherichia coli O26:H11 str. 11368 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|1040638.4.peg.2193
Escherichia coli O104:H4 str. LB226692 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|6666666.5357.peg.3663
Escherichia coli TY-2482 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|585055.6.peg.4033
Escherichia coli 55989 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|585055.8.peg.4036
Escherichia coli 55989 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|595495.4.peg.3877
Escherichia coli KO11 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|679207.4.peg.242
Escherichia coli MS 107-1 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|566546.3.peg.4195
Escherichia coli W (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|566546.4.peg.3777
Escherichia coli W (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|562.375.peg.476
Escherichia coli EC4100B (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRL
L
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
N
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
RN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|550672.3.peg.3605
Escherichia coli B088 (16-831/843)
SFSL
L
ALTIASSL
------
P
AYGGK-
-
FN
PKF
L
E
N
V
-
Q
GIDQH-
ID
LSVYDSP
V
GQQ
I
P
G
K
Y
R
V
SVF
VN
EEKMASR---T
L
DFSTASEAKRKA
SG
E
SLM
-
-P
CL
S
RVQ
L
EEM
G
VRVDSFPA
---
LKMSPP
E-------A
C
VAFDEIIPQATSRF
D
FNTQT
L
H
L
S
F
PQA
A
M
MMTARG
T
V
D
PS
R
W
DE
GI
PA
LLL
DY
S
FS
G
S
--
NGRNEGTG
-
SS
SDS
T
----------------
S
DSYY
LN
L
RS
G
L
N
V
G
P
WRLR
NN
SIWNRTDG-----
--------
--KNQWDNVGT
S
L
N
R
A
I
IPLK
S
Q
I
TLG
D
TATP
G
E
IFDS
V
Q
MR
G
AL
L
A
SDD
E
MLP
DS
Q
RG
F
AP
V
V
R
GIA
K
S
NA
E
V
S
I
E
Q
N
G
Y
VIY
R
TF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
-
G
D
L
T
V
I
I
K
E
A
DG
SEQR
F
I
Q
PF
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SLAA
G
E
YRA-GNYDSDKPH
F
GQF
T
AM
Y
G
LP
W
GM
TA
YGG
ALL
S
AD-
Y
S
A
L
A
L
G
L
G
K
N
FGTI
GA
V
S
V
DVT
Q
A
KSQL-RN
---
NEKE-E
G
Q
SYR
F
L
YSK
S
F
-EGG
T
D
L
R
L
L
G
-
Y
K
Y
S
T
S
G
Y
Y
T
FQE
A
T
--------
D
VRSDADSDYRR--
------------------------
YHKR
---------
S
QIQG
N
-IT
Q
Q
------------
L
----
GDYG-
S
V
Y
FNMTQQD
YW
NVDGKEN-
S
LSAGYH---GHIGR
V
N
Y
S
V
A
YTW
-----
T
RSPEWEEDDRLW
SF
SV
S
I
P
LG
----
GAW
---------
----SS
Y
RMTTDQ
N
GKTSQQA
S
VS
G
TLL
E
D
CN
-
L
S
Y
N
V
Q
Q
G
YTSNG---VGYS
--
GSVNMG
Y
MGGS
G
NIDV
G
Y
N
YS
--
KD-
N
Q
Q
VNYGVR
G
G
V
I
V
H
SE
G
I
TLS
QPL--
-
GESLA
I
V
S
AP
G
ARGGH
V
-
VNSSGVE
VD
WM
G
N
AVV
PY
LT
P
Y
RE
T
IVE
L
R
SDT
L
G
QN
V
ELQE
A
FQKVVP
T
R
GA
VV
RSR
F
DTRV
G
YRVLMS
L
KQA
N
G
NA
V
P
F
G
A
TAA
L
I
-D
E
SKP
---
-AS
S
I
V
G
E
E
G
Q
L
Y
I
SG
M
PEEGELQ
V
S
W
GNEQ
-
AQR
C
R
V
PFRLP-ENKDN
fig|340186.3.peg.3387
Escherichia coli E110019 (2-818/830)
KMK
QNRLCLLA
V
CTLLLSHK
------
-
--SGAV
S
F
D
PSL
L
A
----
GASGE-
S
D
LSRFSEN
N
AMP
-
A
G
S
Q
E
M
DIY
VN
GSWKGRY---T
V
IYGEQRDDIR--
-------
--
-
I
A
WKD
A
RSL
G
INTTSVPA
---
PAIAHG
Q--------
-
VQLRDLVQGGEVKT
D
TSTLS
L
A
L
T
V
PQA
A
V
LRTEEG
Y
I
A
P
Q
F
W
DE
GI
PA
LML
S
W
N
TT
W
Y
--
NTRAKGAA
-
KD
TN-
-
-----------------
D
DF
Y
AG
L
DS
G
A
N
L
FG
W
Q
F
R
DS
SAWRKTAS-----
--------
-GESSWQNNTR
YL
R
R
P
L
ASLK
S
N
LTLG
D
FYIP
G
D
L
FDS
L
R
VR
G
VS
L
A
SD
M
K
M
R
P
NS
QQ
G
F
S
P
V
V
H
G
V
A
R
TNA
L
V
K
V
I
Q
N
G
N
VIYQ
EN
VPPG
Q
F
T
L
D
S
I
QP
T
GSA
-
G
D
L
L
V
V
V
R
E
A
DG
SQQS
F
T
V
PF
S
A
V
P
G
M
L
K
E
G
VS
Q
Y
SVVA
GK
VHQ--NTLDAEPA
F
MQA
T
LR
Y
G
FN
N
LI
T
G
Y
T
G
TII
S
DN-
Y
Q
A
G
L
V
G
T
G
W
N
L-PF
GA
V
S
F
DVT
H
A
KTTL--Q
---
DRTS-S
G
Q
SYR
V
S
YSK
F
I
DTTA
T
N
F
T
L
A
A
-
YR
Y
S
T
K
G
Y
Y
S
FSD
A
L
YS
-----
R
E
GYQRLRAQYDD--
------------------------
YEDR
FGVAPDMSL
S
TWDA
L
RAA
Q
P
KNTFTLNLNQRL
L
----
NNWG-
T
V
F
VSGTHRD
YW
NSQQTTR-
E
YQMGYS---NAIGR
A
S
Y
T
L
S
ASR
-----
V
RNRDSEEETRLY
-
L
SL
SLP
FA
LFDNN
AW
---------
----IT
S
SLTASD
S
HYEQSNI
S
MS
G
NAL
AS
NR
-
L
S
Y
T
L
S
G
S
NTRGG----ENA
--
ASVNAA
Y
RSNF
A
TLGG
S
Y
S
ES
--
TD-
Y
R
Q
TGLSGR
GS
L
V
A
Y
PW
H
V
LA
S
NET--
-
GTTMT
I
V
D
AP
K
AEGLM
V
-
NGDESIM
TN
RD
G
V
A
L
V
PY
A
T
P
Y
RK
N
AIT
L
T
ETE
NS
AG
A
EVIG
N
MANVAP
Y
D
GA
V
S
YIR
F
ETDK
R
QSWVLH
A
TRA
DG
KP
L
P
F
G
T
---
E
V
LD
E
HGE
---
-SV
G
Y
V
G
Q
A
S
V
L
Y
I
RAE
RPPRALN
V
H
L
---R
-
GGK
C
E
I
SSP
fig|340186.5.peg.3534
Escherichia coli E110019 (2-818/830)
KMK
QNRLCLLA
V
CTLLLSHK
------
-
--SGAV
S
F
D
PSL
L
A
----
GASGE-
S
D
LSRFSEN
N
AMP
-
A
G
S
Q
E
M
DIY
VN
GSWKGRY---T
V
IYGEQRDDIR--
-------
--
-
I
A
WKD
A
RSL
G
INTTSVPA
---
PAIAHG
Q--------
-
VQLRDLVQGGEVKT
D
TSTLS
L
A
L
T
V
PQA
A
V
LRTEEG
Y
I
A
P
Q
F
W
DE
GI
PA
LML
S
W
N
TT
W
Y
--
NTRAKGAA
-
KD
TN-
-
-----------------
D
DF
Y
AG
L
DS
G
A
N
L
FG
W
Q
F
R
DS
SAWRKTAS-----
--------
-GESSWQNNTR
YL
R
R
P
L
ASLK
S
N
LTLG
D
FYIP
G
D
L
FDS
L
R
VR
G
VS
L
A
SD
M
K
M
R
P
NS
QQ
G
F
S
P
V
V
H
G
V
A
R
TNA
L
V
K
V
I
Q
N
G
N
VIYQ
EN
VPPG
Q
F
T
L
D
S
I
QP
T
GSA
-
G
D
L
L
V
V
V
R
E
A
DG
SQQS
F
T
V
PF
S
A
V
P
G
M
L
K
E
G
VS
Q
Y
SVVA
GK
VHQ--NTLDAEPA
F
MQA
T
LR
Y
G
FN
N
LI
T
G
Y
T
G
TII
S
DN-
Y
Q
A
G
L
V
G
T
G
W
N
L-PF
GA
V
S
F
DVT
H
A
KTTL--Q
---
DRTS-S
G
Q
SYR
V
S
YSK
F
I
DTTA
T
N
F
T
L
A
A
-
YR
Y
S
T
K
G
Y
Y
S
FSD
A
L
YS
-----
R
E
GYQRLRAQYDD--
------------------------
YEDR
FGVAPDMSL
S
TWDA
L
RAA
Q
P
KNTFTLNLNQRL
L
----
NNWG-
T
V
F
VSGTHRD
YW
NSQQTTR-
E
YQMGYS---NAIGR
A
S
Y
T
L
S
ASR
-----
V
RNRDSEEETRLY
-
L
SL
SLP
FA
LFDNN
AW
---------
----IT
S
SLTASD
S
HYEQSNI
S
MS
G
NAL
AS
NR
-
L
S
Y
T
L
S
G
S
NTRGG----ENA
--
ASVNAA
Y
RSNF
A
TLGG
S
Y
S
ES
--
TD-
Y
R
Q
TGLSGR
GS
L
V
A
Y
PW
H
V
LA
S
NET--
-
GTTMT
I
V
D
AP
K
AEGLM
V
-
NGDESIM
TN
RD
G
V
A
L
V
PY
A
T
P
Y
RK
N
AIT
L
T
ETE
NS
AG
A
EVIG
N
MANVAP
Y
D
GA
V
S
YIR
F
ETDK
R
QSWVLH
A
TRA
DG
KP
L
P
F
G
T
---
E
V
LD
E
HGE
---
-SV
G
Y
V
G
Q
A
S
V
L
Y
I
RAE
RPPRALN
V
H
L
---R
-
GGK
C
E
I
SSP
fig|340185.3.peg.3801
Escherichia coli E22 (6-800/817)
TISA
L
---ILTST
--
INAGD
SFAKEY
K
FN
YSY
L
-
----
GMDEN-
A
D
LNFFDSK
-
-TT
-
Q
G
N
Y
V
V
DIY
I
N
NELKETT---E
I
YF---RN-----
K
GNNLV
-
-P
CLT
QEN
L
IKY
G
FLQKKIDE
---
FIFDDE
Q--------
C
VNLD--KENIKYLF
N
PTNQI
L
L
L
N
I
P
SG
F
L
ADKNSE
I
A
D
E
S
L
W
DD
G
L
N
A
LIF
N
Y
Q
A-
-
-
--
N-----YL
-
K
S
NNR
R
----------------
G
DSY
F
GQ
I
EP
G
L
N
V
G
P
WR
I
R
N
L
STWKKSKE-----
--------
-NTD-FESAYT
Y
A
E
R
G
I
NSLK
S
R
L
I
I
G
D
KYTN
T
N
IFDS
I
S
FR
G
VT
F
N
K
D
E
N
M
I
P
YS
A
R
A
Y
S
P
K
I
R
GIA
K
T
Q
A
V
V
E
I
R
Q
Q
G
Y
L
L
Y
S
TS
VPPG
E
F
E
I
N
SN
QF
S
NFG
S
G
L
F
D
V
T
I
I
E
S
N
G
QKQM
Y
S
V
PY
TI
P
V
I
S
L
A
K
G
YS
N
Y
SFTA
GK
YRN-ADIHKNEPI
F
AEG
T
YS
Y
G
LP
Y
GL
S
IF
GG
VQL
A
DI-
Y
S
S
Y
A
V
G
I
S
K
D
VGEY
GA
I
S
F
D
M
K
Y
A
KSKPYEK
---
TSFI-N
G
S
A
Y
G
I
K
Y
T
K
N
F
NTTN
T
D
I
S
I
A
NY
Y
N
Y
S
K
D
-
Y
R
T
LSE
T
I
--------
D
SYNDHVY------
------------------------
YSKK
---------
S
TTYA
M
-LS
Q
P
------------
L
----
GAWG-
S
I
N
LSYNHDN
Y
-
WEKNGSN-
S
IAVWYG---KNIGS
T
S
L
S
L
S
YTR
---
TAF
KKYGKNDNEDLF
N
I
ML
NI
P
LQ
----
DL
---
TNKEIY
-
----AN
Y
QLTSSS
D
NKTTHDI
G
LN
G
MAF
-
D
RR
-
M
SW
Q
V
R
E
-
QIQEASK-YKKF
--
SYFNAS
W
NGTY
G
TLGA
N
Y
N
YS
--
ST-
H
R
E
VGLALS
G
G
I
L
A
H
SS
G
I
T
F
G
QRI--
-
SNTTA
L
V
E
A
K
G
VSGAT
V
-
LGLPGIK
T
D
FR
G
Y
TFS
SS
L
M
P
Y
MD
N
TVS
I
D
PSS
LP
NN
S
SIKQ
T
DIKVVP
T
T
GAI
V
KAK
Y
NTSI
G
VNALIK
I
SNK
K
G
KH
L
P
F
G
T
ILA
-
-
-V
K
DEK
GVV
QST
S
I
V
G
D
N
G
E
AY
VT
G
L
DGTQEIN
A
T
W
GREA
-
SDS
C
K
V
SYNLT
fig|340185.4.peg.4000
Escherichia coli E22 (4-798/815)
TISA
L
---ILTST
--
INAGD
SFAKEY
K
FN
YSY
L
-
----
GMDEN-
A
D
LNFFDSK
-
-TT
-
Q
G
N
Y
V
V
DIY
I
N
NELKETT---E
I
YF---RN-----
K
GNNLV
-
-P
CLT
QEN
L
IKY
G
FLQKKIDE
---
FIFDDE
Q--------
C
VNLD--KENIKYLF
N
PTNQI
L
L
L
N
I
P
SG
F
L
ADKNSE
I
A
D
E
S
L
W
DD
G
L
N
A
LIF
N
Y
Q
A-
-
-
--
N-----YL
-
K
S
NNR
R
----------------
G
DSY
F
GQ
I
EP
G
L
N
V
G
P
WR
I
R
N
L
STWKKSKE-----
--------
-NTD-FESAYT
Y
A
E
R
G
I
NSLK
S
R
L
I
I
G
D
KYTN
T
N
IFDS
I
S
FR
G
VT
F
N
K
D
E
N
M
I
P
YS
A
R
A
Y
S
P
K
I
R
GIA
K
T
Q
A
V
V
E
I
R
Q
Q
G
Y
L
L
Y
S
TS
VPPG
E
F
E
I
N
SN
QF
S
NFG
S
G
L
F
D
V
T
I
I
E
S
N
G
QKQM
Y
S
V
PY
TI
P
V
I
S
L
A
K
G
YS
N
Y
SFTA
GK
YRN-ADIHKNEPI
F
AEG
T
YS
Y
G
LP
Y
GL
S
IF
GG
VQL
A
DI-
Y
S
S
Y
A
V
G
I
S
K
D
VGEY
GA
I
S
F
D
M
K
Y
A
KSKPYEK
---
TSFI-N
G
S
A
Y
G
I
K
Y
T
K
N
F
NTTN
T
D
I
S
I
A
NY
Y
N
Y
S
K
D
-
Y
R
T
LSE
T
I
--------
D
SYNDHVY------
------------------------
YSKK
---------
S
TTYA
M
-LS
Q
P
------------
L
----
GAWG-
S
I
N
LSYNHDN
Y
-
WEKNGSN-
S
IAVWYG---KNIGS
T
S
L
S
L
S
YTR
---
TAF
KKYGKNDNEDLF
N
I
ML
NI
P
LQ
----
DL
---
TNKEIY
-
----AN
Y
QLTSSS
D
NKTTHDI
G
LN
G
MAF
-
D
RR
-
M
SW
Q
V
R
E
-
QIQEASK-YKKF
--
SYFNAS
W
NGTY
G
TLGA
N
Y
N
YS
--
ST-
H
R
E
VGLALS
G
G
I
L
A
H
SS
G
I
T
F
G
QRI--
-
SNTTA
L
V
E
A
K
G
VSGAT
V
-
LGLPGIK
T
D
FR
G
Y
TFS
SS
L
M
P
Y
MD
N
TVS
I
D
PSS
LP
NN
S
SIKQ
T
DIKVVP
T
T
GAI
V
KAK
Y
NTSI
G
VNALIK
I
SNK
K
G
KH
L
P
F
G
T
ILA
-
-
-V
K
DEK
GVV
QST
S
I
V
G
D
N
G
E
AY
VT
G
L
DGTQEIN
A
T
W
GREA
-
SDS
C
K
V
SYNLT
fig|340184.3.peg.4476
Escherichia coli B7A (1-813/822)
M
V
E
R
V
K
YTMKE
L
NKKYTILC
M
AWLMLISS
----
NT
S
FAKNKY
EF
D
TEL
I
N
SVDK
DLSTDV
L
N
FNPFP--
-
---
-
A
G
D
Y
V
V
DIF
I
N
GNYRLTH---S
V
VFVKNNN-----
EHNELS
-
-P
CL
D
EKL
L
LAL
G
IKKNLIKT
---
VDTCSN
Q--------
-
-----NNENWLFKS
N
LYEQS
L
N
I
T
I
P
E
V
D
L
DKTIDG
V
A
P
KI
T
W
DD
G
V
N
A
FLL
N
Y
K
GN
L
S
--
NINNKKLH
-
D
N
QNY
A
--------------------
Y
LD
L
SP
G
F
N
F
GAWR
F
R
N
K
TFFNYVNS-----
--------
-RESKWQNVNN
Y
F
E
R
G
I
KEFN
S
R
F
TLG
D
FLTK
N
D
L
F
S
S
S
S
LR
G
VA
L
S
T
D
E
L
M
I
P
GR
L
K
DN
S
P
L
I
R
GIA
K
T
Q
A
R
V
E
V
E
Y
N
G
YI
IY
T
KT
V
D
A
G
V
F
E
I
N
DL
PN
M
GAA
-
G
E
Y
K
V
T
V
F
E
T
DG
TKNV
I
L
I
PF
IR
S
P
L
S
L
K
K
G
FS
K
Y
AVSF
G
R
YRR-NSDKNAGPP
I
FDA
G
YS
Y
G
LN
E
FI
T
TSTS
VQV
S
NI-
Y
E
A
Y
A
V
G
L
S
L
S
LGSF
GALS
L
E
G
S
N
A
NAREYDK
---
KKAEKK
G
E
A
SI
L
K
YSK
G
F
NSID
A
D
L
Y
F
N-
-
H
GH
N
S
R
G
Y
K
S
LN-
--
--------
D
VYSTLDSEAID--
------------------------
SSGR
---------
K
NSTS
I
GIN
K
L
------------
I
----
NNYG-
R
L
L
FSYNLDH
YW
DGSRNE--
Y
IDVSFD---GVLKE
IA
Y
S
L
G
YT-
-----
A
NSDKYSKNNHVF
S
L
SI
NI
P
FK
RENERFIS
--------
----AG
Y
KYNNSK
S
QGEYHTV
G
FS
G
AEY
-
N
NT
-
L
SW
N
I
R
Q
K
YSNNN--YYGVS
--
GNTTLR
Y
Q--L
G
YLGI
G
A
S
TE
--
RD-
G
S
S
YNADIG
G
G
L
V
Y
S
EH
G
L
T
F
G
QEI--
-
TQSSA
IL
V
A
K
G
AVGVP
V
-
TGTIGVR
TN
LQ
G
R
A
L
V
TG
L
Q
P
Y
RE
N
ILS
L
D
PLE
T
P
DD
I
EILQ
S
DIKVIP
T
N
GAI
V
EGK
F
RTSE
G
QKTLVR
I
TTS
D
N
KN
I
P
F
G
S
IVT
L
K
GS
Y
NNA
---
---
G
I
V
G
D
N
G
E
V
F
L
T
G
L
PESGILQ
I
K
W
GNDN
-
NSS
C
S
V
NFKKLSN
fig|749546.3.peg.1803
Escherichia coli MS 185-1 (10-850/850)
LSA
L
YIAVLSSL
PFFFCAD
VAARSY
T
F
E
PSM
L
-
----
NVDGND
ID
LSIFESG
-
-AQ
L
P
G
T
Y
Y
V
DIM
L
N
GKLVDTK---E
M
EFSRERN-----
K
DGEFVL
SS
CLT
QSM
L
NRY
G
VKVGDYPE
---
L
FVNSS
N-----GKV
C
GDLSVIPGAFSY-F
D
FYNQQ
L
N
L
S
I
P
NV
A
L
YPKYKG
I
A
SE
E
L
W
DN
GI
N
A
FLM
N
Y
Q
A-
-
-
--
NAQINQYR
-
N
K
KNR
E
----------------
VS
SY
W
AR
I
EP
G
M
N
I
G
S
WR
I
R
N
L
TTFTKENG-----
--------
-NSEKRESVYT
Y
A
E
R
G
L
TSIK
S
N
L
L
I
GE
SYTN
S
D
IFDS
I
S
FR
G
IM
L
H
SD
E
S
M
V
P
YS
KYA
F
AP
V
I
R
GIA
Q
S
Q
A
L
I
E
V
R
Q
N
G
Y
L
I
H
T
VS
V
A
PG
A
F
E
I
S
DL
PV
T
GSG
-
G
D
L
Q
V
S
V
I
E
T
N
G
KNQS
F
T
V
PY
T
T
P
V
I
A
L
R
E
N
YL
K
Y
SLVG
G
M
YRS-AYSGVDNTA
L
IQM
T
AM
Y
G
MP
W
NL
T
T
F
V
G
FQG
S
EH-
Y
N
S
V
A
T
G
V
G
L
S
MGDM
GA
I
S
L
D
GI
Y
A
RGQK-EK
---
QNKE-D
G
Y
S
W
R
V
R
YSK
V
F
DITG
T
N
F-I
A
A
S
HQ
Y
S
S
D
G
Y
Q
T
LSD
V
L
--------
D
TYGHSNYSYGG--
------------------------
YANR
---------
S
MRNS
L
TIS
Q
S
------------
M
----
GEWG-
T
F
S
FGGVRDE
Y
R
GNRSPQN-
S
INALYS---NSMEW
G
T
L
S
L
N
WSQ
NKITDS
SRSVKDKKENIF
SF
WV
S
I
P
LY
----
RLLGNTSNNIN
-
----AT
T
QIQKYD
N
QKMQYEF
G
MN
G
RAF
-
N
RQ
-
L
Y
W
D
I
S
Q
-
RLAPGNENYNDA
--
SRLNLE
W
YGTY
G
QIRG
G
Y
G
YS
--
DS-
L
R
Q
MNAGIS
G
T
A
I
V
H
SN
G
V
T
F
G
QKQ--
-
GGTIA
L
V
E
A
Q
G
VDGAE
V
-
IGWPGVK
T
D
FR
G
Y
TA
L
GH
LT
P
Y
QE
N
TVS
L
N
PAS
F
P
EY
A
EVLQ
T
DTKVIP
T
K
GA
VV
SAR
F
KTSI
G
KKALFK
L
TRH
DG
KK
V
P
F
G
A
VVS
S
A
TA
D
DNK
RVV
---
G
I
V
N
E
S
G
E
V
Y
M
SG
L
SEKGQLD
V
K
W
--NS
-
HGS
C
K
A
VYKLSDNKSIVNIY
NASL
TC
I
Consen1
Primary consensus
MYKKLKLTTiSe
IKN
----
i
l
------
a
eFn
l
----
id
g
-
G
Y
v
vN
w
-------
Clt
v
G
---------
C
D
L
is
PQA
l
w
pPs
W
Giag
dYn
a
--
-
ss
s
---------------------
g
G
N
GaWRlRsd
--------
yl
R
i
s
ltlGe
s
ifds
s
G
l
sDd
MLP
lrGyAP
i
GiA
tnA
V
i
q
grviYq
VppGpF
I
dl
s
-
G
L
V
v
E
dG
f
v
as
p
l
R
G
rY
Gk
F
e
wG
n
slYgG
s
Y
s
a
G
G
n
Gals
DvT
s
---
G
SyR
nYsK
f
s
itfAg
-
YRfS
r
f
t
yl
--------
d
------------------------
---------
k
v
q
------------
----
n
y
yW
n
vs
s
s
------
yl
SlP
----------------
Y
s
s
d
-
d
-
sw
vs
g
--
h
g
s
t
--
y
s
Gs
t
t
G
afh
-
mv
tdG
v
-
tn
G
aVv
is
Y
t
vd
Lp
v
s
teGAig
l
G
i
dG
pPlG
---
v
e
---
GmV
e
G
aylsGv
v
W
-
C
i
ilPc
qnH
Consen2
Secondary consensus
l
h
lsg
h
d
i
n
vn
n
l
i
l
kaekklp
in
l
lkmspp
l
m
y
d
e
vpa
n
s
g
t
ndsyy
l
p
f
k
sa
l
a
fs
g
lyrg
v
t
e
qq
f
v
v
ss
v
h
nyil
qa
a
ni
t
i
n
q
a
a
f
k
t
y
ta
s
a
l
d
ia
i
a
s
a
v
l
a
y
y
s
ai
n
s
k
s
n
w
g
t
sf
gaw
ss
g
g
en
l
hy
n
s
y
a
sl
n
q
g
v
h
tls
i
ap
vd
g
i
n
g
a
i
vv
f
l
v
f
i
d
vwi
l
lv
v
iq
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character